Serveur d'exploration MERS

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Sequence characteristics define trade-offs between on-target and genome-wide off-target hybridization of oligoprobes

Identifieur interne : 001045 ( Pmc/Corpus ); précédent : 001044; suivant : 001046

Sequence characteristics define trade-offs between on-target and genome-wide off-target hybridization of oligoprobes

Auteurs : Olga V. Matveeva ; Aleksey Y. Ogurtsov ; Nafisa N. Nazipova ; Svetlana A. Shabalina

Source :

RBID : PMC:6013149

Abstract

Off-target oligoprobe’s interaction with partially complementary nucleotide sequences represents a problem for many bio-techniques. The goal of the study was to identify oligoprobe sequence characteristics that control the ratio between on-target and off-target hybridization. To understand the complex interplay between specific and genome-wide off-target (cross-hybridization) signals, we analyzed a database derived from genomic comparison hybridization experiments performed with an Affymetrix tiling array. The database included two types of probes with signals derived from (i) a combination of specific signal and cross-hybridization and (ii) genomic cross-hybridization only. All probes from the database were grouped into bins according to their sequence characteristics, where both hybridization signals were averaged separately. For selection of specific probes, we analyzed the following sequence characteristics: vulnerability to self-folding, nucleotide composition bias, numbers of G nucleotides and GGG-blocks, and occurrence of probe’s k-mers in the human genome. Increases in bin ranges for these characteristics are simultaneously accompanied by a decrease in hybridization specificity—the ratio between specific and cross-hybridization signals. However, both averaged hybridization signals exhibit growing trends along with an increase of probes’ binding energy, where the hybridization specific signal increases significantly faster in comparison to the cross-hybridization. The same trend is evident for the S function, which serves as a combined evaluation of probe binding energy and occurrence of probe’s k-mers in the genome. Application of S allows extracting a larger number of specific probes, as compared to using only binding energy. Thus, we showed that high values of specific and cross-hybridization signals are not mutually exclusive for probes with high values of binding energy and S. In this study, the application of a new set of sequence characteristics allows detection of probes that are highly specific to their targets for array design and other bio-techniques that require selection of specific probes.


Url:
DOI: 10.1371/journal.pone.0199162
PubMed: 29928000
PubMed Central: 6013149

Links to Exploration step

PMC:6013149

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Sequence characteristics define trade-offs between on-target and genome-wide off-target hybridization of oligoprobes</title>
<author>
<name sortKey="Matveeva, Olga V" sort="Matveeva, Olga V" uniqKey="Matveeva O" first="Olga V." last="Matveeva">Olga V. Matveeva</name>
<affiliation>
<nlm:aff id="aff001">
<addr-line>Biopolymer Design LLC, Acton, Massachusetts, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Ogurtsov, Aleksey Y" sort="Ogurtsov, Aleksey Y" uniqKey="Ogurtsov A" first="Aleksey Y." last="Ogurtsov">Aleksey Y. Ogurtsov</name>
<affiliation>
<nlm:aff id="aff002">
<addr-line>National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Nazipova, Nafisa N" sort="Nazipova, Nafisa N" uniqKey="Nazipova N" first="Nafisa N." last="Nazipova">Nafisa N. Nazipova</name>
<affiliation>
<nlm:aff id="aff003">
<addr-line>Institute of Mathematical Problems of Biology, RAS – the Branch of Keldysh Institute of Applied Mathematics of Russian Academy of Sciences, Pushchino, Moscow Region, Russia</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Shabalina, Svetlana A" sort="Shabalina, Svetlana A" uniqKey="Shabalina S" first="Svetlana A." last="Shabalina">Svetlana A. Shabalina</name>
<affiliation>
<nlm:aff id="aff002">
<addr-line>National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">29928000</idno>
<idno type="pmc">6013149</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6013149</idno>
<idno type="RBID">PMC:6013149</idno>
<idno type="doi">10.1371/journal.pone.0199162</idno>
<date when="2018">2018</date>
<idno type="wicri:Area/Pmc/Corpus">001045</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Corpus" wicri:corpus="PMC">001045</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Sequence characteristics define trade-offs between on-target and genome-wide off-target hybridization of oligoprobes</title>
<author>
<name sortKey="Matveeva, Olga V" sort="Matveeva, Olga V" uniqKey="Matveeva O" first="Olga V." last="Matveeva">Olga V. Matveeva</name>
<affiliation>
<nlm:aff id="aff001">
<addr-line>Biopolymer Design LLC, Acton, Massachusetts, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Ogurtsov, Aleksey Y" sort="Ogurtsov, Aleksey Y" uniqKey="Ogurtsov A" first="Aleksey Y." last="Ogurtsov">Aleksey Y. Ogurtsov</name>
<affiliation>
<nlm:aff id="aff002">
<addr-line>National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Nazipova, Nafisa N" sort="Nazipova, Nafisa N" uniqKey="Nazipova N" first="Nafisa N." last="Nazipova">Nafisa N. Nazipova</name>
<affiliation>
<nlm:aff id="aff003">
<addr-line>Institute of Mathematical Problems of Biology, RAS – the Branch of Keldysh Institute of Applied Mathematics of Russian Academy of Sciences, Pushchino, Moscow Region, Russia</addr-line>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Shabalina, Svetlana A" sort="Shabalina, Svetlana A" uniqKey="Shabalina S" first="Svetlana A." last="Shabalina">Svetlana A. Shabalina</name>
<affiliation>
<nlm:aff id="aff002">
<addr-line>National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America</addr-line>
</nlm:aff>
</affiliation>
</author>
</analytic>
<series>
<title level="j">PLoS ONE</title>
<idno type="eISSN">1932-6203</idno>
<imprint>
<date when="2018">2018</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<p>Off-target oligoprobe’s interaction with partially complementary nucleotide sequences represents a problem for many bio-techniques. The goal of the study was to identify oligoprobe sequence characteristics that control the ratio between on-target and off-target hybridization. To understand the complex interplay between specific and genome-wide off-target (cross-hybridization) signals, we analyzed a database derived from genomic comparison hybridization experiments performed with an Affymetrix tiling array. The database included two types of probes with signals derived from (i) a combination of specific signal and cross-hybridization and (ii) genomic cross-hybridization only. All probes from the database were grouped into bins according to their sequence characteristics, where both hybridization signals were averaged separately. For selection of specific probes, we analyzed the following sequence characteristics: vulnerability to self-folding, nucleotide composition bias, numbers of G nucleotides and GGG-blocks, and occurrence of probe’s
<italic>k</italic>
-mers in the human genome. Increases in bin ranges for these characteristics are simultaneously accompanied by a decrease in hybridization specificity—the ratio between specific and cross-hybridization signals. However, both averaged hybridization signals exhibit growing trends along with an increase of probes’ binding energy, where the hybridization specific signal increases significantly faster in comparison to the cross-hybridization. The same trend is evident for the
<italic>S</italic>
function, which serves as a combined evaluation of probe binding energy and occurrence of probe’s
<italic>k</italic>
-mers in the genome. Application of
<italic>S</italic>
allows extracting a larger number of specific probes, as compared to using only binding energy. Thus, we showed that high values of specific and cross-hybridization signals are not mutually exclusive for probes with high values of binding energy and
<italic>S</italic>
. In this study, the application of a new set of sequence characteristics allows detection of probes that are highly specific to their targets for array design and other bio-techniques that require selection of specific probes.</p>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct>
<analytic>
<author>
<name sortKey="Horak, Ce" uniqKey="Horak C">CE Horak</name>
</author>
<author>
<name sortKey="Snyder, M" uniqKey="Snyder M">M Snyder</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Duan, F" uniqKey="Duan F">F Duan</name>
</author>
<author>
<name sortKey="Pauley, Ma" uniqKey="Pauley M">MA Pauley</name>
</author>
<author>
<name sortKey="Spindel, Er" uniqKey="Spindel E">ER Spindel</name>
</author>
<author>
<name sortKey="Zhang, L" uniqKey="Zhang L">L Zhang</name>
</author>
<author>
<name sortKey="Norgren, Rb" uniqKey="Norgren R">RB Norgren</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Gresham, D" uniqKey="Gresham D">D Gresham</name>
</author>
<author>
<name sortKey="Curry, B" uniqKey="Curry B">B Curry</name>
</author>
<author>
<name sortKey="Ward, A" uniqKey="Ward A">A Ward</name>
</author>
<author>
<name sortKey="Gordon, Db" uniqKey="Gordon D">DB Gordon</name>
</author>
<author>
<name sortKey="Brizuela, L" uniqKey="Brizuela L">L Brizuela</name>
</author>
<author>
<name sortKey="Kruglyak, L" uniqKey="Kruglyak L">L Kruglyak</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hurd, Pj" uniqKey="Hurd P">PJ Hurd</name>
</author>
<author>
<name sortKey="Nelson, Cj" uniqKey="Nelson C">CJ Nelson</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Aston, E" uniqKey="Aston E">E Aston</name>
</author>
<author>
<name sortKey="Whitby, H" uniqKey="Whitby H">H Whitby</name>
</author>
<author>
<name sortKey="Maxwell, T" uniqKey="Maxwell T">T Maxwell</name>
</author>
<author>
<name sortKey="Glaus, N" uniqKey="Glaus N">N Glaus</name>
</author>
<author>
<name sortKey="Cowley, B" uniqKey="Cowley B">B Cowley</name>
</author>
<author>
<name sortKey="Lowry, D" uniqKey="Lowry D">D Lowry</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ahn, Jw" uniqKey="Ahn J">JW Ahn</name>
</author>
<author>
<name sortKey="Mann, K" uniqKey="Mann K">K Mann</name>
</author>
<author>
<name sortKey="Walsh, S" uniqKey="Walsh S">S Walsh</name>
</author>
<author>
<name sortKey="Shehab, M" uniqKey="Shehab M">M Shehab</name>
</author>
<author>
<name sortKey="Hoang, S" uniqKey="Hoang S">S Hoang</name>
</author>
<author>
<name sortKey="Docherty, Z" uniqKey="Docherty Z">Z Docherty</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Jaksik, R" uniqKey="Jaksik R">R Jaksik</name>
</author>
<author>
<name sortKey="Iwanaszko, M" uniqKey="Iwanaszko M">M Iwanaszko</name>
</author>
<author>
<name sortKey="Rzeszowska Wolny, J" uniqKey="Rzeszowska Wolny J">J Rzeszowska-Wolny</name>
</author>
<author>
<name sortKey="Kimmel, M" uniqKey="Kimmel M">M Kimmel</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Carter, Np" uniqKey="Carter N">NP Carter</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Pozhitkov, Ae" uniqKey="Pozhitkov A">AE Pozhitkov</name>
</author>
<author>
<name sortKey="Noble, Pa" uniqKey="Noble P">PA Noble</name>
</author>
<author>
<name sortKey="Bryk, J" uniqKey="Bryk J">J Bryk</name>
</author>
<author>
<name sortKey="Tautz, D" uniqKey="Tautz D">D Tautz</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Fasold, M" uniqKey="Fasold M">M Fasold</name>
</author>
<author>
<name sortKey="Binder, H" uniqKey="Binder H">H Binder</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Binder, H" uniqKey="Binder H">H Binder</name>
</author>
<author>
<name sortKey="Preibisch, S" uniqKey="Preibisch S">S Preibisch</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Wu, C" uniqKey="Wu C">C Wu</name>
</author>
<author>
<name sortKey="Zhao, H" uniqKey="Zhao H">H Zhao</name>
</author>
<author>
<name sortKey="Baggerly, K" uniqKey="Baggerly K">K Baggerly</name>
</author>
<author>
<name sortKey="Carta, R" uniqKey="Carta R">R Carta</name>
</author>
<author>
<name sortKey="Zhang, L" uniqKey="Zhang L">L Zhang</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Upton, G" uniqKey="Upton G">G Upton</name>
</author>
<author>
<name sortKey="Langdon, W" uniqKey="Langdon W">W Langdon</name>
</author>
<author>
<name sortKey="Harrison, A" uniqKey="Harrison A">A Harrison</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Langdon, Wb" uniqKey="Langdon W">WB Langdon</name>
</author>
<author>
<name sortKey="Upton, Gjg" uniqKey="Upton G">GJG Upton</name>
</author>
<author>
<name sortKey="Harrison, Ap" uniqKey="Harrison A">AP Harrison</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Binder, H" uniqKey="Binder H">H Binder</name>
</author>
<author>
<name sortKey="Fasold, M" uniqKey="Fasold M">M Fasold</name>
</author>
<author>
<name sortKey="Glomb, T" uniqKey="Glomb T">T Glomb</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Upton, Gjg" uniqKey="Upton G">GJG Upton</name>
</author>
<author>
<name sortKey="Sanchez Graillet, O" uniqKey="Sanchez Graillet O">O Sanchez-Graillet</name>
</author>
<author>
<name sortKey="Rowsell, J" uniqKey="Rowsell J">J Rowsell</name>
</author>
<author>
<name sortKey="Arteaga Salas, Jm" uniqKey="Arteaga Salas J">JM Arteaga-Salas</name>
</author>
<author>
<name sortKey="Graham, Ns" uniqKey="Graham N">NS Graham</name>
</author>
<author>
<name sortKey="Stalteri, Ma" uniqKey="Stalteri M">MA Stalteri</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Fasold, M" uniqKey="Fasold M">M Fasold</name>
</author>
<author>
<name sortKey="Stadler, Pf" uniqKey="Stadler P">PF Stadler</name>
</author>
<author>
<name sortKey="Binder, H" uniqKey="Binder H">H Binder</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Memon, Fn" uniqKey="Memon F">FN Memon</name>
</author>
<author>
<name sortKey="Upton, Gjg" uniqKey="Upton G">GJG Upton</name>
</author>
<author>
<name sortKey="Harrison, Ap" uniqKey="Harrison A">AP Harrison</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Matveeva, Ov" uniqKey="Matveeva O">OV Matveeva</name>
</author>
<author>
<name sortKey="Mathews, Dh" uniqKey="Mathews D">DH Mathews</name>
</author>
<author>
<name sortKey="Tsodikov, Ad" uniqKey="Tsodikov A">AD Tsodikov</name>
</author>
<author>
<name sortKey="Shabalina, Sa" uniqKey="Shabalina S">SA Shabalina</name>
</author>
<author>
<name sortKey="Gesteland, Rf" uniqKey="Gesteland R">RF Gesteland</name>
</author>
<author>
<name sortKey="Atkins, Jf" uniqKey="Atkins J">JF Atkins</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Gharaibeh, Rz" uniqKey="Gharaibeh R">RZ Gharaibeh</name>
</author>
<author>
<name sortKey="Fodor, Aa" uniqKey="Fodor A">AA Fodor</name>
</author>
<author>
<name sortKey="Gibas, Cj" uniqKey="Gibas C">CJ Gibas</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Gharaibeh, Rz" uniqKey="Gharaibeh R">RZ Gharaibeh</name>
</author>
<author>
<name sortKey="Fodor, Aa" uniqKey="Fodor A">AA Fodor</name>
</author>
<author>
<name sortKey="Gibas, Cj" uniqKey="Gibas C">CJ Gibas</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Jakubek, Ya" uniqKey="Jakubek Y">YA Jakubek</name>
</author>
<author>
<name sortKey="Cutler, Dj" uniqKey="Cutler D">DJ Cutler</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Matveeva, Ov" uniqKey="Matveeva O">OV Matveeva</name>
</author>
<author>
<name sortKey="Shabalina, Sa" uniqKey="Shabalina S">SA Shabalina</name>
</author>
<author>
<name sortKey="Nemtsov, Va" uniqKey="Nemtsov V">VA Nemtsov</name>
</author>
<author>
<name sortKey="Tsodikov, Ad" uniqKey="Tsodikov A">AD Tsodikov</name>
</author>
<author>
<name sortKey="Gesteland, Rf" uniqKey="Gesteland R">RF Gesteland</name>
</author>
<author>
<name sortKey="Atkins, Jf" uniqKey="Atkins J">JF Atkins</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Carlon, E" uniqKey="Carlon E">E Carlon</name>
</author>
<author>
<name sortKey="Heim, T" uniqKey="Heim T">T Heim</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Weckx, S" uniqKey="Weckx S">S Weckx</name>
</author>
<author>
<name sortKey="Carlon, E" uniqKey="Carlon E">E Carlon</name>
</author>
<author>
<name sortKey="Devuyst, L" uniqKey="Devuyst L">L DeVuyst</name>
</author>
<author>
<name sortKey="Van Hummelen, P" uniqKey="Van Hummelen P">P Van Hummelen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hooyberghs, J" uniqKey="Hooyberghs J">J Hooyberghs</name>
</author>
<author>
<name sortKey="Van Hummelen, P" uniqKey="Van Hummelen P">P Van Hummelen</name>
</author>
<author>
<name sortKey="Carlon, E" uniqKey="Carlon E">E Carlon</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Wu, C" uniqKey="Wu C">C Wu</name>
</author>
<author>
<name sortKey="Carta, R" uniqKey="Carta R">R Carta</name>
</author>
<author>
<name sortKey="Zhang, L" uniqKey="Zhang L">L Zhang</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Garhyan, J" uniqKey="Garhyan J">J Garhyan</name>
</author>
<author>
<name sortKey="Gharaibeh, Rz" uniqKey="Gharaibeh R">RZ Gharaibeh</name>
</author>
<author>
<name sortKey="Mcgee, S" uniqKey="Mcgee S">S McGee</name>
</author>
<author>
<name sortKey="Gibas, Cj" uniqKey="Gibas C">CJ Gibas</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kapur, K" uniqKey="Kapur K">K Kapur</name>
</author>
<author>
<name sortKey="Jiang, H" uniqKey="Jiang H">H Jiang</name>
</author>
<author>
<name sortKey="Xing, Y" uniqKey="Xing Y">Y Xing</name>
</author>
<author>
<name sortKey="Wong, Wh" uniqKey="Wong W">WH Wong</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Gr F, S" uniqKey="Gr F S">S Gräf</name>
</author>
<author>
<name sortKey="Nielsen, Fgg" uniqKey="Nielsen F">FGG Nielsen</name>
</author>
<author>
<name sortKey="Kurtz, S" uniqKey="Kurtz S">S Kurtz</name>
</author>
<author>
<name sortKey="Huynen, Ma" uniqKey="Huynen M">MA Huynen</name>
</author>
<author>
<name sortKey="Birney, E" uniqKey="Birney E">E Birney</name>
</author>
<author>
<name sortKey="Stunnenberg, H" uniqKey="Stunnenberg H">H Stunnenberg</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Du, Y" uniqKey="Du Y">Y Du</name>
</author>
<author>
<name sortKey="Murani, E" uniqKey="Murani E">E Murani</name>
</author>
<author>
<name sortKey="Ponsuksili, S" uniqKey="Ponsuksili S">S Ponsuksili</name>
</author>
<author>
<name sortKey="Wimmers, K" uniqKey="Wimmers K">K Wimmers</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Binder, H" uniqKey="Binder H">H Binder</name>
</author>
<author>
<name sortKey="Preibisch, S" uniqKey="Preibisch S">S Preibisch</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Zhang, L" uniqKey="Zhang L">L Zhang</name>
</author>
<author>
<name sortKey="Miles, Mf" uniqKey="Miles M">MF Miles</name>
</author>
<author>
<name sortKey="Aldape, Kd" uniqKey="Aldape K">KD Aldape</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Furusawa, C" uniqKey="Furusawa C">C Furusawa</name>
</author>
<author>
<name sortKey="Ono, N" uniqKey="Ono N">N Ono</name>
</author>
<author>
<name sortKey="Suzuki, S" uniqKey="Suzuki S">S Suzuki</name>
</author>
<author>
<name sortKey="Agata, T" uniqKey="Agata T">T Agata</name>
</author>
<author>
<name sortKey="Shimizu, H" uniqKey="Shimizu H">H Shimizu</name>
</author>
<author>
<name sortKey="Yomo, T" uniqKey="Yomo T">T Yomo</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Becker, J" uniqKey="Becker J">J Becker</name>
</author>
<author>
<name sortKey="Perot, P" uniqKey="Perot P">P Pérot</name>
</author>
<author>
<name sortKey="Cheynet, V" uniqKey="Cheynet V">V Cheynet</name>
</author>
<author>
<name sortKey="Oriol, G" uniqKey="Oriol G">G Oriol</name>
</author>
<author>
<name sortKey="Mugnier, N" uniqKey="Mugnier N">N Mugnier</name>
</author>
<author>
<name sortKey="Mommert, M" uniqKey="Mommert M">M Mommert</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Xia, X Q" uniqKey="Xia X">X-Q Xia</name>
</author>
<author>
<name sortKey="Jia, Z" uniqKey="Jia Z">Z Jia</name>
</author>
<author>
<name sortKey="Porwollik, S" uniqKey="Porwollik S">S Porwollik</name>
</author>
<author>
<name sortKey="Long, F" uniqKey="Long F">F Long</name>
</author>
<author>
<name sortKey="Hoemme, C" uniqKey="Hoemme C">C Hoemme</name>
</author>
<author>
<name sortKey="Ye, K" uniqKey="Ye K">K Ye</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Matveeva, Ov" uniqKey="Matveeva O">OV Matveeva</name>
</author>
<author>
<name sortKey="Nechipurenko, Yd" uniqKey="Nechipurenko Y">YD Nechipurenko</name>
</author>
<author>
<name sortKey="Riabenko, E" uniqKey="Riabenko E">E Riabenko</name>
</author>
<author>
<name sortKey="Ragan, C" uniqKey="Ragan C">C Ragan</name>
</author>
<author>
<name sortKey="Nazipova, Nn" uniqKey="Nazipova N">NN Nazipova</name>
</author>
<author>
<name sortKey="Ogurtsov, Ay" uniqKey="Ogurtsov A">AY Ogurtsov</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Matveeva, Ov" uniqKey="Matveeva O">OV Matveeva</name>
</author>
<author>
<name sortKey="Tsodikov, Ad" uniqKey="Tsodikov A">AD Tsodikov</name>
</author>
<author>
<name sortKey="Giddins, M" uniqKey="Giddins M">M Giddins</name>
</author>
<author>
<name sortKey="Freier, Sm" uniqKey="Freier S">SM Freier</name>
</author>
<author>
<name sortKey="Wyatt, Jr" uniqKey="Wyatt J">JR Wyatt</name>
</author>
<author>
<name sortKey="Spiridonov, An" uniqKey="Spiridonov A">AN Spiridonov</name>
</author>
<author>
<name sortKey="Shabalina, Sa" uniqKey="Shabalina S">SA Shabalina</name>
</author>
<author>
<name sortKey="Gesteland, Rf" uniqKey="Gesteland R">RF Gesteland</name>
</author>
<author>
<name sortKey="Atkins, Jf" uniqKey="Atkins J">JF Atkins</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kondrashov, As" uniqKey="Kondrashov A">AS Kondrashov</name>
</author>
<author>
<name sortKey="Shabalina, Sa" uniqKey="Shabalina S">SA Shabalina</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Webb, Ct" uniqKey="Webb C">CT Webb</name>
</author>
<author>
<name sortKey="Shabalina, Sa" uniqKey="Shabalina S">SA Shabalina</name>
</author>
<author>
<name sortKey="Ogurtsov, Ay" uniqKey="Ogurtsov A">AY Ogurtsov</name>
</author>
<author>
<name sortKey="Kondrashov, As" uniqKey="Kondrashov A">AS Kondrashov</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ogurtsov, Ay" uniqKey="Ogurtsov A">AY Ogurtsov</name>
</author>
<author>
<name sortKey="Shabalina, Sa" uniqKey="Shabalina S">SA Shabalina</name>
</author>
<author>
<name sortKey="Kondrashov, As" uniqKey="Kondrashov A">AS Kondrashov</name>
</author>
<author>
<name sortKey="Roytberg, Ma" uniqKey="Roytberg M">MA Roytberg</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ogurtsov, Ay" uniqKey="Ogurtsov A">AY Ogurtsov</name>
</author>
<author>
<name sortKey="Mari O Ramirez, L" uniqKey="Mari O Ramirez L">L Mariño-Ramírez</name>
</author>
<author>
<name sortKey="Johnson, Gr" uniqKey="Johnson G">GR Johnson</name>
</author>
<author>
<name sortKey="Landsman, D" uniqKey="Landsman D">D Landsman</name>
</author>
<author>
<name sortKey="Shabalina, Sa" uniqKey="Shabalina S">SA Shabalina</name>
</author>
<author>
<name sortKey="Spiridonov, Na" uniqKey="Spiridonov N">NA Spiridonov</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Santalucia, J" uniqKey="Santalucia J">J SantaLucia</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Matveeva, Ov" uniqKey="Matveeva O">OV Matveeva</name>
</author>
<author>
<name sortKey="Nazipova, Nn" uniqKey="Nazipova N">NN Nazipova</name>
</author>
<author>
<name sortKey="Ogurtsov, Ay" uniqKey="Ogurtsov A">AY Ogurtsov</name>
</author>
<author>
<name sortKey="Shabalina, Sa" uniqKey="Shabalina S">SA Shabalina</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Matveeva, Ov" uniqKey="Matveeva O">OV Matveeva</name>
</author>
<author>
<name sortKey="Kang, Y" uniqKey="Kang Y">Y Kang</name>
</author>
<author>
<name sortKey="Spiridonov, An" uniqKey="Spiridonov A">AN Spiridonov</name>
</author>
<author>
<name sortKey="Saetrom, P" uniqKey="Saetrom P">P Saetrom</name>
</author>
<author>
<name sortKey="Nemtsov, Va" uniqKey="Nemtsov V">VA Nemtsov</name>
</author>
<author>
<name sortKey="Ogurtsov, Ay" uniqKey="Ogurtsov A">AY Ogurtsov</name>
</author>
<author>
<name sortKey="Nechipurenko, Yd" uniqKey="Nechipurenko Y">YD Nechipurenko</name>
</author>
<author>
<name sortKey="Shabalina, Sa" uniqKey="Shabalina S">SA Shabalina</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hadiwikarta, Ww" uniqKey="Hadiwikarta W">WW Hadiwikarta</name>
</author>
<author>
<name sortKey="Carlon, E" uniqKey="Carlon E">E Carlon</name>
</author>
<author>
<name sortKey="Hooyberghs, J" uniqKey="Hooyberghs J">J Hooyberghs</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Cho, H" uniqKey="Cho H">H Cho</name>
</author>
<author>
<name sortKey="Chou, H H" uniqKey="Chou H">H-H Chou</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="research-article">
<pmc-dir>properties open_access</pmc-dir>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">PLoS One</journal-id>
<journal-id journal-id-type="iso-abbrev">PLoS ONE</journal-id>
<journal-id journal-id-type="publisher-id">plos</journal-id>
<journal-id journal-id-type="pmc">plosone</journal-id>
<journal-title-group>
<journal-title>PLoS ONE</journal-title>
</journal-title-group>
<issn pub-type="epub">1932-6203</issn>
<publisher>
<publisher-name>Public Library of Science</publisher-name>
<publisher-loc>San Francisco, CA USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="pmid">29928000</article-id>
<article-id pub-id-type="pmc">6013149</article-id>
<article-id pub-id-type="doi">10.1371/journal.pone.0199162</article-id>
<article-id pub-id-type="publisher-id">PONE-D-18-06956</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Research Article</subject>
</subj-group>
<subj-group subj-group-type="Discipline-v3">
<subject>Biology and Life Sciences</subject>
<subj-group>
<subject>Molecular Biology</subject>
<subj-group>
<subject>Molecular Biology Techniques</subject>
<subj-group>
<subject>Molecular Probe Techniques</subject>
<subj-group>
<subject>Probe Hybridization</subject>
</subj-group>
</subj-group>
</subj-group>
</subj-group>
</subj-group>
<subj-group subj-group-type="Discipline-v3">
<subject>Research and Analysis Methods</subject>
<subj-group>
<subject>Molecular Biology Techniques</subject>
<subj-group>
<subject>Molecular Probe Techniques</subject>
<subj-group>
<subject>Probe Hybridization</subject>
</subj-group>
</subj-group>
</subj-group>
</subj-group>
<subj-group subj-group-type="Discipline-v3">
<subject>Biology and Life Sciences</subject>
<subj-group>
<subject>Cell Biology</subject>
<subj-group>
<subject>Signal Transduction</subject>
<subj-group>
<subject>Cell Signaling</subject>
<subj-group>
<subject>Genomic Signal Processing</subject>
</subj-group>
</subj-group>
</subj-group>
</subj-group>
</subj-group>
<subj-group subj-group-type="Discipline-v3">
<subject>Research and Analysis Methods</subject>
<subj-group>
<subject>Database and Informatics Methods</subject>
<subj-group>
<subject>Biological Databases</subject>
<subj-group>
<subject>Sequence Databases</subject>
</subj-group>
</subj-group>
</subj-group>
</subj-group>
<subj-group subj-group-type="Discipline-v3">
<subject>Research and Analysis Methods</subject>
<subj-group>
<subject>Database and Informatics Methods</subject>
<subj-group>
<subject>Bioinformatics</subject>
<subj-group>
<subject>Sequence Analysis</subject>
<subj-group>
<subject>Sequence Databases</subject>
</subj-group>
</subj-group>
</subj-group>
</subj-group>
</subj-group>
<subj-group subj-group-type="Discipline-v3">
<subject>Research and Analysis Methods</subject>
<subj-group>
<subject>Bioassays and Physiological Analysis</subject>
<subj-group>
<subject>Microarrays</subject>
</subj-group>
</subj-group>
</subj-group>
<subj-group subj-group-type="Discipline-v3">
<subject>Biology and Life Sciences</subject>
<subj-group>
<subject>Molecular Biology</subject>
<subj-group>
<subject>Molecular Biology Techniques</subject>
<subj-group>
<subject>Sequencing Techniques</subject>
<subj-group>
<subject>Nucleotide Sequencing</subject>
</subj-group>
</subj-group>
</subj-group>
</subj-group>
</subj-group>
<subj-group subj-group-type="Discipline-v3">
<subject>Research and Analysis Methods</subject>
<subj-group>
<subject>Molecular Biology Techniques</subject>
<subj-group>
<subject>Sequencing Techniques</subject>
<subj-group>
<subject>Nucleotide Sequencing</subject>
</subj-group>
</subj-group>
</subj-group>
</subj-group>
<subj-group subj-group-type="Discipline-v3">
<subject>Research and Analysis Methods</subject>
<subj-group>
<subject>Database and Informatics Methods</subject>
<subj-group>
<subject>Bioinformatics</subject>
<subj-group>
<subject>Sequence Analysis</subject>
</subj-group>
</subj-group>
</subj-group>
</subj-group>
<subj-group subj-group-type="Discipline-v3">
<subject>Biology and Life Sciences</subject>
<subj-group>
<subject>Genetics</subject>
<subj-group>
<subject>Genomics</subject>
<subj-group>
<subject>Human Genomics</subject>
</subj-group>
</subj-group>
</subj-group>
</subj-group>
<subj-group subj-group-type="Discipline-v3">
<subject>Research and Analysis Methods</subject>
<subj-group>
<subject>Database and Informatics Methods</subject>
<subj-group>
<subject>Biological Databases</subject>
<subj-group>
<subject>Genomic Databases</subject>
</subj-group>
</subj-group>
</subj-group>
</subj-group>
<subj-group subj-group-type="Discipline-v3">
<subject>Biology and Life Sciences</subject>
<subj-group>
<subject>Computational Biology</subject>
<subj-group>
<subject>Genome Analysis</subject>
<subj-group>
<subject>Genomic Databases</subject>
</subj-group>
</subj-group>
</subj-group>
</subj-group>
<subj-group subj-group-type="Discipline-v3">
<subject>Biology and Life Sciences</subject>
<subj-group>
<subject>Genetics</subject>
<subj-group>
<subject>Genomics</subject>
<subj-group>
<subject>Genome Analysis</subject>
<subj-group>
<subject>Genomic Databases</subject>
</subj-group>
</subj-group>
</subj-group>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Sequence characteristics define trade-offs between on-target and genome-wide off-target hybridization of oligoprobes</article-title>
<alt-title alt-title-type="running-head">Trade-offs between on-target and genome-wide off-target hybridization</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Matveeva</surname>
<given-names>Olga V.</given-names>
</name>
<role content-type="http://credit.casrai.org/">Data curation</role>
<role content-type="http://credit.casrai.org/">Formal analysis</role>
<role content-type="http://credit.casrai.org/">Investigation</role>
<role content-type="http://credit.casrai.org/">Project administration</role>
<role content-type="http://credit.casrai.org/">Writing – original draft</role>
<xref ref-type="aff" rid="aff001">
<sup>1</sup>
</xref>
<xref ref-type="corresp" rid="cor001">*</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Ogurtsov</surname>
<given-names>Aleksey Y.</given-names>
</name>
<role content-type="http://credit.casrai.org/">Formal analysis</role>
<role content-type="http://credit.casrai.org/">Methodology</role>
<role content-type="http://credit.casrai.org/">Software</role>
<role content-type="http://credit.casrai.org/">Validation</role>
<xref ref-type="aff" rid="aff002">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Nazipova</surname>
<given-names>Nafisa N.</given-names>
</name>
<role content-type="http://credit.casrai.org/">Data curation</role>
<role content-type="http://credit.casrai.org/">Formal analysis</role>
<role content-type="http://credit.casrai.org/">Methodology</role>
<xref ref-type="aff" rid="aff003">
<sup>3</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<contrib-id authenticated="true" contrib-id-type="orcid">http://orcid.org/0000-0002-3774-819X</contrib-id>
<name>
<surname>Shabalina</surname>
<given-names>Svetlana A.</given-names>
</name>
<role content-type="http://credit.casrai.org/">Conceptualization</role>
<role content-type="http://credit.casrai.org/">Formal analysis</role>
<role content-type="http://credit.casrai.org/">Methodology</role>
<role content-type="http://credit.casrai.org/">Software</role>
<role content-type="http://credit.casrai.org/">Writing – original draft</role>
<role content-type="http://credit.casrai.org/">Writing – review & editing</role>
<xref ref-type="aff" rid="aff002">
<sup>2</sup>
</xref>
<xref ref-type="corresp" rid="cor001">*</xref>
</contrib>
</contrib-group>
<aff id="aff001">
<label>1</label>
<addr-line>Biopolymer Design LLC, Acton, Massachusetts, United States of America</addr-line>
</aff>
<aff id="aff002">
<label>2</label>
<addr-line>National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America</addr-line>
</aff>
<aff id="aff003">
<label>3</label>
<addr-line>Institute of Mathematical Problems of Biology, RAS – the Branch of Keldysh Institute of Applied Mathematics of Russian Academy of Sciences, Pushchino, Moscow Region, Russia</addr-line>
</aff>
<contrib-group>
<contrib contrib-type="editor">
<name>
<surname>Kalendar</surname>
<given-names>Ruslan</given-names>
</name>
<role>Editor</role>
<xref ref-type="aff" rid="edit1"></xref>
</contrib>
</contrib-group>
<aff id="edit1">
<addr-line>University of Helsinki, FINLAND</addr-line>
</aff>
<author-notes>
<fn fn-type="COI-statement" id="coi001">
<p>
<bold>Competing Interests: </bold>
The commercial affiliation Biopolymer Design LLC does not alter our adherence to PLOS ONE policies on sharing data and materials.</p>
</fn>
<corresp id="cor001">* E-mail:
<email>olga.matveeva@gmail.com</email>
(OVM);
<email>shabalin@ncbi.nlm.nih.gov</email>
(SAS)</corresp>
</author-notes>
<pub-date pub-type="epub">
<day>21</day>
<month>6</month>
<year>2018</year>
</pub-date>
<pub-date pub-type="collection">
<year>2018</year>
</pub-date>
<volume>13</volume>
<issue>6</issue>
<elocation-id>e0199162</elocation-id>
<history>
<date date-type="received">
<day>12</day>
<month>3</month>
<year>2018</year>
</date>
<date date-type="accepted">
<day>2</day>
<month>6</month>
<year>2018</year>
</date>
</history>
<permissions>
<license xlink:href="https://creativecommons.org/publicdomain/zero/1.0/">
<license-p>This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the
<ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/publicdomain/zero/1.0/">Creative Commons CC0</ext-link>
public domain dedication.</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:href="pone.0199162.pdf"></self-uri>
<abstract>
<p>Off-target oligoprobe’s interaction with partially complementary nucleotide sequences represents a problem for many bio-techniques. The goal of the study was to identify oligoprobe sequence characteristics that control the ratio between on-target and off-target hybridization. To understand the complex interplay between specific and genome-wide off-target (cross-hybridization) signals, we analyzed a database derived from genomic comparison hybridization experiments performed with an Affymetrix tiling array. The database included two types of probes with signals derived from (i) a combination of specific signal and cross-hybridization and (ii) genomic cross-hybridization only. All probes from the database were grouped into bins according to their sequence characteristics, where both hybridization signals were averaged separately. For selection of specific probes, we analyzed the following sequence characteristics: vulnerability to self-folding, nucleotide composition bias, numbers of G nucleotides and GGG-blocks, and occurrence of probe’s
<italic>k</italic>
-mers in the human genome. Increases in bin ranges for these characteristics are simultaneously accompanied by a decrease in hybridization specificity—the ratio between specific and cross-hybridization signals. However, both averaged hybridization signals exhibit growing trends along with an increase of probes’ binding energy, where the hybridization specific signal increases significantly faster in comparison to the cross-hybridization. The same trend is evident for the
<italic>S</italic>
function, which serves as a combined evaluation of probe binding energy and occurrence of probe’s
<italic>k</italic>
-mers in the genome. Application of
<italic>S</italic>
allows extracting a larger number of specific probes, as compared to using only binding energy. Thus, we showed that high values of specific and cross-hybridization signals are not mutually exclusive for probes with high values of binding energy and
<italic>S</italic>
. In this study, the application of a new set of sequence characteristics allows detection of probes that are highly specific to their targets for array design and other bio-techniques that require selection of specific probes.</p>
</abstract>
<funding-group>
<award-group id="award001">
<funding-source>
<institution>The research was supported by the Department of Health and Human Services (National Institutes of Health, National Library of Medicine) intramural funds</institution>
</funding-source>
<principal-award-recipient>
<contrib-id authenticated="true" contrib-id-type="orcid">http://orcid.org/0000-0002-3774-819X</contrib-id>
<name>
<surname>Shabalina</surname>
<given-names>Svetlana A.</given-names>
</name>
</principal-award-recipient>
</award-group>
<award-group id="award002">
<funding-source>
<institution>The research was supported by the Department of Health and Human Services (National Institutes of Health, National Library of Medicine) intramural funds</institution>
</funding-source>
<principal-award-recipient>
<name>
<surname>Ogurtsov</surname>
<given-names>Aleksey Y.</given-names>
</name>
</principal-award-recipient>
</award-group>
<award-group id="award003">
<funding-source>
<institution>The funder Biopolymer Design LLC provided support in the form of salary for author OVM, but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific role of this author is articulated in the ‘author contributions’ section.</institution>
</funding-source>
<principal-award-recipient>
<name>
<surname>Matveeva</surname>
<given-names>Olga V.</given-names>
</name>
</principal-award-recipient>
</award-group>
<funding-statement>The research was supported by the Department of Health and Human Services (National Institutes of Health, National Library of Medicine) intramural funds to SAS and AYO. The funder Biopolymer Design LLC provided support in the form of salary for author OVM, but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.</funding-statement>
</funding-group>
<counts>
<fig-count count="7"></fig-count>
<table-count count="1"></table-count>
<page-count count="20"></page-count>
</counts>
<custom-meta-group>
<custom-meta id="data-availability">
<meta-name>Data Availability</meta-name>
<meta-value>All relevant data are within the paper and its Supporting Information files.</meta-value>
</custom-meta>
</custom-meta-group>
</article-meta>
<notes>
<title>Data Availability</title>
<p>All relevant data are within the paper and its Supporting Information files.</p>
</notes>
</front>
<body>
<sec sec-type="intro" id="sec001">
<title>Introduction</title>
<p>Many biotechnology applications involve oligoprobe hybridization with complementary targets in DNA or RNA as a basic procedural step. One such application is microarray technology. High throughput sequencing is gradually replacing microarrays as the preferred method for studying cellular transcript expression levels. However, microarrays are still dominating certain applications, such as identification of transcription binding sites [
<xref rid="pone.0199162.ref001" ref-type="bibr">1</xref>
], and gene copy number evaluations and genotyping [
<xref rid="pone.0199162.ref002" ref-type="bibr">2</xref>
<xref rid="pone.0199162.ref003" ref-type="bibr">3</xref>
]. It is possible to envision a powerful symbiosis between microarrays and new generation sequencing technologies [
<xref rid="pone.0199162.ref004" ref-type="bibr">4</xref>
].</p>
<p>Desirable reactions between an oligoprobe and its complementary target are frequently complicated by other undesirable interactions of the probe. Particularly problematic is off-target probe binding with partially complementary DNA or RNA sequences, which are almost always present in a hybridization mixture. These interactions happen in parallel with on-target interactions. Microarray hybridization is an excellent technology for characterizing an oligoprobe’s on-target and off-target interactions. A single microarray experiment, especially with comparative genomic hybridization (CGH) [
<xref rid="pone.0199162.ref005" ref-type="bibr">5</xref>
<xref rid="pone.0199162.ref006" ref-type="bibr">6</xref>
], allows visualization of millions of hybridization reactions. Despite many artifacts [
<xref rid="pone.0199162.ref007" ref-type="bibr">7</xref>
], no other technology provides such a high volume of useful information for analysis of specific oligoprobe-target interaction in a complex mixture of nonspecific reactions. A microarray hybridization signal consists of two components, target-specific and cross-hybridization. The ratio of these two components is a measure of hybridization specificity. The most specific probes produce the most reliable results.</p>
<p>One microarray application is the evaluation of gene copy number variation, and can be performed using CGH experiments [
<xref rid="pone.0199162.ref008" ref-type="bibr">8</xref>
] with Affymetrix tiling arrays that cover a whole genome with 25nt probes. Variation among individual probe signals is a huge drawback of array technology in general and, in particular, Affymetrix arrays. During post-experimental data analysis, signals from different tiled probes are averaged using a sliding window. Such averaging helps diminish the signal variability problem and allows better detection of deleted gene regions and those with variable copy numbers. However, signal averaging alone cannot eliminate the variability problem.</p>
<p>Hybridization signals derived from one microarray experiment even with the same target concentration are variable. Uneven hybridization conditions [
<xref rid="pone.0199162.ref009" ref-type="bibr">9</xref>
], target RNA quality [
<xref rid="pone.0199162.ref010" ref-type="bibr">10</xref>
<xref rid="pone.0199162.ref011" ref-type="bibr">11</xref>
], probe’s vulnerability to non-Watson Crick (non-WC) interactions through G-blocks [
<xref rid="pone.0199162.ref012" ref-type="bibr">12</xref>
<xref rid="pone.0199162.ref018" ref-type="bibr">18</xref>
], probe’s secondary structure [
<xref rid="pone.0199162.ref019" ref-type="bibr">19</xref>
<xref rid="pone.0199162.ref021" ref-type="bibr">21</xref>
] and probe’s synthesis failure [
<xref rid="pone.0199162.ref022" ref-type="bibr">22</xref>
], all play a role in signal variability. Hybridization signals are also affected by the probe’s binding energy, which defines the probe’s ability to form stable oligo-target duplexes [
<xref rid="pone.0199162.ref023" ref-type="bibr">23</xref>
<xref rid="pone.0199162.ref026" ref-type="bibr">26</xref>
]. Finally, different cross-hybridization signal components contribute to the signal’s variability.</p>
<p>A number of studies have analyzed factors that influence genome-wide cross-hybridization levels of microarray probes. Duplexes of 10–16 nucleotides that are complementary to targets may be sufficient to generate a cross-hybridization signal [
<xref rid="pone.0199162.ref027" ref-type="bibr">27</xref>
<xref rid="pone.0199162.ref028" ref-type="bibr">28</xref>
]. For 50-nt probes in particular, it was noted that “a complementary stretch of nucleotides as short as 12 nucleotides may result in the appearance of significant signal from an unintended binding partner” [
<xref rid="pone.0199162.ref028" ref-type="bibr">28</xref>
]. Shorter probes (25- nt) are hybridized in less stringent conditions compared to longer probes (50-nt). Therefore, much shorter complementary stretches might significantly contribute to the cross-hybridization signal. Kapur and co-authors [
<xref rid="pone.0199162.ref029" ref-type="bibr">29</xref>
] proposed a filtering method to detect and remove probes that have certain sequence-specific alignments with off-target transcripts. Similar approaches for filtering out potentially non-specific probes were suggested by others [
<xref rid="pone.0199162.ref030" ref-type="bibr">30</xref>
<xref rid="pone.0199162.ref031" ref-type="bibr">31</xref>
]. In these studies, the authors calculate a probe’s “uniqueness score” by evaluating the probe’s substrings frequency occurrence in a targeted genome. The main flaw of these studies [
<xref rid="pone.0199162.ref029" ref-type="bibr">29</xref>
<xref rid="pone.0199162.ref031" ref-type="bibr">31</xref>
] is a lack of consideration for the probe’s binding energies, which were shown to correlate significantly with cross-hybridization intensity [
<xref rid="pone.0199162.ref032" ref-type="bibr">32</xref>
<xref rid="pone.0199162.ref033" ref-type="bibr">33</xref>
]. Very few existing cross-hybridization models consider not only probe sequence similarity with non-target sequences, but also its thermodynamic features, including its binding energy [
<xref rid="pone.0199162.ref034" ref-type="bibr">34</xref>
<xref rid="pone.0199162.ref035" ref-type="bibr">35</xref>
].</p>
<p>To standardize terminology definition, we recommend that the scientific community discriminate between absolute cross-hybridization signals and relative cross-hybridization values. The term “absolute” cross-hybridization is used to identify signals that derive from probes that interact with partially complemented (off target) sequences only, e.g. without fully complemented targets. The term “relative” cross-hybridization represents the proportion of absolute cross-hybridization in an overall hybridization signal. Finally, the overall hybridization signal is represented by a sum of target specific and absolute cross-hybridization signals.</p>
<p>Why would such terminology and discrimination be useful for microarray hybridization studies? Two probes might have similar absolute, but different relative, cross-hybridization values. The latter is more important for probe design than the former. Relative cross-hybridization in an optimal probe should be low, whereas the same is not necessary for absolute cross-hybridization. Moreover, probes with low absolute cross-hybridization might have a low specific signal component and might be unsuitable for sensitive target detection and, consequently, for array design. A limited number of studies have analyzed relationships between probes’ sequence characteristics and target specific and/or cross-hybridization signals: two publications describe such analysis for 50-nt [
<xref rid="pone.0199162.ref036" ref-type="bibr">36</xref>
] and 25-nt [
<xref rid="pone.0199162.ref037" ref-type="bibr">37</xref>
] oligoprobes, respectively.</p>
<p>This study describes the analysis of relationships between sequence characteristics and hybridization signals of probes. The focus of this study is not limited to hybridization specificity or cross-hybridization signals. We have concentrated on the difference between absolute and relative cross-hybridizations and their divergent behavior according to changes in various sequence characteristics. These findings could be used for further optimization of recent advanced probe-target hybridization models [
<xref rid="pone.0199162.ref035" ref-type="bibr">35</xref>
] as well as for improvement of probes’ design.</p>
</sec>
<sec sec-type="materials|methods" id="sec002">
<title>Materials and methods</title>
<sec id="sec003">
<title>Hybridization database</title>
<p>In normal human somatic chromosomes, each gene is represented by two copies (
<xref ref-type="fig" rid="pone.0199162.g001">Fig 1A</xref>
). In the male X chromosome, most genes are represented by one copy (
<xref ref-type="fig" rid="pone.0199162.g001">Fig 1B</xref>
). In male patients affected by Duchenne muscular dystrophy (DMD), a region of the DMD gene is deleted and consequently represented by zero copies (
<xref ref-type="fig" rid="pone.0199162.g001">Fig 1C</xref>
).</p>
<fig id="pone.0199162.g001" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0199162.g001</object-id>
<label>Fig 1</label>
<caption>
<title>Hybridization experiment scheme with tiling microarray.</title>
<p>The DNA target region is represented by (A) two gene copies, (B) one copy or (C) no gene—one copy (X chromosome) combined with deletion of DMD gene fragment in a male patient.</p>
</caption>
<graphic xlink:href="pone.0199162.g001"></graphic>
</fig>
<p>We analyzed hybridization data from an experiment performed with DNA from a DMD-affected male patient. A large part of the DMD gene in the X chromosome was deleted from the patient’s DNA. Consequently, the oligoprobes targeting the deleted region of the DMD gene were without specific targets and produced only genomic cross-hybridization signals. Oligoprobes that targeted the non-deleted region in the X chromosome, where a specific target is present, represented the sum of target-specific and cross-hybridization signals. This sum is referred to as the “overall hybridization” in this study. Each set of probes with and without targets included 10
<sup>4</sup>
data points from the same hybridization experiment, performed with the same chip (
<xref ref-type="fig" rid="pone.0199162.g001">Fig 1C</xref>
). The probe dataset was provided courtesy of the Department of Human Genetics, University of Utah (the patient provided written consent for using his bio-samples for genomic and genetic research; data are available by requests). The standard Affymetrix protocol was used for genomic DNA amplification and hybridization at 45°C. Hybridization was performed using a tiling array Gene Chip Human Mapping 100K Set.</p>
</sec>
<sec id="sec004">
<title>Definitions</title>
<p>The main hybridization probes’ characteristics of the study are illustrated graphically in
<xref ref-type="fig" rid="pone.0199162.g002">Fig 2</xref>
. Based on previous studies [
<xref rid="pone.0199162.ref037" ref-type="bibr">37</xref>
] we assume that for sets of probes with very similar sequence characteristics averaged genome-wide cross-hybridization signals are also similar, regardless of whether these probes were with or without targets.</p>
<fig id="pone.0199162.g002" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0199162.g002</object-id>
<label>Fig 2</label>
<caption>
<title>Schematic representation of probes’ hybridization characteristics.</title>
</caption>
<graphic xlink:href="pone.0199162.g002"></graphic>
</fig>
<p>
<underline>Overall hybridization (O)</underline>
is represented by signals from probes with targets. Overall hybridization combines target specific and cross-hybridization signals.</p>
<p>
<underline>Absolute cross-hybridization (A)</underline>
is represented by signals that derive from interactions of probes with partially complemented (off target) sequences only, e.g. without fully complemented targets.</p>
<p>
<underline>Relative cross-hybridization (R)</underline>
is the proportion of absolute cross-hybridization in the overall hybridization signal:
<italic>R = A/O</italic>
</p>
<p>
<underline>Target Specific hybridization (Sh</underline>
) is the difference between overall hybridization and absolute cross-hybridization:
<italic>Sh = O</italic>
<italic>A</italic>
</p>
<p>
<underline>Hybridization specificity (HS)</underline>
is the ratio between target-specific and absolute cross-hybridizations:
<italic>HS</italic>
=
<italic>Sh/A</italic>
= (
<italic>O-A</italic>
)/
<italic>A</italic>
=
<italic>1/R-1</italic>
</p>
</sec>
<sec id="sec005">
<title>Sequence characteristics of probes</title>
<sec id="sec006">
<title>Genomic occurrence of
<italic>k</italic>
-mers</title>
<p>We downloaded publicly available human genome sequences for the GRCh38.p7 version of the genome assembly
<ext-link ext-link-type="ftp" xlink:href="ftp://ncbi.nlm.nih.gov/genomes/H_sapiens/">ftp://ncbi.nlm.nih.gov/genomes/H_sapiens/</ext-link>
and created a table of occurrences of
<italic>k</italic>
-mers (where 7 ≤
<italic>k</italic>
≤ 11) for the human chromosomes. The frequencies of
<italic>k</italic>
-mers (where 7 ≤
<italic>k</italic>
≤ 11) we define as a number of occurrences of the
<italic>k</italic>
-mer normalized by total number of all
<italic>k</italic>
-mers occurrences in the human chromosomes. For each oligonucleotide probe in our set of 20K oligonucleotides, we calculated the minimum, maximum and total number of occurrence of all
<italic>k</italic>
-mers: fifteen 11-mers, sixteen 10-mers, seventeen 9-mers, eighteen 8-mers and nineteen 7-mers, presented in the 25-nt oligonucleotide passenger strand. Genomic occurrence of each
<italic>k</italic>
-mer in an oligoprobe was calculated using in-house scripts [
<xref rid="pone.0199162.ref038" ref-type="bibr">38</xref>
<xref rid="pone.0199162.ref040" ref-type="bibr">40</xref>
]. In this study, “genomic occurrence of 11-mers” was assigned to each probe as a minimum among all 11-mers in a probe that reflects the accessibility of the most unique seed region of the probe.</p>
</sec>
<sec id="sec007">
<title>Estimation of
<italic>S</italic>
function, theoretical hybridization specificity</title>
<p>We estimate theoretical values of
<italic>S</italic>
function using the earlier described model and approach for calculation of predicted probes’ specificity [
<xref rid="pone.0199162.ref037" ref-type="bibr">37</xref>
]. Calculation of theoretical hybridization specificity
<italic>S</italic>
is based on the following formulas:
<disp-formula id="pone.0199162.e001">
<alternatives>
<graphic xlink:href="pone.0199162.e001.jpg" id="pone.0199162.e001g" mimetype="image" position="anchor" orientation="portrait"></graphic>
<mml:math id="M1">
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>o</mml:mi>
<mml:mi>l</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>g</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>e</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mo>Δ</mml:mo>
<mml:mi>G</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mi>p</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>c</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:mi>R</mml:mi>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:msup>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>X</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>o</mml:mi>
<mml:mi>l</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>g</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfrac>
<mml:mo>,</mml:mo>
<mml:mspace width="28pt"></mml:mspace>
<mml:mtext>where</mml:mtext>
<mml:mspace width="12pt"></mml:mspace>
<mml:msub>
<mml:mrow>
<mml:mi>X</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>o</mml:mi>
<mml:mi>l</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>g</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mo stretchy="false"></mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>e</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mo>Δ</mml:mo>
<mml:mi>G</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>s</mml:mi>
<mml:mi>s</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:mi>R</mml:mi>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:mrow>
</mml:math>
</alternatives>
</disp-formula>
where for each oligo, function
<italic>S</italic>
is the ratio between predicted specific signal (numerator) and predicted accumulative cross-hybridization signal (denominator);
<italic>ΔG</italic>
<sub>
<italic>spe</italic>
c</sub>
is the free energy change related to the reaction of fully paired duplex formation between an oligoprobe and target sequence;
<italic>ΔG</italic>
<sub>
<italic>cross</italic>
</sub>
is the free energy change related to the reaction of duplex formation between an oligoprobe and partially complementary sequence in genomic DNA;
<italic>X</italic>
<sub>
<italic>olig</italic>
</sub>
is the theoretical estimation of the accumulative cross-hybridization component of the oligonucleotide. Assuming every target has a core of 7-11nt of exact complementarity, we can estimate
<italic>X</italic>
<sub>
<italic>olig</italic>
</sub>
by the following expression:
<disp-formula id="pone.0199162.e002">
<alternatives>
<graphic xlink:href="pone.0199162.e002.jpg" id="pone.0199162.e002g" mimetype="image" position="anchor" orientation="portrait"></graphic>
<mml:math id="M2">
<mml:msub>
<mml:mrow>
<mml:mi>X</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>o</mml:mi>
<mml:mi>l</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>g</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:munder>
<mml:mrow>
<mml:mtext>max</mml:mtext>
</mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:munder>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="normal">*</mml:mi>
<mml:mi>e</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mo>Δ</mml:mo>
<mml:mi>G</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:mi>R</mml:mi>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:msup>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mspace width="10pt"></mml:mspace>
<mml:mtext>where</mml:mtext>
<mml:mspace width="10pt"></mml:mspace>
<mml:mi>n</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>7</mml:mn>
<mml:mo>.</mml:mo>
<mml:mo>.</mml:mo>
<mml:mn>11</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>.</mml:mo>
<mml:mo>.</mml:mo>
<mml:mo>(</mml:mo>
<mml:mn>25</mml:mn>
<mml:mo>-</mml:mo>
<mml:mi>n</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>)</mml:mo>
</mml:math>
</alternatives>
</disp-formula>
where
<italic>i</italic>
is a position of
<italic>n</italic>
-mer in the oligonucleotide, and
<italic>N</italic>
<sub>
<italic>n</italic>
,
<italic>i</italic>
</sub>
− the total number of occurrences of
<italic>n</italic>
-mer (started from
<italic>i</italic>
position of
<italic>n</italic>
-mer in the oligonucleotide) in the human genome.</p>
</sec>
<sec id="sec008">
<title>Calculation of nucleotide composition bias for oligoprobes</title>
<p>We have recently evaluated the sequence complexity and sequence asymmetry (SC and SA) scores [
<xref rid="pone.0199162.ref037" ref-type="bibr">37</xref>
] based on the nucleotide occurrence in a probe. In this study, a similar approach is used for the estimation of oligoprobe nucleotide composition bias using sequence asymmetry and simplicity score (SAS).</p>
<p>The sum of the squared differences between frequencies of A and T nucleotides and of G and C nucleotides represents the SAS score of a probe:
<italic>SAS</italic>
= (
<italic>f</italic>
<sub>
<italic>A</italic>
</sub>
<italic>f</italic>
<sub>
<italic>T</italic>
</sub>
)
<sup>2</sup>
+ (
<italic>f</italic>
<sub>
<italic>G</italic>
</sub>
<italic>f</italic>
<sub>
<italic>C</italic>
</sub>
)
<sup>2</sup>
, where
<italic>SAS</italic>
is the sequence asymmetry and simplicity score of a probe, and
<italic>f</italic>
<sub>
<italic>N</italic>
</sub>
is a frequency of
<italic>N</italic>
nucleotide (
<italic>N</italic>
=
<italic>A</italic>
,
<italic>T</italic>
,
<italic>G</italic>
or
<italic>C</italic>
) in a probe.</p>
<p>Theoretically, the SAS score’s range varies from 0 to 1, where 0 corresponds to probes with equal proportions of each nucleotide and 1 corresponds to a probe that consists of only a single type of nucleotide. So, for example a score of 1 would represent a probe comprised entirely of As. In this database, the SAS score values range from 0.0015 to 0.4.</p>
</sec>
<sec id="sec009">
<title>Oligo probe self-folding potential (secondary structure) and binding energy (probe-target duplex stability)</title>
<p>The probe’s self-folding and binding energies were evaluated by calculating
<italic>ΔG</italic>
<sub>folding</sub>
and
<italic>ΔG</italic>
<sub>binding</sub>
respectively.
<italic>ΔG</italic>
<sub>folding</sub>
was calculated by the A-Fold software [
<xref rid="pone.0199162.ref041" ref-type="bibr">41</xref>
<xref rid="pone.0199162.ref042" ref-type="bibr">42</xref>
], while
<italic>ΔG</italic>
<sub>binding</sub>
was evaluated using in-house scripts and previously published nearest neighbor parameters [
<xref rid="pone.0199162.ref043" ref-type="bibr">43</xref>
<xref rid="pone.0199162.ref045" ref-type="bibr">45</xref>
].</p>
</sec>
<sec id="sec010">
<title>Binning and averaging procedure</title>
<p>Averaging of hybridization signals from consecutively positioned tiled probes targeting the same gene is a routine procedure for CGH data analysis. Such averaging diminishes problems related to signal variability from individual probes, and in turn improves detection of deleted gene regions versus those with variable copy numbers. We applied a signal averaging procedure to analyzed data, where we sorted probes into bins according to their sequence characteristics and then averaged signals from each bin. The list of probe characteristics that was used as categorization criteria for binning and averaging included G-count, GGG-block count, nucleotide composition bias measured as SAS score (see below), self-folding, genomic occurrence of 11-mers, binding energy (
<italic>ΔG</italic>
) and theoretically estimated function
<italic>S</italic>
, where
<italic>S</italic>
is calculated based on the combined evaluations of probe binding energy and genomic occurrence of
<italic>k</italic>
-mers (see above).</p>
</sec>
</sec>
</sec>
<sec sec-type="results" id="sec011">
<title>Results</title>
<p>In this study, we discriminate between absolute and relative cross-hybridization values using two types of probes with signals derived from (i) a combination of specific signal and cross-hybridization or (ii) genomic cross-hybridization only. Absolute cross-hybridization applies to cross-hybridization signal that derives from contributions of all off-target interactions of each probe. The relative cross-hybridization term applies to a proportion of absolute cross-hybridization in an overall probe’s signal, which includes two components: the target specific signal and cross-hybridization. Therefore, while the absolute cross-hybridization represents a signal, the relative one represents a calculated signals’ ratio.</p>
<p>All probes from the database were categorized into bins according to their sequence characteristics and hybridization signals in each bin were averaged. The difference between bins with averaged signals of probes with and without a target represents averaged specific hybridization for all probes characterized by similar sequence characteristic. The calculation of the difference was used for evaluation of hybridization specificity, which is a ratio between specific- and absolute cross-hybridizations. The binning and averaging approach allowed finding and analyzing trends in relationships between probes’ hybridization and sequence characteristics without knowledge of all hybridization characteristics of each individual probe.</p>
<sec id="sec012">
<title>Sequence characteristics of probes that affect hybridization signals</title>
<sec id="sec013">
<title>G-effects</title>
<p>Hybridization specificity is negatively affected by high G-count and/or by G-block presence in a probe [
<xref rid="pone.0199162.ref037" ref-type="bibr">37</xref>
]. We collectively refer to this negative dependence as “G-effects”. The effects become stronger when G-count or G-block numbers increase in the probe. G-effects are responsible for high absolute (
<xref ref-type="fig" rid="pone.0199162.g003">Fig 3</xref>
) and high relative cross-hybridizations (Figure A-D in
<xref ref-type="supplementary-material" rid="pone.0199162.s001">S1 Fig</xref>
). G-count in probes varies from 0 to 15. Number of GGG blocks varies from 0 to 5. The sets of probes with G-count above 7nt (
<xref ref-type="fig" rid="pone.0199162.g003">Fig 3A</xref>
) and with more than one G-block (
<xref ref-type="fig" rid="pone.0199162.g003">Fig 3B and 3C</xref>
) have the lowest hybridization specificity. The ratios between specific and cross-hybridizations differ 10 times between probes with two GGG-blocks and without GGG-blocks.</p>
<fig id="pone.0199162.g003" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0199162.g003</object-id>
<label>Fig 3</label>
<caption>
<title>G-effects influence on hybridization specificity.</title>
<p>Categorization of oligoprobes according to numbers of G nucleotides (A) or GGG-blocks (B) is presented. Averaged hybridization signals were calculated for each bin and their values are shown along primary Y axes as columns of assorted colors. Overall (purple) hybridization and cross-hybridization (dark blue) are shown on the upper panels; specific hybridization (pink) is presented on the middle panels. Hybridization specificity (black), defined as ratio between specific and cross-hybridization, is presented on the lower panels. Numbers of the probes in each bin are shown along the secondary Y axes as light blue columns on the upper panels. A. Relationship between probes’ hybridization signals and the numbers of G nucleotides. B. Relationship between probes’ hybridization signals and the numbers of GGG-blocks in the positions of the probe. Probes were categorized into bins according to GGG-block counts in the probe’s positions from 1 to 23 (numeration from probe’s 5’ end). Location of GGG-blocks was assigned for three groups: for position 1, positions from 2
<sup>nd</sup>
to 4
<sup>th</sup>
, and positions from 5 to 23
<sup>rd</sup>
. The left histograms show averaged signals for probes with GGG-blocks located at the first position of the probe, middle histograms show results for probes with GGG-blocks located in the 2
<sup>nd</sup>
to 4
<sup>th</sup>
positions. The numbers of GGG-blocks in any position from the 5
<sup>th</sup>
to 23
<sup>rd</sup>
is presented on the right histogram.</p>
</caption>
<graphic xlink:href="pone.0199162.g003"></graphic>
</fig>
<p>Multiple studies suggest that array probes with four G nucleotides in a row (GGGG-block) are responsible for hybridization artifacts because of their involvement in non-WC interactions [
<xref rid="pone.0199162.ref012" ref-type="bibr">12</xref>
<xref rid="pone.0199162.ref018" ref-type="bibr">18</xref>
]. We found that probes with three Gs in a row behave almost as poorly as those with four (compare
<xref ref-type="fig" rid="pone.0199162.g004">Fig 4</xref>
top and bottom histograms). These probes have high absolute cross-hybridization, especially if a G-block is located at probes’ 5’ end, which gradually diminishes as the position of the G-block moves from the 5’ end toward the 3’ end (
<xref ref-type="fig" rid="pone.0199162.g004">Fig 4</xref>
). The probes with one or more GGG blocks have very low hybridization specificity and, consequently, very high absolute and relative cross-hybridizations (
<xref ref-type="fig" rid="pone.0199162.g003">Fig 3B and 3C</xref>
).</p>
<fig id="pone.0199162.g004" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0199162.g004</object-id>
<label>Fig 4</label>
<caption>
<title>Averaged cross-hybridization signals of probes with various locations of GGG(G)-blocks.</title>
<p>The upper panel shows results of probes’ binning and averaging for GGG-blocks and the lower panel for GGGG-blocks along the probe positions from the 1
<sup>st</sup>
to 23
<sup>rd</sup>
(5’→ 3’). Averaged absolute cross-hybridization signals are shown along the primary Y axes as dark blue columns and the numbers of probes in each bin are shown along the secondary Y axes as light blue columns.</p>
</caption>
<graphic xlink:href="pone.0199162.g004"></graphic>
</fig>
</sec>
<sec id="sec014">
<title>SAS score</title>
<p>We measured each probe’s nucleotide bias by calculating the probe’s SAS score (see
<xref ref-type="sec" rid="sec002">Materials and methods</xref>
), which varies from 0.0015 to 0.4 correspondingly. SAS score correlates negatively with hybridization specificity and positively with absolute (
<xref ref-type="fig" rid="pone.0199162.g005">Fig 5</xref>
, left panel) and with relative cross-hybridization (Figure E in
<xref ref-type="supplementary-material" rid="pone.0199162.s001">S1 Fig</xref>
).</p>
<fig id="pone.0199162.g005" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0199162.g005</object-id>
<label>Fig 5</label>
<caption>
<title>Nucleotide bias and folding potential affect hybridization specificity.</title>
<p>Oligoprobes were categorized into bins according to nucleotide bias (SAS-score, left panel) and self-folding vulnerability (right panel). Averaged hybridization signals are shown along the primary Y axis and numbers of probes in each bin are shown along secondary Y axes as light blue columns on the top histograms. Averaged values of overall (dark blue) and cross-hybridization (grey) signals are shown on the top histograms, specific hybridization (pink) is shown on the middle histograms and hybridization specificity (black) is present on the bottom histograms. Numbers of probes in each bin are shown along the secondary Y axis as light blue columns on the top histograms.</p>
</caption>
<graphic xlink:href="pone.0199162.g005"></graphic>
</fig>
</sec>
<sec id="sec015">
<title>Self-folding</title>
<p>Open probes with low self-folding potential are more specific with low absolute (
<xref ref-type="fig" rid="pone.0199162.g005">Fig 5</xref>
, right panel) and relative cross-hybridizations (Figure F in
<xref ref-type="supplementary-material" rid="pone.0199162.s001">S1 Fig</xref>
).
<italic>ΔG</italic>
values of probe’s self-folding varies from -14 to 0 kcal/mol. The hybridization specificity of comparatively open probes (-
<italic>ΔG</italic>
≤ 2 kcal/mol) is at least twice as high versus the specificity of those with high self-folding vulnerability (-
<italic>ΔG</italic>
> 3 kcal/mol).</p>
</sec>
<sec id="sec016">
<title>Genomic occurrence of
<italic>k</italic>
-mers</title>
<p>We found that measurement of genomic occurrence of 11-mers in oligoprobes can be used for further increasing hybridization specificity. The minimum values of genomic occurrence of all 11-mers in a probe are particularly suitable for this purpose. Filtering out all probes with minimum values above 250 caused a decrease of cross-hybridization and an increase of hybridization specificity (
<xref ref-type="fig" rid="pone.0199162.g006">Fig 6D</xref>
).</p>
<fig id="pone.0199162.g006" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0199162.g006</object-id>
<label>Fig 6</label>
<caption>
<title>Categorization of probes by binding energy after filtration using all analyzed parameters.</title>
<p>Oligoprobes were categorized into bins according to their binding energy. Averaged hybridization signals and other hybridization characteristics calculated for each bin are shown as columns of assorted colors. The columns with variable shades of a particular color illustrate the filtration process that gradually removes from the database all probes with a defined sequence characteristic, which negatively affect hybridization specificity because of involvement in parallel hybridization reactions. Colors of the columns becomes lighter after each filtration step. The darkest columns represent all probes from the database, while lightest columns represent probes after all filtration steps. The filtration process entails the following sequential probe removal steps: 8 or more of G in a sequence, at least one GGG block, SAS score above 0.05, -
<italic>ΔG</italic>
folding above 1.5 kcal/mol and minimum genomic occurrence among all 11-mers in oligo above 250. A. Overall hybridization (derived from the dataset of probes with specific targets). B. Cross-hybridization (absolute) (derived from the dataset of probes without specific targets). C. Specific hybridization (represented by subtraction values between overall and cross-hybridizations). D. Hybridization specificity (represented by ratio between specific and cross-hybridization). E. The percentage of probes (from complete dataset) in each bin.</p>
</caption>
<graphic xlink:href="pone.0199162.g006"></graphic>
</fig>
</sec>
<sec id="sec017">
<title>Binding energy</title>
<p>Probes’ binding energy
<italic>ΔG</italic>
values vary from -16 to -33 kcal/mol. Probes’ overall hybridization and their absolute cross-hybridization values increase with probe binding energy (
<xref ref-type="fig" rid="pone.0199162.g006">Fig 6A and 6B</xref>
). Probes’ specific hybridization (
<xref ref-type="fig" rid="pone.0199162.g006">Fig 6C</xref>
) and hybridization specificity (
<xref ref-type="fig" rid="pone.0199162.g006">Fig 6D</xref>
) demonstrates a growing trend. In contrast, relative cross-hybridization has a descending trend (Figure G in
<xref ref-type="supplementary-material" rid="pone.0199162.s001">S1 Fig</xref>
). The specificity of probes with a particularly high binding energy (26 ≤ -
<italic>ΔG</italic>
≤ 28.5 kcal/mol) is at least three times higher versus the specificity of probes with low binding energy (18 ≤ -
<italic>ΔG</italic>
≤ 21.5 kcal/mol). Moreover, hybridization specificity of probes with the lowest binding energy (-
<italic>ΔG</italic>
< 18 kcal/mol) is close to zero (
<xref ref-type="fig" rid="pone.0199162.g006">Fig 6D</xref>
). Approximately 70% of probes in the database have -
<italic>ΔG</italic>
< 26 kcal/mol (
<xref ref-type="fig" rid="pone.0199162.g006">Fig 6E</xref>
).</p>
<p>All sequence characteristics described above, except binding energy, were used for filtering of oligoprobes with low hybridization specificity. After each filtering step, we reanalyzed hybridization signals of remaining probes, separated into bins according to their binding energy. Hybridization specificity of the remaining probes increased after extraction of probes with low specificity due to G-effects, high nucleotide bias or sequence asymmetry (SAS-score), and high self-folding potential. Each step of probe removal improved specificity of remaining probes, independent from the order of the applied filtration parameters. The effect that increased specificity was more pronounced among the probes with high binding energy. The specificity of probes with maximal binding energy (26 ≤ -
<italic>ΔG</italic>
≤ 28.5 kcal/mol) more than doubled after filtration steps described above were applied (
<xref ref-type="fig" rid="pone.0199162.g006">Fig 6D</xref>
).</p>
<p>The removal of probes involved in parallel hybridization reactions, before probe categorization according to binding energy, unmasks trends in behavior of the remaining probes. Among these remaining probes the relationship between binding energy and hybridization specificity is stronger (
<xref ref-type="fig" rid="pone.0199162.g006">Fig 6D</xref>
).</p>
</sec>
<sec id="sec018">
<title>Estimation of the
<italic>S</italic>
function</title>
<p>Based on
<italic>ΔG</italic>
and genome occurrence of
<italic>k</italic>
-mers included in the oligoprobes, the theoretical prediction of cross-hybridization,
<italic>X</italic>
<sub>
<italic>olig</italic>
</sub>
, was estimated using a modification of recently published model (see
<xref ref-type="sec" rid="sec002">Material & methods</xref>
, [
<xref rid="pone.0199162.ref037" ref-type="bibr">37</xref>
]). The estimated values significantly correlated to experimental cross hybridization signals (
<italic>R</italic>
= 0.6,
<italic>P</italic>
< 3*10
<sup>−10</sup>
) and could be used for the prediction of model experiments. We also estimated the function
<italic>S</italic>
, related to the predicted probe’s specificity, using the earlier described approach [
<xref rid="pone.0199162.ref037" ref-type="bibr">37</xref>
], where
<italic>X</italic>
<sub>
<italic>olig</italic>
</sub>
, the theoretical estimation of accumulative cross-hybridization, was used as the denominator (see
<xref ref-type="sec" rid="sec002">Material & methods</xref>
).</p>
<p>We grouped oligoprobes according to the estimated function
<italic>S</italic>
(
<xref ref-type="fig" rid="pone.0199162.g007">Fig 7</xref>
), which can vary from 5 to 16. We showed that increases in bin ranges for the function
<italic>S</italic>
are simultaneously accompanied by an increase in averaged values of both overall hybridization and absolute cross-hybridization signals (
<xref ref-type="fig" rid="pone.0199162.g007">Fig 7A and 7B</xref>
). Probes’ specific hybridization (
<xref ref-type="fig" rid="pone.0199162.g007">Fig 7C</xref>
) and hybridization specificity, which is a ratio of specific- versus cross- hybridization (
<xref ref-type="fig" rid="pone.0199162.g007">Fig 7D</xref>
), also demonstrated a growing trend. However, the hybridization specific signal increases significantly faster in comparison to the cross-hybridization (
<xref ref-type="fig" rid="pone.0199162.g007">Fig 7A and 7B</xref>
). The exclusion of probes with negative characteristics from the analysis resulted in the pronounced mutual dependence between hybridization specificity and the function
<italic>S</italic>
(
<xref ref-type="fig" rid="pone.0199162.g007">Fig 7D</xref>
).</p>
<fig id="pone.0199162.g007" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0199162.g007</object-id>
<label>Fig 7</label>
<caption>
<title>Categorization of probes by theoretically estimated
<italic>S</italic>
function after filtration using all analyzed parameters.</title>
<p>Oligoprobes were categorized into bins according to the estimated
<italic>S</italic>
function, where
<italic>S</italic>
is calculated as a combination of genomic occurrence of all
<italic>k</italic>
-mers (7 ≤
<italic>k</italic>
≤ 11) in an oligoprobe and their binding energy (see
<xref ref-type="sec" rid="sec002">Material and methods</xref>
for details). Averaged hybridization signals and other hybridization characteristics calculated for each bin are shown as columns of assorted colors depending on the bin type. The columns with variable shades of a particular color illustrate the filtration process that gradually removes from the database all probes with a defined sequence characteristic, which negatively affect hybridization specificity because of involvement in parallel hybridization reactions. Colors of the columns become lighter after each filtration step. The darkest columns represent all probes from the database, while lightest columns represent probes after all filtration steps. The filtration process entails the following sequential probe removal steps: 8 or more of G in a sequence, at least one GGG block, SAS score above 0.05, -
<italic>ΔG</italic>
folding above 1.5 kcal/mol. A. Overall hybridization (derived from the probe’s dataset with specific targets). B. Cross-hybridization (absolute) (derived from the dataset of probes without specific targets). C. Specific hybridization (represented by subtraction values between overall and cross-hybridizations). D. Hybridization specificity (represented by ratio between specific and cross-hybridization). E. The percentage of probes (from complete dataset) in each bin.</p>
</caption>
<graphic xlink:href="pone.0199162.g007"></graphic>
</fig>
</sec>
</sec>
<sec id="sec019">
<title>Inter-relationships between probes hybridization and sequence characteristics</title>
<sec id="sec020">
<title>Absolute and relative cross-hybridization</title>
<p>The study of trends of absolute and relative cross-hybridization values shows that considering both parameters at once can result in conflicting trend outcomes. Thus, the study highlights the need to differentiate the two terms. Analysis of the binned averaged signals of probes revealed that both absolute and relative cross-hybridizations have growing trends along with G-count or an increase in self-folding vulnerability (Figure A-C in
<xref ref-type="supplementary-material" rid="pone.0199162.s002">S2 Fig</xref>
). Both showed a growing trend with an increase in SAS score, so smaller nucleotide bias corresponds to smaller absolute and relative cross-hybridization (Figure B in
<xref ref-type="supplementary-material" rid="pone.0199162.s002">S2 Fig</xref>
). In contrast, both variables showed opposite trends with an increase in probes’ binding energy; absolute cross-hybridization has a growing trend, whereas relative has a decreasing trend (Figure D in
<xref ref-type="supplementary-material" rid="pone.0199162.s002">S2 Fig</xref>
).</p>
</sec>
<sec id="sec021">
<title>Hybridization specificity and absolute cross-hybridization</title>
<p>Both hybridization characteristics might change according to either similar or different trends, depending on the probes’ categorization. Hybridization specificity has a descending trend, while absolute cross-hybridization has a growing trend, along with the probes’ categorization according to the G-count or self-folding vulnerability increase (Figure A-C in
<xref ref-type="supplementary-material" rid="pone.0199162.s003">S3 Fig</xref>
). Absolute cross-hybridization has a growing trend, while hybridization specificity has a decreasing trend along with an increase of the SAS score in bins (Figure B in
<xref ref-type="supplementary-material" rid="pone.0199162.s003">S3 Fig</xref>
). Consequently, hybridization specificity and absolute cross-hybridization trends of changes are opposite to each other in all the relationships mentioned above. In contrast, they are similar along with an increase of probes’ binding energy; and both hybridization specificity and absolute cross-hybridization demonstrate growing trends. However, these trends’ slopes are significantly different, since specific signal increases faster in comparison with cross-hybridization (Figure D in
<xref ref-type="supplementary-material" rid="pone.0199162.s003">S3 Fig</xref>
).</p>
</sec>
<sec id="sec022">
<title>Summary of all analyzed inter-relationships between probes sequence and hybridization characteristics</title>
<p>Analysis of all trends presented above demonstrated that absolute and relative cross-hybridization changing trends might be similar or opposite of each other depending on the sequence characteristic change. The same is true for hybridization specificity and absolute cross-hybridization. However, even when the trends are similar in directions, their magnitudes may be different. Such differences characterize the relationships between probes’ hybridization characteristics and their binding energy. The directions of all trends are summarized in
<xref ref-type="table" rid="pone.0199162.t001">Table 1</xref>
. Detailed analysis of changes in absolute and relative cross-hybridization trends according to different sequence characteristics is promising for optimization of oligoprobe design.</p>
<table-wrap id="pone.0199162.t001" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0199162.t001</object-id>
<label>Table 1</label>
<caption>
<title>Relationships between hybridization values and probes’ sequence characteristics.</title>
</caption>
<alternatives>
<graphic id="pone.0199162.t001g" xlink:href="pone.0199162.t001"></graphic>
<table frame="hsides" rules="groups">
<colgroup span="1">
<col align="left" valign="middle" span="1"></col>
<col align="left" valign="middle" span="1"></col>
<col align="left" valign="middle" span="1"></col>
<col align="left" valign="middle" span="1"></col>
<col align="left" valign="middle" span="1"></col>
</colgroup>
<thead>
<tr>
<th align="center" rowspan="2" colspan="1">Name of a probe sequence feature</th>
<th align="center" rowspan="2" colspan="1">Overall-combined hybridization</th>
<th align="center" rowspan="1" colspan="1">Absolute</th>
<th align="center" rowspan="1" colspan="1">Relative</th>
<th align="center" rowspan="2" colspan="1">Hybridization specificity</th>
</tr>
<tr>
<th align="center" colspan="2" rowspan="1">cross-hybridization</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" rowspan="1" colspan="1">G-count (above 3n) ↑</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">G-block presence ↑</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Positions 1–4 from 5 ' end ↑</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Positions 5–23 from 5' end ↑</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1">no change</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">SAS score ↑</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Self-folding vulnerability ↑</td>
<td align="center" rowspan="1" colspan="1">no change</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Binding energy ↑</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Genomic occurrence of
<italic>k</italic>
-mers in a probe ↑</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
</tbody>
</table>
</alternatives>
</table-wrap>
</sec>
</sec>
<sec id="sec023">
<title>Algorithm for selection of specific probes</title>
<p>Less specific probes, which are insensitive to gene copy number variations, are those that participate very little in any binding reactions and those that actively participate in off-target hybridization reactions. These probes have one or more negative characteristics: low binding energy, high self-folding vulnerability, high G-content, presence of GGG-blocks, higher SAS score and high values of 11-mers occurrence in a human genome. We suggest an algorithm for elimination of such probes from a set of design candidates. The cutoff points for input parameters could be user defined. We suggest eliminating probes with the following characteristics (default parameters): (i) -
<italic>ΔG</italic>
<sub>binding</sub>
below 21 kcal/mol or -
<italic>ΔG</italic>
<sub>folding</sub>
above 3 kcal/mol; (ii) SAS score above 0.05; (iii) G-count above 7 nt; (iv) occurrence of two GGG-blocks anywhere in a probe sequence; (v) one GGG block being present at any of the first four nucleotide positions from probe’s 5’ end; (vi) genomic occurrence of 11-mers above 250 (here and below, minimum value of genomic occurrence among all 11-mers in a probe was applied, see
<xref ref-type="sec" rid="sec002">Materials and methods</xref>
).</p>
<p>Each negative characteristic with the indicated cut-off point diminishes hybridization specificity by a factor of at least two. Thus, the hybridization specificity of remaining probes is two times higher in comparison with the hybridization specificity of filtered probes (0.25 versus 0.12). Approximately 50% of probes analyzed in this work have at least one of these negative sequence characteristics. The “specific probes” or “optimal probes” are those that do not have any negative characteristics, or by definition, these probes contain the following characteristics:</p>
<list list-type="order">
<list-item>
<p>-
<italic>ΔG</italic>
<sub>binding</sub>
above 23.5 kcal/mol;</p>
</list-item>
<list-item>
<p>-
<italic>ΔG</italic>
<sub>folding</sub>
below 1.5 kcal/mol;</p>
</list-item>
<list-item>
<p>SAS score below 0.05;</p>
</list-item>
<list-item>
<p>G-count below 8 nt;</p>
</list-item>
<list-item>
<p>no GGG-blocks.</p>
</list-item>
<list-item>
<p>genomic occurrence of 11-mers below 250.</p>
</list-item>
</list>
<p>The hybridization specificity of these selected probes is approximately two times higher in comparison with the hybridization specificity of remaining probes (0.5 versus 0.25). Approximately 4% of probes considered in this work belong to the “specific” category (hybridization specificity ~0.5).</p>
<p>There is an option to classify the oligoprobes based on the estimated values of
<italic>S</italic>
(
<xref ref-type="fig" rid="pone.0199162.g007">Fig 7</xref>
), instead of using -
<italic>ΔG</italic>
<sub>binding</sub>
and 11-mers occurrence. The analysis of optimal bins with higher values of
<italic>S</italic>
(threshold– 12.5) allows the user to extract a larger number of specific probes (8%). The default parameters for the specific probe design are following:</p>
<list list-type="order">
<list-item>
<p>
<italic>S</italic>
above 12.5;</p>
</list-item>
<list-item>
<p>-
<italic>ΔG</italic>
<sub>folding</sub>
below 1.5 kcal/mol;</p>
</list-item>
<list-item>
<p>SAS score below 0.05;</p>
</list-item>
<list-item>
<p>G-count below 8 nt;</p>
</list-item>
<list-item>
<p>no GGG-blocks.</p>
</list-item>
</list>
</sec>
</sec>
<sec sec-type="conclusions" id="sec024">
<title>Discussion</title>
<p>In this study, we demonstrated that elevated levels of hybridization specificity and absolute cross-hybridization are not mutually exclusive and may be attributed to the same probe sets. Moreover, trends in which hybridization specificity and absolute cross-hybridization are changing along with a sequence characteristic can differ significantly. They might be either both positive, either both negative, or opposite of each other, depending on the sequence probe characteristic.</p>
<p>Low cross-hybridization is not necessarily indicative of a probe’s high specificity. The low value of cross-hybridization is frequently a result of the poor ability of the probe to interact in general; in such cases, both specific and cross-hybridizations are low. Conversely, the probes that generate high specific hybridization could also generate high cross-hybridization because of their better interaction ability, both on- and off-target. It is important to differentiate between absolute and relative cross-hybridization values that also account for the specific hybridization. Some published studies suggest avoiding probes with high binding energy because of their high vulnerability to cross-hybridization [
<xref rid="pone.0199162.ref027" ref-type="bibr">27</xref>
]. Our study disproves this concept; probes with high binding energy may have high specific as well as high absolute cross-hybridization (at least for the Affymetrix platform). The ratio between specific and cross-hybridization signals is high for such probes. Thus, prediction of relative cross-hybridization is more important than prediction of absolute cross-hybridization for specific probe selection.</p>
<p>Specific probe-target interactions occur in parallel with non-specific interactions of two types, WC and non-WC. Potential off-target interactions of probes based on WC pairing are mainly evaluated through estimation of target
<italic>k</italic>
-mers in human genomes, binding energy, and partially through SAS scores and self-folding evaluation of probes, while their off-target interactions based on non-WC pairing are partially evaluated through assessment of G-effects. These evaluations are helpful for detection of probes that are involved in off-target hybridization reactions and for removal of such probes from the pool of all probes. Such filtering significantly improves hybridization specificity of the remaining probes on average, and could be used during the array design process. The improvement is more pronounced in the category of the probes with high binding energy because of the proportion of the probes involved with parallel hybridization reactions is larger in this category compared to ones with lower binding energy. These probes have high nucleotide bias: they are GC-rich and capable of stable self-folding, specifically, enriched with G and GGG blocks.</p>
<p>This study demonstrates that bins of probes with high binding energy are enriched with more specific oligos and show significantly greater averaged hybridization specificity. However, an increase of target concentration (gene copy numbers for CGH) may also affect the behavior of the probes with high binding energy, where the trend may be different, or hybridization specificity may reach a plateau. Hadiwikarta and co-authors [
<xref rid="pone.0199162.ref046" ref-type="bibr">46</xref>
] stated before that “For a given set of experimental parameters the affinity window of probe—target interaction is always limited … changes in experimental conditions can easily bring some measurements out of detection range.” High probe’s binding energy, which is optimal for evaluation of low target concentration, might be sub-optimal for high target concentrations. Moreover, if a target concentration is too high, other artifacts are possible; probes with high binding energy might achieve hybridization saturation or generate signals that are higher than a scanner’s upper detection limit. Thus, array probes should be designed in a certain range of binding energies for measuring the widest range of target concentrations. Consequently, isothermal array probes (the probes that share the same melting temperatures and binding energies) [
<xref rid="pone.0199162.ref047" ref-type="bibr">47</xref>
] are not optimal for measuring target concentrations because of narrow measuring range.</p>
<p>Many bio-techniques, beyond microarrays technology, rely on specific interactions of the oligoprobe with complementary targets. Even though the results of the study presented here are derived from microarray experiments, the physicochemical principals underlying our findings are “microarray-free” and most of the steps in the design procedure may be extrapolated universally.</p>
</sec>
<sec sec-type="conclusions" id="sec025">
<title>Conclusions</title>
<p>Hybridization specificity and absolute cross-hybridization values of oligoprobes both increase with increasing binding energy of probes in the analyzed bins. We showed that high specific hybridization and high cross-hybridization are not mutually exclusive, and may be attributed to the same probe sets. In other words, the level of non-specific interactions for some molecules may be high, but the ratio between off-target and total hybridization signals may be low. This also means that the specific signal is sufficient in magnitude for a high on-target/off-target ratio, which defines interaction specificity.</p>
<p>Low hybridization specificity of a probe is related to its high self-folding vulnerability, nucleotide composition bias, G-richness and GGG-block presence. Probes with these characteristics have high absolute and high relative cross-hybridization values. High genomic occurrence of
<italic>k</italic>
-mers in an oligoprobe decreases the probe’s hybridization specificity.</p>
<p>In this study, we suggested applying the function
<italic>S</italic>
as the combination of probe binding energy and occurrence of probe’s
<italic>k</italic>
-mers (7 ≤
<italic>k</italic>
≤ 11) in the genome for efficient oligoprobe design. Along with an increase of
<italic>S</italic>
function of probes, both averaged hybridization signals (specific and cross-hybridization) exhibit growing trends, where the hybridization specific signal increases significantly faster in comparison to the cross-hybridization. The application of
<italic>S</italic>
allows extracting a larger number of specific probes, as compared to using only binding energy. Thus, the
<italic>S</italic>
function together with other described sequence characteristics are promising features for further improvement of oligoprobe design for bio-techniques that require selection of most specific probes.</p>
</sec>
<sec sec-type="supplementary-material" id="sec026">
<title>Supporting information</title>
<supplementary-material content-type="local-data" id="pone.0199162.s001">
<label>S1 Fig</label>
<caption>
<title>Relationships between relative cross-hybridization values and probes’ sequence characteristics.</title>
<p>Oligoprobes were categorized into bins according to their variable sequence characteristics. The relative cross-hybridization value was calculated for each bin as an average ratio of absolute cross-hybridization versus overall-combined hybridization value. Relationship between probes’ relative cross-hybridization values and A) the numbers of G in the probes; B) the numbers of GGG-blocks in the1
<sup>st</sup>
positions of the probes; C) the numbers of GGG-blocks located in 2
<sup>nd</sup>
to 4
<sup>th</sup>
positions of the probes; D) the numbers of GGG-blocks located in in any position from the 5
<sup>th</sup>
to 23
<sup>rd</sup>
; E) probes’ SAS score; F) self-folding vulnerabilities in the probes; G) probes’ binding energies.</p>
<p>(EPS)</p>
</caption>
<media xlink:href="pone.0199162.s001.eps">
<caption>
<p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="pone.0199162.s002">
<label>S2 Fig</label>
<caption>
<title>Absolute and relative cross-hybridizations versus probes’ sequence characteristics.</title>
<p>Oligoprobes were categorized into bins according to variable probes characteristics. Averaged absolute or relative cross-hybridization signals were calculated for each bin and their values are shown as columns. Absolute cross-hybridization values are shown on the primary Y axis and relative cross-hybridization is shown on the secondary Y axis. Categorization according to A) G-count (all probes); B) SAS score (all probes); C) self-folding vulnerability (all probes); D) probes categorization according to binding energy. Probes with characteristics that negatively affect their hybridization specificity were excluded from the categorization procedure. The exclusion criteria were -
<italic>ΔG</italic>
<sub>folding</sub>
above 1.5 kcal/mol, SAS score above 0.05, or G-count above 7 nucleotides, and GGG-block(s) presence.</p>
<p>(EPS)</p>
</caption>
<media xlink:href="pone.0199162.s002.eps">
<caption>
<p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
<supplementary-material content-type="local-data" id="pone.0199162.s003">
<label>S3 Fig</label>
<caption>
<title>Absolute cross-hybridization and hybridization specificity versus probes’ sequence characteristics.</title>
<p>Oligoprobes were categorized into bins according to variable probes’ sequence characteristics. Averaged absolute cross-hybridization signals or hybridization specificity were calculated for each bin. Absolute cross-hybridization values are shown according to primary Y axis and hybridization specificity is shown on the secondary Y axis. All probes were categorized according to A) G-count; B) SAS score; C) probes’ self-folding vulnerability; D) probes’ binding energy. Probes with characteristics that negatively affect their hybridization specificity were excluded from the categorization procedure. The exclusion criteria were the same as described in
<xref ref-type="supplementary-material" rid="pone.0199162.s002">S2 Fig</xref>
.</p>
<p>(EPS)</p>
</caption>
<media xlink:href="pone.0199162.s003.eps">
<caption>
<p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
</sec>
</body>
<back>
<ack>
<p>The research was supported by the Department of Health and Human Services (National Institutes of Health, National Library of Medicine) intramural funds (SAS and AYO).</p>
</ack>
<ref-list>
<title>References</title>
<ref id="pone.0199162.ref001">
<label>1</label>
<mixed-citation publication-type="journal">
<name>
<surname>Horak</surname>
<given-names>CE</given-names>
</name>
,
<name>
<surname>Snyder</surname>
<given-names>M</given-names>
</name>
.
<article-title>ChIP-chip: a genomic approach for identifying transcription factor binding sites</article-title>
.
<source>Methods Enzymol</source>
.
<year>2002</year>
;
<volume>350</volume>
:
<fpage>469</fpage>
<lpage>483</lpage>
.
<pub-id pub-id-type="pmid">12073330</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref002">
<label>2</label>
<mixed-citation publication-type="journal">
<name>
<surname>Duan</surname>
<given-names>F</given-names>
</name>
,
<name>
<surname>Pauley</surname>
<given-names>MA</given-names>
</name>
,
<name>
<surname>Spindel</surname>
<given-names>ER</given-names>
</name>
,
<name>
<surname>Zhang</surname>
<given-names>L</given-names>
</name>
,
<name>
<surname>Norgren</surname>
<given-names>RB</given-names>
</name>
.
<article-title>Large scale analysis of positional effects of single-base mismatches on microarray gene expression data</article-title>
.
<source>BioData Min</source>
.
<year>2010</year>
;
<volume>3</volume>
:
<fpage>2</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1186/1756-0381-3-2">10.1186/1756-0381-3-2</ext-link>
</comment>
<pub-id pub-id-type="pmid">20429935</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref003">
<label>3</label>
<mixed-citation publication-type="journal">
<name>
<surname>Gresham</surname>
<given-names>D</given-names>
</name>
,
<name>
<surname>Curry</surname>
<given-names>B</given-names>
</name>
,
<name>
<surname>Ward</surname>
<given-names>A</given-names>
</name>
,
<name>
<surname>Gordon</surname>
<given-names>DB</given-names>
</name>
,
<name>
<surname>Brizuela</surname>
<given-names>L</given-names>
</name>
,
<name>
<surname>Kruglyak</surname>
<given-names>L</given-names>
</name>
,
<etal>et al</etal>
<article-title>Optimized detection of sequence variation in heterozygous genomes using DNA microarrays with isothermal-melting probes</article-title>
.
<source>Proc Natl Acad Sci</source>
.
<year>2010</year>
;
<volume>107</volume>
:
<fpage>1482</fpage>
<lpage>1487</lpage>
.
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1073/pnas.0913883107">10.1073/pnas.0913883107</ext-link>
</comment>
<pub-id pub-id-type="pmid">20080586</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref004">
<label>4</label>
<mixed-citation publication-type="journal">
<name>
<surname>Hurd</surname>
<given-names>PJ</given-names>
</name>
,
<name>
<surname>Nelson</surname>
<given-names>CJ</given-names>
</name>
.
<article-title>Advantages of next-generation sequencing versus the microarray in epigenetic research</article-title>
.
<source>Brief Funct Genomic Proteomic</source>
.
<year>2009</year>
;
<volume>8</volume>
:
<fpage>174</fpage>
<lpage>183</lpage>
.
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1093/bfgp/elp013">10.1093/bfgp/elp013</ext-link>
</comment>
<pub-id pub-id-type="pmid">19535508</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref005">
<label>5</label>
<mixed-citation publication-type="journal">
<name>
<surname>Aston</surname>
<given-names>E</given-names>
</name>
,
<name>
<surname>Whitby</surname>
<given-names>H</given-names>
</name>
,
<name>
<surname>Maxwell</surname>
<given-names>T</given-names>
</name>
,
<name>
<surname>Glaus</surname>
<given-names>N</given-names>
</name>
,
<name>
<surname>Cowley</surname>
<given-names>B</given-names>
</name>
,
<name>
<surname>Lowry</surname>
<given-names>D</given-names>
</name>
,
<etal>et al</etal>
<article-title>Comparison of targeted and whole genome analysis of postnatal specimens using a commercially available array based comparative genomic hybridisation (aCGH) microarray platform</article-title>
.
<source>J Med Genet</source>
.
<year>2008</year>
;
<volume>45</volume>
:
<fpage>268</fpage>
<lpage>274</lpage>
.
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1136/jmg.2007.055319">10.1136/jmg.2007.055319</ext-link>
</comment>
<pub-id pub-id-type="pmid">18178633</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref006">
<label>6</label>
<mixed-citation publication-type="journal">
<name>
<surname>Ahn</surname>
<given-names>JW</given-names>
</name>
,
<name>
<surname>Mann</surname>
<given-names>K</given-names>
</name>
,
<name>
<surname>Walsh</surname>
<given-names>S</given-names>
</name>
,
<name>
<surname>Shehab</surname>
<given-names>M</given-names>
</name>
,
<name>
<surname>Hoang</surname>
<given-names>S</given-names>
</name>
,
<name>
<surname>Docherty</surname>
<given-names>Z</given-names>
</name>
,
<etal>et al</etal>
<article-title>Validation and implementation of array comparative genomic hybridisation as a first line test in place of postnatal karyotyping for genome imbalance</article-title>
.
<source>Mol Cytogenet</source>
.
<year>2010</year>
;
<volume>3</volume>
:
<fpage>9</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1186/1755-8166-3-9">10.1186/1755-8166-3-9</ext-link>
</comment>
<pub-id pub-id-type="pmid">20398301</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref007">
<label>7</label>
<mixed-citation publication-type="journal">
<name>
<surname>Jaksik</surname>
<given-names>R</given-names>
</name>
,
<name>
<surname>Iwanaszko</surname>
<given-names>M</given-names>
</name>
,
<name>
<surname>Rzeszowska-Wolny</surname>
<given-names>J</given-names>
</name>
,
<name>
<surname>Kimmel</surname>
<given-names>M</given-names>
</name>
.
<article-title>Microarray experiments and factors which affect their reliability</article-title>
.
<source>Biol Direct</source>
.
<year>2015</year>
;
<volume>10</volume>
:
<fpage>46</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1186/s13062-015-0077-2">10.1186/s13062-015-0077-2</ext-link>
</comment>
<pub-id pub-id-type="pmid">26335588</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref008">
<label>8</label>
<mixed-citation publication-type="journal">
<name>
<surname>Carter</surname>
<given-names>NP</given-names>
</name>
.
<article-title>Methods and strategies for analyzing copy number variation using DNA microarrays</article-title>
.
<source>Nat Genet</source>
.
<year>2007</year>
;
<volume>39</volume>
:
<fpage>S16</fpage>
<lpage>S21</lpage>
.
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1038/ng2028">10.1038/ng2028</ext-link>
</comment>
<pub-id pub-id-type="pmid">17597776</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref009">
<label>9</label>
<mixed-citation publication-type="journal">
<name>
<surname>Pozhitkov</surname>
<given-names>AE</given-names>
</name>
,
<name>
<surname>Noble</surname>
<given-names>PA</given-names>
</name>
,
<name>
<surname>Bryk</surname>
<given-names>J</given-names>
</name>
,
<name>
<surname>Tautz</surname>
<given-names>D</given-names>
</name>
.
<article-title>A revised design for microarray experiments to account for experimental noise and uncertainty of probe response</article-title>
.
<source>PloS One</source>
.
<year>2014</year>
;
<volume>9</volume>
:
<fpage>e91295</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1371/journal.pone.0091295">10.1371/journal.pone.0091295</ext-link>
</comment>
<pub-id pub-id-type="pmid">24618910</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref010">
<label>10</label>
<mixed-citation publication-type="journal">
<name>
<surname>Fasold</surname>
<given-names>M</given-names>
</name>
,
<name>
<surname>Binder</surname>
<given-names>H</given-names>
</name>
.
<article-title>Variation of RNA Quality and Quantity Are Major Sources of Batch Effects in Microarray Expression Data</article-title>
.
<source>Microarrays</source>
.
<year>2014</year>
;
<volume>3</volume>
:
<fpage>322</fpage>
<lpage>339</lpage>
.
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3390/microarrays3040322">10.3390/microarrays3040322</ext-link>
</comment>
<pub-id pub-id-type="pmid">27600351</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref011">
<label>11</label>
<mixed-citation publication-type="journal">
<name>
<surname>Binder</surname>
<given-names>H</given-names>
</name>
,
<name>
<surname>Preibisch</surname>
<given-names>S</given-names>
</name>
.
<article-title>“Hook”-calibration of GeneChip-microarrays: Theory and algorithm</article-title>
.
<source>Algorithms Mol Biol AMB</source>
.
<year>2008</year>
;
<volume>3</volume>
:
<fpage>12</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1186/1748-7188-3-12">10.1186/1748-7188-3-12</ext-link>
</comment>
<pub-id pub-id-type="pmid">18759985</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref012">
<label>12</label>
<mixed-citation publication-type="journal">
<name>
<surname>Wu</surname>
<given-names>C</given-names>
</name>
,
<name>
<surname>Zhao</surname>
<given-names>H</given-names>
</name>
,
<name>
<surname>Baggerly</surname>
<given-names>K</given-names>
</name>
,
<name>
<surname>Carta</surname>
<given-names>R</given-names>
</name>
,
<name>
<surname>Zhang</surname>
<given-names>L</given-names>
</name>
.
<article-title>Short oligonucleotide probes containing G-stacks display abnormal binding affinity on Affymetrix microarrays</article-title>
.
<source>Bioinformatics</source>
.
<year>2007</year>
;
<volume>23</volume>
:
<fpage>2566</fpage>
<lpage>2572</lpage>
.
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1093/bioinformatics/btm271">10.1093/bioinformatics/btm271</ext-link>
</comment>
<pub-id pub-id-type="pmid">17537749</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref013">
<label>13</label>
<mixed-citation publication-type="journal">
<name>
<surname>Upton</surname>
<given-names>G</given-names>
</name>
,
<name>
<surname>Langdon</surname>
<given-names>W</given-names>
</name>
,
<name>
<surname>Harrison</surname>
<given-names>A</given-names>
</name>
.
<article-title>G-spots cause incorrect expression measurement in Affymetrix microarrays</article-title>
.
<source>BMC Genomics</source>
.
<year>2008</year>
;
<volume>9</volume>
:
<fpage>613</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1186/1471-2164-9-613">10.1186/1471-2164-9-613</ext-link>
</comment>
<pub-id pub-id-type="pmid">19094220</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref014">
<label>14</label>
<mixed-citation publication-type="journal">
<name>
<surname>Langdon</surname>
<given-names>WB</given-names>
</name>
,
<name>
<surname>Upton</surname>
<given-names>GJG</given-names>
</name>
,
<name>
<surname>Harrison</surname>
<given-names>AP</given-names>
</name>
.
<article-title>Probes containing runs of guanines provide insights into the biophysics and bioinformatics of Affymetrix GeneChips</article-title>
.
<source>Brief Bioinform</source>
.
<year>2008</year>
;
<volume>10</volume>
:
<fpage>259</fpage>
<lpage>277</lpage>
.</mixed-citation>
</ref>
<ref id="pone.0199162.ref015">
<label>15</label>
<mixed-citation publication-type="journal">
<name>
<surname>Binder</surname>
<given-names>H</given-names>
</name>
,
<name>
<surname>Fasold</surname>
<given-names>M</given-names>
</name>
,
<name>
<surname>Glomb</surname>
<given-names>T</given-names>
</name>
.
<article-title>Mismatch and G-Stack Modulated Probe Signals on SNP Microarrays</article-title>
.
<source>PLoS ONE</source>
.
<year>2009</year>
;
<volume>4</volume>
:
<fpage>e7862</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1371/journal.pone.0007862">10.1371/journal.pone.0007862</ext-link>
</comment>
<pub-id pub-id-type="pmid">19924253</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref016">
<label>16</label>
<mixed-citation publication-type="journal">
<name>
<surname>Upton</surname>
<given-names>GJG</given-names>
</name>
,
<name>
<surname>Sanchez-Graillet</surname>
<given-names>O</given-names>
</name>
,
<name>
<surname>Rowsell</surname>
<given-names>J</given-names>
</name>
,
<name>
<surname>Arteaga-Salas</surname>
<given-names>JM</given-names>
</name>
,
<name>
<surname>Graham</surname>
<given-names>NS</given-names>
</name>
,
<name>
<surname>Stalteri</surname>
<given-names>MA</given-names>
</name>
,
<etal>et al</etal>
<article-title>On the causes of outliers in Affymetrix GeneChip data</article-title>
.
<source>Brief Funct Genomic Proteomic</source>
.
<year>2009</year>
;
<volume>8</volume>
:
<fpage>199</fpage>
<lpage>212</lpage>
.
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1093/bfgp/elp027">10.1093/bfgp/elp027</ext-link>
</comment>
<pub-id pub-id-type="pmid">19734302</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref017">
<label>17</label>
<mixed-citation publication-type="journal">
<name>
<surname>Fasold</surname>
<given-names>M</given-names>
</name>
,
<name>
<surname>Stadler</surname>
<given-names>PF</given-names>
</name>
,
<name>
<surname>Binder</surname>
<given-names>H</given-names>
</name>
.
<article-title>G-stack modulated probe intensities on expression arrays—sequence corrections and signal calibration</article-title>
.
<source>BMC Bioinformatics</source>
.
<year>2010</year>
;
<volume>11</volume>
:
<fpage>207</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1186/1471-2105-11-207">10.1186/1471-2105-11-207</ext-link>
</comment>
<pub-id pub-id-type="pmid">20423484</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref018">
<label>18</label>
<mixed-citation publication-type="journal">
<name>
<surname>Memon</surname>
<given-names>FN</given-names>
</name>
,
<name>
<surname>Upton</surname>
<given-names>GJG</given-names>
</name>
,
<name>
<surname>Harrison</surname>
<given-names>AP</given-names>
</name>
.
<article-title>A Comparative Study of the Impact of G-Stack Probes on Various Affymetrix GeneChips of Mammalia</article-title>
.
<source>J Nucleic Acids</source>
.
<year>2010</year>
;</mixed-citation>
</ref>
<ref id="pone.0199162.ref019">
<label>19</label>
<mixed-citation publication-type="journal">
<name>
<surname>Matveeva</surname>
<given-names>OV</given-names>
</name>
,
<name>
<surname>Mathews</surname>
<given-names>DH</given-names>
</name>
,
<name>
<surname>Tsodikov</surname>
<given-names>AD</given-names>
</name>
,
<name>
<surname>Shabalina</surname>
<given-names>SA</given-names>
</name>
,
<name>
<surname>Gesteland</surname>
<given-names>RF</given-names>
</name>
,
<name>
<surname>Atkins</surname>
<given-names>JF</given-names>
</name>
,
<etal>et al</etal>
<article-title>Thermodynamic criteria for high hit rate antisense oligonucleotide design</article-title>
.
<source>Nucleic Acids Res</source>
.
<year>2003</year>
;
<volume>31</volume>
:
<fpage>4989</fpage>
<lpage>4994</lpage>
.
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1093/nar/gkg710">10.1093/nar/gkg710</ext-link>
</comment>
<pub-id pub-id-type="pmid">12930948</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref020">
<label>20</label>
<mixed-citation publication-type="journal">
<name>
<surname>Gharaibeh</surname>
<given-names>RZ</given-names>
</name>
,
<name>
<surname>Fodor</surname>
<given-names>AA</given-names>
</name>
,
<name>
<surname>Gibas</surname>
<given-names>CJ</given-names>
</name>
.
<article-title>Using probe secondary structure information to enhance Affymetrix GeneChip background estimates</article-title>
.
<source>Comput Biol Chem</source>
.
<year>2007</year>
;
<volume>31</volume>
:
<fpage>92</fpage>
<lpage>98</lpage>
.
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.compbiolchem.2007.02.008">10.1016/j.compbiolchem.2007.02.008</ext-link>
</comment>
<pub-id pub-id-type="pmid">17387043</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref021">
<label>21</label>
<mixed-citation publication-type="journal">
<name>
<surname>Gharaibeh</surname>
<given-names>RZ</given-names>
</name>
,
<name>
<surname>Fodor</surname>
<given-names>AA</given-names>
</name>
,
<name>
<surname>Gibas</surname>
<given-names>CJ</given-names>
</name>
.
<article-title>Software note: using probe secondary structure information to enhance Affymetrix GeneChip background estimates</article-title>
.
<source>Comput Biol Chem</source>
.
<year>2007</year>
;
<volume>31</volume>
:
<fpage>92</fpage>
<lpage>98</lpage>
.
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.compbiolchem.2007.02.008">10.1016/j.compbiolchem.2007.02.008</ext-link>
</comment>
<pub-id pub-id-type="pmid">17387043</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref022">
<label>22</label>
<mixed-citation publication-type="journal">
<name>
<surname>Jakubek</surname>
<given-names>YA</given-names>
</name>
,
<name>
<surname>Cutler</surname>
<given-names>DJ</given-names>
</name>
.
<article-title>A model of binding on DNA microarrays: understanding the combined effect of probe synthesis failure, cross-hybridization, DNA fragmentation and other experimental details of affymetrix arrays</article-title>
.
<source>BMC Genomics</source>
.
<year>2012</year>
;
<volume>13</volume>
:
<fpage>737</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1186/1471-2164-13-737">10.1186/1471-2164-13-737</ext-link>
</comment>
<pub-id pub-id-type="pmid">23270536</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref023">
<label>23</label>
<mixed-citation publication-type="journal">
<name>
<surname>Matveeva</surname>
<given-names>OV</given-names>
</name>
,
<name>
<surname>Shabalina</surname>
<given-names>SA</given-names>
</name>
,
<name>
<surname>Nemtsov</surname>
<given-names>VA</given-names>
</name>
,
<name>
<surname>Tsodikov</surname>
<given-names>AD</given-names>
</name>
,
<name>
<surname>Gesteland</surname>
<given-names>RF</given-names>
</name>
,
<name>
<surname>Atkins</surname>
<given-names>JF</given-names>
</name>
.
<article-title>Thermodynamic calculations and statistical correlations for oligoprobes design</article-title>
.
<source>Nucleic Acids Res</source>
.
<year>2003</year>
;
<volume>31</volume>
:
<fpage>4211</fpage>
<lpage>4217</lpage>
.
<pub-id pub-id-type="pmid">12853639</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref024">
<label>24</label>
<mixed-citation publication-type="journal">
<name>
<surname>Carlon</surname>
<given-names>E</given-names>
</name>
,
<name>
<surname>Heim</surname>
<given-names>T</given-names>
</name>
.
<article-title>Thermodynamics of RNA/DNA hybridization in high-density oligonucleotide microarrays</article-title>
.
<source>Phys Stat Mech Its Appl</source>
.
<year>2006</year>
;
<volume>362</volume>
:
<fpage>433</fpage>
<lpage>449</lpage>
.</mixed-citation>
</ref>
<ref id="pone.0199162.ref025">
<label>25</label>
<mixed-citation publication-type="journal">
<name>
<surname>Weckx</surname>
<given-names>S</given-names>
</name>
,
<name>
<surname>Carlon</surname>
<given-names>E</given-names>
</name>
,
<name>
<surname>DeVuyst</surname>
<given-names>L</given-names>
</name>
,
<name>
<surname>Van Hummelen</surname>
<given-names>P</given-names>
</name>
.
<article-title>Thermodynamic behavior of short oligonucleotides in microarray hybridizations can be described using Gibbs free energy in a nearest-neighbor model</article-title>
.
<source>J Phys Chem B</source>
.
<year>2007</year>
;
<volume>111</volume>
:
<fpage>13583</fpage>
<lpage>13590</lpage>
.
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1021/jp075197x">10.1021/jp075197x</ext-link>
</comment>
<pub-id pub-id-type="pmid">17994724</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref026">
<label>26</label>
<mixed-citation publication-type="journal">
<name>
<surname>Hooyberghs</surname>
<given-names>J</given-names>
</name>
,
<name>
<surname>Van Hummelen</surname>
<given-names>P</given-names>
</name>
,
<name>
<surname>Carlon</surname>
<given-names>E</given-names>
</name>
.
<article-title>The effects of mismatches on hybridization in DNA microarrays: determination of nearest neighbor parameters</article-title>
.
<source>Nucleic Acids Res</source>
.
<year>2009</year>
;
<volume>37</volume>
:
<fpage>e53</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1093/nar/gkp109">10.1093/nar/gkp109</ext-link>
</comment>
<pub-id pub-id-type="pmid">19270064</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref027">
<label>27</label>
<mixed-citation publication-type="journal">
<name>
<surname>Wu</surname>
<given-names>C</given-names>
</name>
,
<name>
<surname>Carta</surname>
<given-names>R</given-names>
</name>
,
<name>
<surname>Zhang</surname>
<given-names>L</given-names>
</name>
.
<article-title>Sequence dependence of cross-hybridization on short oligo microarrays</article-title>
.
<source>Nucleic Acids Res</source>
.
<year>2005</year>
;
<volume>33</volume>
:
<fpage>e84</fpage>
<lpage>e84</lpage>
.
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1093/nar/gni082">10.1093/nar/gni082</ext-link>
</comment>
<pub-id pub-id-type="pmid">15914663</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref028">
<label>28</label>
<mixed-citation publication-type="journal">
<name>
<surname>Garhyan</surname>
<given-names>J</given-names>
</name>
,
<name>
<surname>Gharaibeh</surname>
<given-names>RZ</given-names>
</name>
,
<name>
<surname>McGee</surname>
<given-names>S</given-names>
</name>
,
<name>
<surname>Gibas</surname>
<given-names>CJ</given-names>
</name>
.
<article-title>The illusion of specific capture: surface and solution studies of suboptimal oligonucleotide hybridization</article-title>
.
<source>BMC Res Notes</source>
.
<year>2013</year>
;
<volume>6</volume>
:
<fpage>72</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1186/1756-0500-6-72">10.1186/1756-0500-6-72</ext-link>
</comment>
<pub-id pub-id-type="pmid">23445545</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref029">
<label>29</label>
<mixed-citation publication-type="journal">
<name>
<surname>Kapur</surname>
<given-names>K</given-names>
</name>
,
<name>
<surname>Jiang</surname>
<given-names>H</given-names>
</name>
,
<name>
<surname>Xing</surname>
<given-names>Y</given-names>
</name>
,
<name>
<surname>Wong</surname>
<given-names>WH</given-names>
</name>
.
<article-title>Cross-hybridization modeling on Affymetrix exon arrays</article-title>
.
<source>Bioinformatics</source>
.
<year>2008</year>
;
<volume>24</volume>
:
<fpage>2887</fpage>
<lpage>2893</lpage>
.
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1093/bioinformatics/btn571">10.1093/bioinformatics/btn571</ext-link>
</comment>
<pub-id pub-id-type="pmid">18984598</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref030">
<label>30</label>
<mixed-citation publication-type="journal">
<name>
<surname>Gräf</surname>
<given-names>S</given-names>
</name>
,
<name>
<surname>Nielsen</surname>
<given-names>FGG</given-names>
</name>
,
<name>
<surname>Kurtz</surname>
<given-names>S</given-names>
</name>
,
<name>
<surname>Huynen</surname>
<given-names>MA</given-names>
</name>
,
<name>
<surname>Birney</surname>
<given-names>E</given-names>
</name>
,
<name>
<surname>Stunnenberg</surname>
<given-names>H</given-names>
</name>
,
<etal>et al</etal>
<article-title>Optimized design and assessment of whole genome tiling arrays</article-title>
.
<source>Bioinformatics Oxf Engl</source>
.
<year>2007</year>
;
<volume>23</volume>
:
<fpage>i195</fpage>
<lpage>204</lpage>
.</mixed-citation>
</ref>
<ref id="pone.0199162.ref031">
<label>31</label>
<mixed-citation publication-type="journal">
<name>
<surname>Du</surname>
<given-names>Y</given-names>
</name>
,
<name>
<surname>Murani</surname>
<given-names>E</given-names>
</name>
,
<name>
<surname>Ponsuksili</surname>
<given-names>S</given-names>
</name>
,
<name>
<surname>Wimmers</surname>
<given-names>K</given-names>
</name>
.
<article-title>Flexible and efficient genome tiling design with penalized uniqueness score</article-title>
.
<source>BMC Bioinformatics</source>
.
<year>2012</year>
;
<volume>13</volume>
:
<fpage>323</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1186/1471-2105-13-323">10.1186/1471-2105-13-323</ext-link>
</comment>
<pub-id pub-id-type="pmid">23216884</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref032">
<label>32</label>
<mixed-citation publication-type="journal">
<name>
<surname>Binder</surname>
<given-names>H</given-names>
</name>
,
<name>
<surname>Preibisch</surname>
<given-names>S</given-names>
</name>
.
<article-title>GeneChip microarrays—signal intensities, RNA concentrations and probe sequences</article-title>
.
<source>J Phys Condens Matter</source>
.
<year>2006</year>
;
<volume>18</volume>
:
<fpage>S537</fpage>
.</mixed-citation>
</ref>
<ref id="pone.0199162.ref033">
<label>33</label>
<mixed-citation publication-type="journal">
<name>
<surname>Zhang</surname>
<given-names>L</given-names>
</name>
,
<name>
<surname>Miles</surname>
<given-names>MF</given-names>
</name>
,
<name>
<surname>Aldape</surname>
<given-names>KD</given-names>
</name>
.
<article-title>A model of molecular interactions on short oligonucleotide microarrays</article-title>
.
<source>Nat Biotechnol</source>
.
<year>2003</year>
;
<volume>21</volume>
:
<fpage>818</fpage>
<lpage>821</lpage>
.
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1038/nbt836">10.1038/nbt836</ext-link>
</comment>
<pub-id pub-id-type="pmid">12794640</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref034">
<label>34</label>
<mixed-citation publication-type="journal">
<name>
<surname>Furusawa</surname>
<given-names>C</given-names>
</name>
,
<name>
<surname>Ono</surname>
<given-names>N</given-names>
</name>
,
<name>
<surname>Suzuki</surname>
<given-names>S</given-names>
</name>
,
<name>
<surname>Agata</surname>
<given-names>T</given-names>
</name>
,
<name>
<surname>Shimizu</surname>
<given-names>H</given-names>
</name>
,
<name>
<surname>Yomo</surname>
<given-names>T</given-names>
</name>
.
<article-title>Model-based analysis of non-specific binding for background correction of high-density oligonucleotide microarrays</article-title>
.
<source>Bioinformatics</source>
.
<year>2009</year>
;
<volume>25</volume>
:
<fpage>36</fpage>
<lpage>41</lpage>
.
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1093/bioinformatics/btn570">10.1093/bioinformatics/btn570</ext-link>
</comment>
<pub-id pub-id-type="pmid">18977779</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref035">
<label>35</label>
<mixed-citation publication-type="journal">
<name>
<surname>Becker</surname>
<given-names>J</given-names>
</name>
,
<name>
<surname>Pérot</surname>
<given-names>P</given-names>
</name>
,
<name>
<surname>Cheynet</surname>
<given-names>V</given-names>
</name>
,
<name>
<surname>Oriol</surname>
<given-names>G</given-names>
</name>
,
<name>
<surname>Mugnier</surname>
<given-names>N</given-names>
</name>
,
<name>
<surname>Mommert</surname>
<given-names>M</given-names>
</name>
,
<etal>et al</etal>
<article-title>A comprehensive hybridization model allows whole HERV transcriptome profiling using high density microarray</article-title>
.
<source>BMC Genomics</source>
.
<year>2017</year>
;
<volume>18</volume>
:
<fpage>286</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1186/s12864-017-3669-7">10.1186/s12864-017-3669-7</ext-link>
</comment>
<pub-id pub-id-type="pmid">28390408</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref036">
<label>36</label>
<mixed-citation publication-type="journal">
<name>
<surname>Xia</surname>
<given-names>X-Q</given-names>
</name>
,
<name>
<surname>Jia</surname>
<given-names>Z</given-names>
</name>
,
<name>
<surname>Porwollik</surname>
<given-names>S</given-names>
</name>
,
<name>
<surname>Long</surname>
<given-names>F</given-names>
</name>
,
<name>
<surname>Hoemme</surname>
<given-names>C</given-names>
</name>
,
<name>
<surname>Ye</surname>
<given-names>K</given-names>
</name>
,
<etal>et al</etal>
<article-title>Evaluating oligonucleotide properties for DNA microarray probe design</article-title>
.
<source>Nucleic Acids Res</source>
.
<year>2010</year>
;
<volume>38</volume>
:
<fpage>e121</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1093/nar/gkq039">10.1093/nar/gkq039</ext-link>
</comment>
<pub-id pub-id-type="pmid">20236987</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref037">
<label>37</label>
<mixed-citation publication-type="journal">
<name>
<surname>Matveeva</surname>
<given-names>OV</given-names>
</name>
,
<name>
<surname>Nechipurenko</surname>
<given-names>YD</given-names>
</name>
,
<name>
<surname>Riabenko</surname>
<given-names>E</given-names>
</name>
,
<name>
<surname>Ragan</surname>
<given-names>C</given-names>
</name>
,
<name>
<surname>Nazipova</surname>
<given-names>NN</given-names>
</name>
,
<name>
<surname>Ogurtsov</surname>
<given-names>AY</given-names>
</name>
,
<etal>et al</etal>
<article-title>Optimization of signal-to-noise ratio for efficient microarray probe design</article-title>
.
<source>Bioinformatics Oxf Engl</source>
.
<year>2016</year>
;
<volume>32</volume>
:
<fpage>i552</fpage>
<lpage>i558</lpage>
.</mixed-citation>
</ref>
<ref id="pone.0199162.ref038">
<label>38</label>
<mixed-citation publication-type="journal">
<name>
<surname>Matveeva</surname>
<given-names>OV</given-names>
</name>
,
<name>
<surname>Tsodikov</surname>
<given-names>AD</given-names>
</name>
,
<name>
<surname>Giddins</surname>
<given-names>M</given-names>
</name>
,
<name>
<surname>Freier</surname>
<given-names>SM</given-names>
</name>
,
<name>
<surname>Wyatt</surname>
<given-names>JR</given-names>
</name>
,
<name>
<surname>Spiridonov</surname>
<given-names>AN</given-names>
</name>
,
<name>
<surname>Shabalina</surname>
<given-names>SA</given-names>
</name>
,
<name>
<surname>Gesteland</surname>
<given-names>RF</given-names>
</name>
,
<name>
<surname>Atkins</surname>
<given-names>JF</given-names>
</name>
.
<article-title>Identification of sequence motifs in oligonucleotides whose presence is correlated with antisense activity</article-title>
.
<source>Nucleic Acids Res</source>
.
<year>2000</year>
;
<volume>28</volume>
:
<fpage>2862</fpage>
<lpage>2862</lpage>
.
<pub-id pub-id-type="pmid">10908347</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref039">
<label>39</label>
<mixed-citation publication-type="journal">
<name>
<surname>Kondrashov</surname>
<given-names>AS</given-names>
</name>
,
<name>
<surname>Shabalina</surname>
<given-names>SA</given-names>
</name>
.
<article-title>Classification of common conserved sequences in mammalian intergenic regions</article-title>
.
<source>Hum Mol Genet</source>
.
<year>2002</year>
;
<volume>11</volume>
:
<fpage>669</fpage>
<lpage>674</lpage>
.
<pub-id pub-id-type="pmid">11912182</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref040">
<label>40</label>
<mixed-citation publication-type="journal">
<name>
<surname>Webb</surname>
<given-names>CT</given-names>
</name>
,
<name>
<surname>Shabalina</surname>
<given-names>SA</given-names>
</name>
,
<name>
<surname>Ogurtsov</surname>
<given-names>AY</given-names>
</name>
,
<name>
<surname>Kondrashov</surname>
<given-names>AS</given-names>
</name>
.
<article-title>Analysis of similarity within 142 pairs of orthologous intergenic regions of Caenorhabditis elegans and Caenorhabditis briggsae</article-title>
.
<source>Nucleic Acids Res</source>
.
<year>2002</year>
;
<volume>30</volume>
:
<fpage>1233</fpage>
<lpage>1239</lpage>
.
<pub-id pub-id-type="pmid">11861916</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref041">
<label>41</label>
<mixed-citation publication-type="journal">
<name>
<surname>Ogurtsov</surname>
<given-names>AY</given-names>
</name>
,
<name>
<surname>Shabalina</surname>
<given-names>SA</given-names>
</name>
,
<name>
<surname>Kondrashov</surname>
<given-names>AS</given-names>
</name>
,
<name>
<surname>Roytberg</surname>
<given-names>MA</given-names>
</name>
.
<article-title>Analysis of internal loops within the RNA secondary structure in almost quadratic time</article-title>
.
<source>Bioinformatics Oxf Engl</source>
.
<year>2006</year>
;
<volume>22</volume>
:
<fpage>1317</fpage>
<lpage>1324</lpage>
.</mixed-citation>
</ref>
<ref id="pone.0199162.ref042">
<label>42</label>
<mixed-citation publication-type="journal">
<name>
<surname>Ogurtsov</surname>
<given-names>AY</given-names>
</name>
,
<name>
<surname>Mariño-Ramírez</surname>
<given-names>L</given-names>
</name>
,
<name>
<surname>Johnson</surname>
<given-names>GR</given-names>
</name>
,
<name>
<surname>Landsman</surname>
<given-names>D</given-names>
</name>
,
<name>
<surname>Shabalina</surname>
<given-names>SA</given-names>
</name>
,
<name>
<surname>Spiridonov</surname>
<given-names>NA</given-names>
</name>
.
<article-title>Expression patterns of protein kinases correlate with gene architecture and evolutionary rates</article-title>
.
<source>PLoS One</source>
.
<year>2008</year>
,
<volume>3</volume>
(
<issue>10</issue>
):
<fpage>e3599</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1371/journal.pone.0003599">10.1371/journal.pone.0003599</ext-link>
</comment>
<pub-id pub-id-type="pmid">18974838</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref043">
<label>43</label>
<mixed-citation publication-type="journal">
<name>
<surname>SantaLucia</surname>
<given-names>J</given-names>
</name>
.
<article-title>A unified view of polymer, dumbbell, and oligonucleotide DNA nearest-neighbor thermodynamics</article-title>
.
<source>Proc Natl Acad Sci U S A</source>
.
<year>1998</year>
;
<volume>95</volume>
:
<fpage>1460</fpage>
<lpage>1465</lpage>
.
<pub-id pub-id-type="pmid">9465037</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref044">
<label>44</label>
<mixed-citation publication-type="journal">
<name>
<surname>Matveeva</surname>
<given-names>OV</given-names>
</name>
,
<name>
<surname>Nazipova</surname>
<given-names>NN</given-names>
</name>
,
<name>
<surname>Ogurtsov</surname>
<given-names>AY</given-names>
</name>
,
<name>
<surname>Shabalina</surname>
<given-names>SA</given-names>
</name>
.
<article-title>Optimized models for design of efficient miR30-based shRNAs</article-title>
.
<source>Front Genet</source>
.
<year>2012</year>
;
<volume>3</volume>
:
<fpage>163</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fgene.2012.00163">10.3389/fgene.2012.00163</ext-link>
</comment>
<pub-id pub-id-type="pmid">22952469</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref045">
<label>45</label>
<mixed-citation publication-type="journal">
<name>
<surname>Matveeva</surname>
<given-names>OV</given-names>
</name>
,
<name>
<surname>Kang</surname>
<given-names>Y</given-names>
</name>
,
<name>
<surname>Spiridonov</surname>
<given-names>AN</given-names>
</name>
,
<name>
<surname>Saetrom</surname>
<given-names>P</given-names>
</name>
,
<name>
<surname>Nemtsov</surname>
<given-names>VA</given-names>
</name>
,
<name>
<surname>Ogurtsov</surname>
<given-names>AY</given-names>
</name>
,
<name>
<surname>Nechipurenko</surname>
<given-names>YD</given-names>
</name>
,
<name>
<surname>Shabalina</surname>
<given-names>SA</given-names>
</name>
.
<article-title>Optimization of duplex stability and terminal asymmetry for shRNA design</article-title>
.
<source>PLoS One</source>
.
<year>2010</year>
;
<volume>5</volume>
(
<issue>4</issue>
):
<fpage>e10180</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1371/journal.pone.0010180">10.1371/journal.pone.0010180</ext-link>
</comment>
<pub-id pub-id-type="pmid">20422034</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref046">
<label>46</label>
<mixed-citation publication-type="journal">
<name>
<surname>Hadiwikarta</surname>
<given-names>WW</given-names>
</name>
,
<name>
<surname>Carlon</surname>
<given-names>E</given-names>
</name>
,
<name>
<surname>Hooyberghs</surname>
<given-names>J</given-names>
</name>
.
<article-title>Dynamic range extension of hybridization sensors</article-title>
.
<source>Biosens Bioelectron</source>
.
<year>2015</year>
;
<volume>64</volume>
:
<fpage>411</fpage>
<lpage>415</lpage>
.
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.bios.2014.09.043">10.1016/j.bios.2014.09.043</ext-link>
</comment>
<pub-id pub-id-type="pmid">25280340</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0199162.ref047">
<label>47</label>
<mixed-citation publication-type="journal">
<name>
<surname>Cho</surname>
<given-names>H</given-names>
</name>
,
<name>
<surname>Chou</surname>
<given-names>H-H</given-names>
</name>
.
<article-title>Thermodynamically optimal whole-genome tiling microarray design and validation</article-title>
.
<source>BMC Res Notes</source>
.
<year>2016</year>
;
<volume>9</volume>
:
<fpage>305</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1186/s13104-016-2113-4">10.1186/s13104-016-2113-4</ext-link>
</comment>
<pub-id pub-id-type="pmid">27295952</pub-id>
</mixed-citation>
</ref>
</ref-list>
</back>
</pmc>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/Pmc/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001045 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd -nk 001045 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Sante
   |area=    MersV1
   |flux=    Pmc
   |étape=   Corpus
   |type=    RBID
   |clé=     PMC:6013149
   |texte=   Sequence characteristics define trade-offs between on-target and genome-wide off-target hybridization of oligoprobes
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/RBID.i   -Sk "pubmed:29928000" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd   \
       | NlmPubMed2Wicri -a MersV1 

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Apr 20 23:26:43 2020. Site generation: Sat Mar 27 09:06:09 2021