Server on medical data and libraries in the Maghreb (final version)

Explore, edit and leverage genomic annotations using Python GTF toolkit.

Internal identifier: 000279 (PubMed/Corpus); previous: 000278; next: 000280

Authors: F. Lopez; G. Charbonnier; Y. Kermezli; M. Belhocine; Q. Ferré; N. Zweig; M. Aribi; A. Gonzalez; S. Spicuglia; D. Puthier

Source: Bioinformatics (Oxford, England), 2019

RBID : pubmed:30768152

Abstract

MOTIVATION

While Python has become very popular in bioinformatics, a limited number of libraries exist for fast manipulation of gene coordinates in Ensembl GTF format.

RESULTS

We have developed the GTF toolkit Python package (pygtftk), which aims at providing easy and powerful manipulation of gene coordinates in GTF format. For optimal performance, the core engine of pygtftk is a C dynamic library (libgtftk), while the Python API provides usability and readability for developing scripts. Based on this Python package, we have developed the gtftk command line interface that contains 57 sub-commands (v0.9.10) to ease handling of GTF files. These commands may be used to (i) perform basic tasks (e.g. selections, insertions, updates or deletions of features/keys), (ii) select genes/transcripts based on various criteria (e.g. size, exon number, transcription start site location, intron length, GO terms) or (iii) carry out more advanced operations such as coverage analyses of genomic features using bigWig files to create faceted read-coverage diagrams. In conclusion, the pygtftk package greatly simplifies the annotation of GTF files with external information while providing advanced tools to perform gene analyses.

AVAILABILITY AND IMPLEMENTATION

pygtftk and gtftk have been tested on Linux and MacOSX and are available from https://github.com/dputhier/pygtftk under the MIT license. The libgtftk dynamic library written in C is available from https://github.com/dputhier/libgtftk.
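
The kind of selection described in the Results (for example, keeping transcripts by exon number) can be illustrated with a minimal standard-library sketch. It does not use pygtftk or libgtftk and makes no claim about their API. The function names `parse_gtf_line` and `transcripts_with_min_exons` and the toy records are assumptions made for illustration only. The sketch also assumes the common GTF layout of nine tab-separated columns, with `key "value";` pairs in the last column.

```python
import re
from collections import Counter

# Matches one quoted attribute pair such as: gene_id "G1";
# (assumption: every attribute value is double-quoted)
ATTRIBUTE_RE = re.compile(r'(\S+) "([^"]*)"')


def parse_gtf_line(line):
    """Parse one GTF line into a dict holding a few fields and its attributes."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 9:
        raise ValueError("expected 9 tab-separated columns")
    seqid, _source, feature, start, end, _score, strand, _frame, attrs = fields
    return {
        "seqid": seqid,
        "feature": feature,
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "attributes": dict(ATTRIBUTE_RE.findall(attrs)),
    }


def transcripts_with_min_exons(lines, min_exons):
    """Return sorted transcript IDs that have at least `min_exons` exon records."""
    counts = Counter()
    for line in lines:
        # Skip blank lines and header/comment lines
        if not line.strip() or line.startswith("#"):
            continue
        record = parse_gtf_line(line)
        if record["feature"] == "exon":
            counts[record["attributes"]["transcript_id"]] += 1
    return sorted(tid for tid, n in counts.items() if n >= min_exons)


# Toy records (hypothetical, not taken from any real annotation)
EXAMPLE_GTF = [
    "#!toy-annotation",
    'chr1\ttoy\texon\t100\t200\t.\t+\t.\tgene_id "G1"; transcript_id "T1";',
    'chr1\ttoy\texon\t300\t400\t.\t+\t.\tgene_id "G1"; transcript_id "T1";',
    'chr1\ttoy\texon\t900\t950\t.\t-\t.\tgene_id "G2"; transcript_id "T2";',
]

print(transcripts_with_min_exons(EXAMPLE_GTF, 2))  # prints ['T1']
```

A pure-Python loop like this is only meant to show the data model. The abstract's point is that delegating such work to a C library is faster on full-size annotation files.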


DOI: 10.1093/bioinformatics/btz116
PubMed: 30768152

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Explore, edit and leverage genomic annotations using Python GTF toolkit.</title>
<author>
<name sortKey="Lopez, F" sort="Lopez, F" uniqKey="Lopez F" first="F" last="Lopez">F. Lopez</name>
<affiliation>
<nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Charbonnier, G" sort="Charbonnier, G" uniqKey="Charbonnier G" first="G" last="Charbonnier">G. Charbonnier</name>
<affiliation>
<nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Kermezli, Y" sort="Kermezli, Y" uniqKey="Kermezli Y" first="Y" last="Kermezli">Y. Kermezli</name>
<affiliation>
<nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
</affiliation>
<affiliation>
<nlm:affiliation>The Laboratory of Applied Molecular Biology and Immunology, Tlemcen University, Algeria.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Belhocine, M" sort="Belhocine, M" uniqKey="Belhocine M" first="M" last="Belhocine">M. Belhocine</name>
<affiliation>
<nlm:affiliation>Molecular Biology and Genetics Laboratory, Dubai, United Arab Emirates.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Ferre, Q" sort="Ferre, Q" uniqKey="Ferre Q" first="Q" last="Ferré">Q. Ferré</name>
<affiliation>
<nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Zweig, N" sort="Zweig, N" uniqKey="Zweig N" first="N" last="Zweig">N. Zweig</name>
<affiliation>
<nlm:affiliation>Aix Marseille Univ.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Aribi, M" sort="Aribi, M" uniqKey="Aribi M" first="M" last="Aribi">M. Aribi</name>
<affiliation>
<nlm:affiliation>The Laboratory of Applied Molecular Biology and Immunology, Tlemcen University, Algeria.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Gonzalez, A" sort="Gonzalez, A" uniqKey="Gonzalez A" first="A" last="Gonzalez">A. Gonzalez</name>
<affiliation>
<nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Spicuglia, S" sort="Spicuglia, S" uniqKey="Spicuglia S" first="S" last="Spicuglia">S. Spicuglia</name>
<affiliation>
<nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
</affiliation>
<affiliation>
<nlm:affiliation>Equipe Labellisée LIGUE contre le Cancer.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Puthier, D" sort="Puthier, D" uniqKey="Puthier D" first="D" last="Puthier">D. Puthier</name>
<affiliation>
<nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PubMed</idno>
<date when="2019">2019</date>
<idno type="RBID">pubmed:30768152</idno>
<idno type="pmid">30768152</idno>
<idno type="doi">10.1093/bioinformatics/btz116</idno>
<idno type="wicri:Area/PubMed/Corpus">000279</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">000279</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Explore, edit and leverage genomic annotations using Python GTF toolkit.</title>
<author>
<name sortKey="Lopez, F" sort="Lopez, F" uniqKey="Lopez F" first="F" last="Lopez">F. Lopez</name>
<affiliation>
<nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Charbonnier, G" sort="Charbonnier, G" uniqKey="Charbonnier G" first="G" last="Charbonnier">G. Charbonnier</name>
<affiliation>
<nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Kermezli, Y" sort="Kermezli, Y" uniqKey="Kermezli Y" first="Y" last="Kermezli">Y. Kermezli</name>
<affiliation>
<nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
</affiliation>
<affiliation>
<nlm:affiliation>The Laboratory of Applied Molecular Biology and Immunology, Tlemcen University, Algeria.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Belhocine, M" sort="Belhocine, M" uniqKey="Belhocine M" first="M" last="Belhocine">M. Belhocine</name>
<affiliation>
<nlm:affiliation>Molecular Biology and Genetics Laboratory, Dubai, United Arab Emirates.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Ferre, Q" sort="Ferre, Q" uniqKey="Ferre Q" first="Q" last="Ferré">Q. Ferré</name>
<affiliation>
<nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Zweig, N" sort="Zweig, N" uniqKey="Zweig N" first="N" last="Zweig">N. Zweig</name>
<affiliation>
<nlm:affiliation>Aix Marseille Univ.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Aribi, M" sort="Aribi, M" uniqKey="Aribi M" first="M" last="Aribi">M. Aribi</name>
<affiliation>
<nlm:affiliation>The Laboratory of Applied Molecular Biology and Immunology, Tlemcen University, Algeria.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Gonzalez, A" sort="Gonzalez, A" uniqKey="Gonzalez A" first="A" last="Gonzalez">A. Gonzalez</name>
<affiliation>
<nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Spicuglia, S" sort="Spicuglia, S" uniqKey="Spicuglia S" first="S" last="Spicuglia">S. Spicuglia</name>
<affiliation>
<nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
</affiliation>
<affiliation>
<nlm:affiliation>Equipe Labellisée LIGUE contre le Cancer.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Puthier, D" sort="Puthier, D" uniqKey="Puthier D" first="D" last="Puthier">D. Puthier</name>
<affiliation>
<nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
</affiliation>
</author>
</analytic>
<series>
<title level="j">Bioinformatics (Oxford, England)</title>
<idno type="eISSN">1367-4811</idno>
<imprint>
<date when="2019" type="published">2019</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Computational Biology (MeSH)</term>
<term>Genomics (MeSH)</term>
<term>Software (MeSH)</term>
</keywords>
<keywords scheme="MESH" xml:lang="en">
<term>Computational Biology</term>
<term>Genomics</term>
<term>Software</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<p>
<b>MOTIVATION</b>
</p>
<p>While Python has become very popular in bioinformatics, a limited number of libraries exist for fast manipulation of gene coordinates in Ensembl GTF format.</p>
</div>
<div type="abstract" xml:lang="en">
<p>
<b>RESULTS</b>
</p>
<p>We have developed the GTF toolkit Python package (pygtftk), which aims at providing easy and powerful manipulation of gene coordinates in GTF format. For optimal performance, the core engine of pygtftk is a C dynamic library (libgtftk), while the Python API provides usability and readability for developing scripts. Based on this Python package, we have developed the gtftk command line interface that contains 57 sub-commands (v0.9.10) to ease handling of GTF files. These commands may be used to (i) perform basic tasks (e.g. selections, insertions, updates or deletions of features/keys), (ii) select genes/transcripts based on various criteria (e.g. size, exon number, transcription start site location, intron length, GO terms) or (iii) carry out more advanced operations such as coverage analyses of genomic features using bigWig files to create faceted read-coverage diagrams. In conclusion, the pygtftk package greatly simplifies the annotation of GTF files with external information while providing advanced tools to perform gene analyses.</p>
</div>
<div type="abstract" xml:lang="en">
<p>
<b>AVAILABILITY AND IMPLEMENTATION</b>
</p>
<p>pygtftk and gtftk have been tested on Linux and MacOSX and are available from https://github.com/dputhier/pygtftk under the MIT license. The libgtftk dynamic library written in C is available from https://github.com/dputhier/libgtftk.</p>
</div>
</front>
</TEI>
<pubmed>
<MedlineCitation Status="MEDLINE" IndexingMethod="Curated" Owner="NLM">
<PMID Version="1">30768152</PMID>
<DateCompleted>
<Year>2020</Year>
<Month>06</Month>
<Day>10</Day>
</DateCompleted>
<DateRevised>
<Year>2020</Year>
<Month>06</Month>
<Day>10</Day>
</DateRevised>
<Article PubModel="Print">
<Journal>
<ISSN IssnType="Electronic">1367-4811</ISSN>
<JournalIssue CitedMedium="Internet">
<Volume>35</Volume>
<Issue>18</Issue>
<PubDate>
<Year>2019</Year>
<Month>09</Month>
<Day>15</Day>
</PubDate>
</JournalIssue>
<Title>Bioinformatics (Oxford, England)</Title>
<ISOAbbreviation>Bioinformatics</ISOAbbreviation>
</Journal>
<ArticleTitle>Explore, edit and leverage genomic annotations using Python GTF toolkit.</ArticleTitle>
<Pagination>
<MedlinePgn>3487-3488</MedlinePgn>
</Pagination>
<ELocationID EIdType="doi" ValidYN="Y">10.1093/bioinformatics/btz116</ELocationID>
<Abstract>
<AbstractText Label="MOTIVATION">While Python has become very popular in bioinformatics, a limited number of libraries exist for fast manipulation of gene coordinates in Ensembl GTF format.</AbstractText>
<AbstractText Label="RESULTS">We have developed the GTF toolkit Python package (pygtftk), which aims at providing easy and powerful manipulation of gene coordinates in GTF format. For optimal performance, the core engine of pygtftk is a C dynamic library (libgtftk), while the Python API provides usability and readability for developing scripts. Based on this Python package, we have developed the gtftk command line interface that contains 57 sub-commands (v0.9.10) to ease handling of GTF files. These commands may be used to (i) perform basic tasks (e.g. selections, insertions, updates or deletions of features/keys), (ii) select genes/transcripts based on various criteria (e.g. size, exon number, transcription start site location, intron length, GO terms) or (iii) carry out more advanced operations such as coverage analyses of genomic features using bigWig files to create faceted read-coverage diagrams. In conclusion, the pygtftk package greatly simplifies the annotation of GTF files with external information while providing advanced tools to perform gene analyses.</AbstractText>
<AbstractText Label="AVAILABILITY AND IMPLEMENTATION">pygtftk and gtftk have been tested on Linux and MacOSX and are available from https://github.com/dputhier/pygtftk under the MIT license. The libgtftk dynamic library written in C is available from https://github.com/dputhier/libgtftk.</AbstractText>
<CopyrightInformation>© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.</CopyrightInformation>
</Abstract>
<AuthorList CompleteYN="Y">
<Author ValidYN="Y">
<LastName>Lopez</LastName>
<ForeName>F</ForeName>
<Initials>F</Initials>
<AffiliationInfo>
<Affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y">
<LastName>Charbonnier</LastName>
<ForeName>G</ForeName>
<Initials>G</Initials>
<AffiliationInfo>
<Affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y">
<LastName>Kermezli</LastName>
<ForeName>Y</ForeName>
<Initials>Y</Initials>
<AffiliationInfo>
<Affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</Affiliation>
</AffiliationInfo>
<AffiliationInfo>
<Affiliation>The Laboratory of Applied Molecular Biology and Immunology, Tlemcen University, Algeria.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y">
<LastName>Belhocine</LastName>
<ForeName>M</ForeName>
<Initials>M</Initials>
<AffiliationInfo>
<Affiliation>Molecular Biology and Genetics Laboratory, Dubai, United Arab Emirates.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y">
<LastName>Ferré</LastName>
<ForeName>Q</ForeName>
<Initials>Q</Initials>
<AffiliationInfo>
<Affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y">
<LastName>Zweig</LastName>
<ForeName>N</ForeName>
<Initials>N</Initials>
<AffiliationInfo>
<Affiliation>Aix Marseille Univ.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y">
<LastName>Aribi</LastName>
<ForeName>M</ForeName>
<Initials>M</Initials>
<AffiliationInfo>
<Affiliation>The Laboratory of Applied Molecular Biology and Immunology, Tlemcen University, Algeria.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y">
<LastName>Gonzalez</LastName>
<ForeName>A</ForeName>
<Initials>A</Initials>
<AffiliationInfo>
<Affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y">
<LastName>Spicuglia</LastName>
<ForeName>S</ForeName>
<Initials>S</Initials>
<AffiliationInfo>
<Affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</Affiliation>
</AffiliationInfo>
<AffiliationInfo>
<Affiliation>Equipe Labellisée LIGUE contre le Cancer.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y">
<LastName>Puthier</LastName>
<ForeName>D</ForeName>
<Initials>D</Initials>
<AffiliationInfo>
<Affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</Affiliation>
</AffiliationInfo>
</Author>
</AuthorList>
<Language>eng</Language>
<PublicationTypeList>
<PublicationType UI="D016428">Journal Article</PublicationType>
<PublicationType UI="D013485">Research Support, Non-U.S. Gov't</PublicationType>
</PublicationTypeList>
</Article>
<MedlineJournalInfo>
<Country>England</Country>
<MedlineTA>Bioinformatics</MedlineTA>
<NlmUniqueID>9808944</NlmUniqueID>
<ISSNLinking>1367-4803</ISSNLinking>
</MedlineJournalInfo>
<CitationSubset>IM</CitationSubset>
<MeshHeadingList>
<MeshHeading>
<DescriptorName UI="D019295" MajorTopicYN="N">Computational Biology</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D023281" MajorTopicYN="Y">Genomics</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D012984" MajorTopicYN="Y">Software</DescriptorName>
</MeshHeading>
</MeshHeadingList>
</MedlineCitation>
<PubmedData>
<History>
<PubMedPubDate PubStatus="received">
<Year>2018</Year>
<Month>11</Month>
<Day>12</Day>
</PubMedPubDate>
<PubMedPubDate PubStatus="revised">
<Year>2019</Year>
<Month>01</Month>
<Day>15</Day>
</PubMedPubDate>
<PubMedPubDate PubStatus="accepted">
<Year>2019</Year>
<Month>02</Month>
<Day>13</Day>
</PubMedPubDate>
<PubMedPubDate PubStatus="pubmed">
<Year>2019</Year>
<Month>2</Month>
<Day>16</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="medline">
<Year>2020</Year>
<Month>6</Month>
<Day>11</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="entrez">
<Year>2019</Year>
<Month>2</Month>
<Day>16</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
</History>
<PublicationStatus>ppublish</PublicationStatus>
<ArticleIdList>
<ArticleId IdType="pubmed">30768152</ArticleId>
<ArticleId IdType="pii">5320559</ArticleId>
<ArticleId IdType="doi">10.1093/bioinformatics/btz116</ArticleId>
</ArticleIdList>
</PubmedData>
</pubmed>
</record>
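
As a hedged illustration of how the `<pubmed>` part of such a record can be read programmatically, the sketch below applies Python's standard `xml.etree.ElementTree` to a shortened excerpt that keeps only two of the ten authors. The `summarize` helper and the `PUBMED_FRAGMENT` name are hypothetical. The full record uses the `nlm:` and `wicri:` prefixes without declaring them, so a namespace-aware parser would likely reject it as-is. The excerpt avoids those elements.

```python
import xml.etree.ElementTree as ET

# Shortened excerpt of the <pubmed> element (assumption: only a subset of
# the record's elements and authors is kept for brevity)
PUBMED_FRAGMENT = """<pubmed>
<MedlineCitation Status="MEDLINE" Owner="NLM">
<PMID Version="1">30768152</PMID>
<Article PubModel="Print">
<ArticleTitle>Explore, edit and leverage genomic annotations using Python GTF toolkit.</ArticleTitle>
<AuthorList CompleteYN="Y">
<Author ValidYN="Y">
<LastName>Lopez</LastName>
<ForeName>F</ForeName>
<Initials>F</Initials>
</Author>
<Author ValidYN="Y">
<LastName>Puthier</LastName>
<ForeName>D</ForeName>
<Initials>D</Initials>
</Author>
</AuthorList>
</Article>
</MedlineCitation>
</pubmed>"""


def summarize(xml_text):
    """Extract the PMID, title and author names from a <pubmed> element."""
    root = ET.fromstring(xml_text)
    authors = [
        f"{author.findtext('ForeName')} {author.findtext('LastName')}"
        for author in root.iter("Author")
    ]
    return {
        "pmid": root.findtext("MedlineCitation/PMID"),
        "title": root.findtext("MedlineCitation/Article/ArticleTitle"),
        "authors": authors,
    }


summary = summarize(PUBMED_FRAGMENT)
print(summary["pmid"], summary["authors"])
```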

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Sante/explor/MaghrebDataLibMedV2/Data/PubMed/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000279 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/PubMed/Corpus/biblio.hfd -nk 000279 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Wicri/Sante
   |area=    MaghrebDataLibMedV2
   |flux=    PubMed
   |étape=   Corpus
   |type=    RBID
   |clé=     pubmed:30768152
   |texte=   Explore, edit and leverage genomic annotations using Python GTF toolkit.
}}

To generate wiki pages

HfdIndexSelect -h $EXPLOR_AREA/Data/PubMed/Corpus/RBID.i   -Sk "pubmed:30768152" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/PubMed/Corpus/biblio.hfd   \
       | NlmPubMed2Wicri -a MaghrebDataLibMedV2 

This area was generated with Dilib version V0.6.38.
Data generation: Wed Jun 30 18:27:05 2021. Site generation: Wed Jun 30 18:34:21 2021