Incorporation of phonetic constraints in acoustic-to-articulatory inversion.
Internal identifier: 000052 (Ncbi/Merge); previous: 000051; next: 000053
Authors: Blaise Potard [France]; Yves Laprie; Slim Ouni
Source:
- The Journal of the Acoustical Society of America [1520-8524]; 2008.
French descriptors
- KwdFr :
- MESH :
English descriptors
- KwdEn :
- MESH :
Abstract
This study investigates the use of constraints upon articulatory parameters in the context of acoustic-to-articulatory inversion. These speaker independent constraints, referred to as phonetic constraints, were derived from standard phonetic knowledge for French vowels and express authorized domains for one or several articulatory parameters. They were experimented on in an existing inversion framework that utilizes Maeda's articulatory model and a hypercubic articulatory-acoustic table. Phonetic constraints give rise to a phonetic score rendering the phonetic consistency of vocal tract shapes recovered by inversion. Inversion has been applied to vowels articulated by a speaker whose corresponding x-ray images are also available. Constraints were evaluated by measuring the distance between vocal tract shapes recovered through inversion to real vocal tract shapes obtained from x-ray images, by investigating the spreading of inverse solutions in terms of place of articulation and constriction degree, and finally by studying the articulatory variability. Results show that these constraints capture interdependencies and synergies between speech articulators and favor vocal tract shapes close to those realized by the human speaker. In addition, this study also provides how acoustic-to-articulatory inversion can be used to explore acoustical and compensatory articulatory properties of an articulatory model.
DOI: 10.1121/1.2885747
PubMed: 18397035
Links toward previous steps (curation, corpus...)
- to stream PubMed, to step Corpus: 000142
- to stream PubMed, to step Curation: 000142
- to stream PubMed, to step Checkpoint: 000133
Links to Exploration step
pubmed:18397035
The document in XML format
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Incorporation of phonetic constraints in acoustic-to-articulatory inversion.</title>
<author><name sortKey="Potard, Blaise" sort="Potard, Blaise" uniqKey="Potard B" first="Blaise" last="Potard">Blaise Potard</name>
<affiliation wicri:level="3"><nlm:affiliation>Speech Team, LORIA, UMR 7503, BP 239, 54506 Vandoeuvre-lès-Nancy Cedex, France. blaise.potard@loria.fr</nlm:affiliation>
<country xml:lang="fr">France</country>
<wicri:regionArea>Speech Team, LORIA, UMR 7503, BP 239, 54506 Vandoeuvre-lès-Nancy Cedex</wicri:regionArea>
<placeName><region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandoeuvre-lès-Nancy</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Laprie, Yves" sort="Laprie, Yves" uniqKey="Laprie Y" first="Yves" last="Laprie">Yves Laprie</name>
</author>
<author><name sortKey="Ouni, Slim" sort="Ouni, Slim" uniqKey="Ouni S" first="Slim" last="Ouni">Slim Ouni</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PubMed</idno>
<date when="2008">2008</date>
<idno type="doi">10.1121/1.2885747</idno>
<idno type="RBID">pubmed:18397035</idno>
<idno type="pmid">18397035</idno>
<idno type="wicri:Area/PubMed/Corpus">000142</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">000142</idno>
<idno type="wicri:Area/PubMed/Curation">000142</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Curation">000142</idno>
<idno type="wicri:Area/PubMed/Checkpoint">000133</idno>
<idno type="wicri:explorRef" wicri:stream="Checkpoint" wicri:step="PubMed">000133</idno>
<idno type="wicri:Area/Ncbi/Merge">000052</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">Incorporation of phonetic constraints in acoustic-to-articulatory inversion.</title>
<author><name sortKey="Potard, Blaise" sort="Potard, Blaise" uniqKey="Potard B" first="Blaise" last="Potard">Blaise Potard</name>
<affiliation wicri:level="3"><nlm:affiliation>Speech Team, LORIA, UMR 7503, BP 239, 54506 Vandoeuvre-lès-Nancy Cedex, France. blaise.potard@loria.fr</nlm:affiliation>
<country xml:lang="fr">France</country>
<wicri:regionArea>Speech Team, LORIA, UMR 7503, BP 239, 54506 Vandoeuvre-lès-Nancy Cedex</wicri:regionArea>
<placeName><region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandoeuvre-lès-Nancy</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Laprie, Yves" sort="Laprie, Yves" uniqKey="Laprie Y" first="Yves" last="Laprie">Yves Laprie</name>
</author>
<author><name sortKey="Ouni, Slim" sort="Ouni, Slim" uniqKey="Ouni S" first="Slim" last="Ouni">Slim Ouni</name>
</author>
</analytic>
<series><title level="j">The Journal of the Acoustical Society of America</title>
<idno type="eISSN">1520-8524</idno>
<imprint><date when="2008" type="published">2008</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Acoustics</term>
<term>Humans</term>
<term>Phonetics</term>
<term>Speech Perception</term>
<term>Speech Production Measurement</term>
</keywords>
<keywords scheme="KwdFr" xml:lang="fr"><term>Acoustique</term>
<term>Humains</term>
<term>Mesures de production de la parole</term>
<term>Perception de la parole</term>
<term>Phonétique</term>
</keywords>
<keywords scheme="MESH" xml:lang="en"><term>Acoustics</term>
<term>Humans</term>
<term>Phonetics</term>
<term>Speech Perception</term>
<term>Speech Production Measurement</term>
</keywords>
<keywords scheme="MESH" xml:lang="fr"><term>Acoustique</term>
<term>Humains</term>
<term>Mesures de production de la parole</term>
<term>Perception de la parole</term>
<term>Phonétique</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">This study investigates the use of constraints upon articulatory parameters in the context of acoustic-to-articulatory inversion. These speaker independent constraints, referred to as phonetic constraints, were derived from standard phonetic knowledge for French vowels and express authorized domains for one or several articulatory parameters. They were experimented on in an existing inversion framework that utilizes Maeda's articulatory model and a hypercubic articulatory-acoustic table. Phonetic constraints give rise to a phonetic score rendering the phonetic consistency of vocal tract shapes recovered by inversion. Inversion has been applied to vowels articulated by a speaker whose corresponding x-ray images are also available. Constraints were evaluated by measuring the distance between vocal tract shapes recovered through inversion to real vocal tract shapes obtained from x-ray images, by investigating the spreading of inverse solutions in terms of place of articulation and constriction degree, and finally by studying the articulatory variability. Results show that these constraints capture interdependencies and synergies between speech articulators and favor vocal tract shapes close to those realized by the human speaker. In addition, this study also provides how acoustic-to-articulatory inversion can be used to explore acoustical and compensatory articulatory properties of an articulatory model.</div>
</front>
</TEI>
<pubmed><MedlineCitation Owner="NLM" Status="MEDLINE"><PMID Version="1">18397035</PMID>
<DateCreated><Year>2008</Year>
<Month>04</Month>
<Day>09</Day>
</DateCreated>
<DateCompleted><Year>2008</Year>
<Month>05</Month>
<Day>27</Day>
</DateCompleted>
<Article PubModel="Print"><Journal><ISSN IssnType="Electronic">1520-8524</ISSN>
<JournalIssue CitedMedium="Internet"><Volume>123</Volume>
<Issue>4</Issue>
<PubDate><Year>2008</Year>
<Month>Apr</Month>
</PubDate>
</JournalIssue>
<Title>The Journal of the Acoustical Society of America</Title>
<ISOAbbreviation>J. Acoust. Soc. Am.</ISOAbbreviation>
</Journal>
<ArticleTitle>Incorporation of phonetic constraints in acoustic-to-articulatory inversion.</ArticleTitle>
<Pagination><MedlinePgn>2310-23</MedlinePgn>
</Pagination>
<ELocationID EIdType="doi" ValidYN="Y">10.1121/1.2885747</ELocationID>
<Abstract><AbstractText>This study investigates the use of constraints upon articulatory parameters in the context of acoustic-to-articulatory inversion. These speaker independent constraints, referred to as phonetic constraints, were derived from standard phonetic knowledge for French vowels and express authorized domains for one or several articulatory parameters. They were experimented on in an existing inversion framework that utilizes Maeda's articulatory model and a hypercubic articulatory-acoustic table. Phonetic constraints give rise to a phonetic score rendering the phonetic consistency of vocal tract shapes recovered by inversion. Inversion has been applied to vowels articulated by a speaker whose corresponding x-ray images are also available. Constraints were evaluated by measuring the distance between vocal tract shapes recovered through inversion to real vocal tract shapes obtained from x-ray images, by investigating the spreading of inverse solutions in terms of place of articulation and constriction degree, and finally by studying the articulatory variability. Results show that these constraints capture interdependencies and synergies between speech articulators and favor vocal tract shapes close to those realized by the human speaker. In addition, this study also provides how acoustic-to-articulatory inversion can be used to explore acoustical and compensatory articulatory properties of an articulatory model.</AbstractText>
</Abstract>
<AuthorList CompleteYN="Y"><Author ValidYN="Y"><LastName>Potard</LastName>
<ForeName>Blaise</ForeName>
<Initials>B</Initials>
<AffiliationInfo><Affiliation>Speech Team, LORIA, UMR 7503, BP 239, 54506 Vandoeuvre-lès-Nancy Cedex, France. blaise.potard@loria.fr</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y"><LastName>Laprie</LastName>
<ForeName>Yves</ForeName>
<Initials>Y</Initials>
</Author>
<Author ValidYN="Y"><LastName>Ouni</LastName>
<ForeName>Slim</ForeName>
<Initials>S</Initials>
</Author>
</AuthorList>
<Language>eng</Language>
<PublicationTypeList><PublicationType UI="D016428">Journal Article</PublicationType>
<PublicationType UI="D013485">Research Support, Non-U.S. Gov't</PublicationType>
</PublicationTypeList>
</Article>
<MedlineJournalInfo><Country>United States</Country>
<MedlineTA>J Acoust Soc Am</MedlineTA>
<NlmUniqueID>7503051</NlmUniqueID>
<ISSNLinking>0001-4966</ISSNLinking>
</MedlineJournalInfo>
<CitationSubset>IM</CitationSubset>
<MeshHeadingList><MeshHeading><DescriptorName MajorTopicYN="Y" UI="D000162">Acoustics</DescriptorName>
</MeshHeading>
<MeshHeading><DescriptorName MajorTopicYN="N" UI="D006801">Humans</DescriptorName>
</MeshHeading>
<MeshHeading><DescriptorName MajorTopicYN="Y" UI="D010700">Phonetics</DescriptorName>
</MeshHeading>
<MeshHeading><DescriptorName MajorTopicYN="Y" UI="D013067">Speech Perception</DescriptorName>
</MeshHeading>
<MeshHeading><DescriptorName MajorTopicYN="N" UI="D013068">Speech Production Measurement</DescriptorName>
</MeshHeading>
</MeshHeadingList>
</MedlineCitation>
<PubmedData><History><PubMedPubDate PubStatus="pubmed"><Year>2008</Year>
<Month>4</Month>
<Day>10</Day>
<Hour>9</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="medline"><Year>2008</Year>
<Month>5</Month>
<Day>28</Day>
<Hour>9</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="entrez"><Year>2008</Year>
<Month>4</Month>
<Day>10</Day>
<Hour>9</Hour>
<Minute>0</Minute>
</PubMedPubDate>
</History>
<PublicationStatus>ppublish</PublicationStatus>
<ArticleIdList><ArticleId IdType="doi">10.1121/1.2885747</ArticleId>
<ArticleId IdType="pubmed">18397035</ArticleId>
</ArticleIdList>
</PubmedData>
</pubmed>
<affiliations><list><country><li>France</li>
</country>
<region><li>Grand Est</li>
<li>Lorraine (région)</li>
</region>
<settlement><li>Vandoeuvre-lès-Nancy</li>
</settlement>
</list>
<tree><noCountry><name sortKey="Laprie, Yves" sort="Laprie, Yves" uniqKey="Laprie Y" first="Yves" last="Laprie">Yves Laprie</name>
<name sortKey="Ouni, Slim" sort="Ouni, Slim" uniqKey="Ouni S" first="Slim" last="Ouni">Slim Ouni</name>
</noCountry>
<country name="France"><region name="Grand Est"><name sortKey="Potard, Blaise" sort="Potard, Blaise" uniqKey="Potard B" first="Blaise" last="Potard">Blaise Potard</name>
</region>
</country>
</tree>
</affiliations>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Ncbi/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000052 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Ncbi/Merge/biblio.hfd -nk 000052 | SxmlIndent | more
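Outside Dilib, the `<pubmed>` part of the record can also be read with a general-purpose XML parser. The following is a minimal sketch, assuming Python's standard library and a trimmed copy of the record pasted into a string; the function name `summarize_citation` is hypothetical and not part of Dilib. It is restricted to the `<pubmed>` subtree because the TEI part uses `wicri:` and `nlm:` prefixes whose namespace declarations do not appear in the record as shown, so a strict parser may reject that part unless they are added.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the <pubmed> subtree of the record (illustration only).
RECORD = """<pubmed><MedlineCitation Owner="NLM" Status="MEDLINE"><PMID Version="1">18397035</PMID>
<Article PubModel="Print"><Journal><ISSN IssnType="Electronic">1520-8524</ISSN>
<JournalIssue CitedMedium="Internet"><Volume>123</Volume>
<Issue>4</Issue>
</JournalIssue>
<Title>The Journal of the Acoustical Society of America</Title>
</Journal>
<ArticleTitle>Incorporation of phonetic constraints in acoustic-to-articulatory inversion.</ArticleTitle>
<Pagination><MedlinePgn>2310-23</MedlinePgn>
</Pagination>
<AuthorList CompleteYN="Y"><Author ValidYN="Y"><LastName>Potard</LastName>
<ForeName>Blaise</ForeName>
</Author>
<Author ValidYN="Y"><LastName>Laprie</LastName>
<ForeName>Yves</ForeName>
</Author>
<Author ValidYN="Y"><LastName>Ouni</LastName>
<ForeName>Slim</ForeName>
</Author>
</AuthorList>
</Article>
</MedlineCitation>
<PubmedData><ArticleIdList><ArticleId IdType="doi">10.1121/1.2885747</ArticleId>
<ArticleId IdType="pubmed">18397035</ArticleId>
</ArticleIdList>
</PubmedData>
</pubmed>"""


def summarize_citation(xml_text):
    """Return the main bibliographic fields of a <pubmed> element as a dict.

    Hypothetical helper for illustration; it only reads elements that are
    present in the record shown on this page.
    """
    root = ET.fromstring(xml_text)
    article = root.find("MedlineCitation/Article")
    # Build "ForeName LastName" strings in the order given by the record.
    authors = [
        f"{a.findtext('ForeName')} {a.findtext('LastName')}"
        for a in article.findall("AuthorList/Author")
    ]
    return {
        "pmid": root.findtext("MedlineCitation/PMID"),
        "title": article.findtext("ArticleTitle"),
        "journal": article.findtext("Journal/Title"),
        "volume": article.findtext("Journal/JournalIssue/Volume"),
        "pages": article.findtext("Pagination/MedlinePgn"),
        "authors": authors,
        # Select the ArticleId whose IdType attribute is "doi".
        "doi": root.findtext("PubmedData/ArticleIdList/ArticleId[@IdType='doi']"),
    }


if __name__ == "__main__":
    for field, value in summarize_citation(RECORD).items():
        print(f"{field}: {value}")
```

In practice the string would be replaced by the output of one of the `HfdSelect` commands above, reduced to its `<pubmed>` element.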
To add a link to this page in the Wicri network
{{Explor lien |wiki= Wicri/Lorraine |area= InforLorV4 |flux= Ncbi |étape= Merge |type= RBID |clé= pubmed:18397035 |texte= Incorporation of phonetic constraints in acoustic-to-articulatory inversion. }}
To generate wiki pages
HfdIndexSelect -h $EXPLOR_AREA/Data/Ncbi/Merge/RBID.i -Sk "pubmed:18397035" \
| HfdSelect -Kh $EXPLOR_AREA/Data/Ncbi/Merge/biblio.hfd \
| NlmPubMed2Wicri -a InforLorV4
This area was generated with Dilib version V0.6.33.