Serveur d'exploration sur les dispositifs haptiques

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

What information is necessary for speech categorization? Harnessing variability in the speech signal by integrating cues computed relative to expectations

Identifieur interne : 001949 ( Ncbi/Curation ); précédent : 001948; suivant : 001950

What information is necessary for speech categorization? Harnessing variability in the speech signal by integrating cues computed relative to expectations

Auteurs : Bob Mcmurray ; Allard Jongman

Source :

RBID : PMC:3523696

Abstract

Most theories of categorization emphasize how continuous perceptual information is mapped to categories. However, equally important is the informational assumptions of a model, the type of information subserving this mapping. This is crucial in speech perception where the signal is variable and context-dependent. This study assessed the informational assumptions of several models of speech categorization, in particular, the number of cues that are the basis of categorization and whether these cues represent the input veridically or have undergone compensation. We collected a corpus of 2880 fricative productions (Jongman, Wayland & Wong, 2000) spanning many talker- and vowel-contexts and measured 24 cues for each. A subset was also presented to listeners in an 8AFC phoneme categorization task. We then trained a common classification model based on logistic regression to categorize the fricative from the cue values, and manipulated the information in the training set to contrast 1) models based on a small number of invariant cues; 2) models using all cues without compensation, and 3) models in which cues underwent compensation for contextual factors. Compensation was modeled by Computing Cues Relative to Expectations (C-CuRE), a new approach to compensation that preserves fine-grained detail in the signal. Only the compensation model achieved a similar accuracy to listeners, and showed the same effects of context. Thus, even simple categorization metrics can overcome the variability in speech when sufficient information is available and compensation schemes like C-CuRE are employed.


Url:
DOI: 10.1037/a0022325
PubMed: 21417542
PubMed Central: 3523696

Links toward previous steps (curation, corpus...)


Links to Exploration step

PMC:3523696

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">What information is necessary for speech categorization? Harnessing variability in the speech signal by integrating cues computed relative to expectations</title>
<author>
<name sortKey="Mcmurray, Bob" sort="Mcmurray, Bob" uniqKey="Mcmurray B" first="Bob" last="Mcmurray">Bob Mcmurray</name>
</author>
<author>
<name sortKey="Jongman, Allard" sort="Jongman, Allard" uniqKey="Jongman A" first="Allard" last="Jongman">Allard Jongman</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">21417542</idno>
<idno type="pmc">3523696</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3523696</idno>
<idno type="RBID">PMC:3523696</idno>
<idno type="doi">10.1037/a0022325</idno>
<date when="2011">2011</date>
<idno type="wicri:Area/Pmc/Corpus">001673</idno>
<idno type="wicri:Area/Pmc/Curation">001673</idno>
<idno type="wicri:Area/Pmc/Checkpoint">001907</idno>
<idno type="wicri:Area/Ncbi/Merge">001949</idno>
<idno type="wicri:Area/Ncbi/Curation">001949</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">What information is necessary for speech categorization? Harnessing variability in the speech signal by integrating cues computed relative to expectations</title>
<author>
<name sortKey="Mcmurray, Bob" sort="Mcmurray, Bob" uniqKey="Mcmurray B" first="Bob" last="Mcmurray">Bob Mcmurray</name>
</author>
<author>
<name sortKey="Jongman, Allard" sort="Jongman, Allard" uniqKey="Jongman A" first="Allard" last="Jongman">Allard Jongman</name>
</author>
</analytic>
<series>
<title level="j">Psychological review</title>
<idno type="ISSN">0033-295X</idno>
<idno type="eISSN">1939-1471</idno>
<imprint>
<date when="2011">2011</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<p id="P1">Most theories of categorization emphasize how continuous perceptual information is mapped to categories. However, equally important is the informational assumptions of a model, the type of information subserving this mapping. This is crucial in speech perception where the signal is variable and context-dependent. This study assessed the informational assumptions of several models of speech categorization, in particular, the number of cues that are the basis of categorization and whether these cues represent the input veridically or have undergone compensation. We collected a corpus of 2880 fricative productions (
<xref rid="R58" ref-type="bibr">Jongman, Wayland & Wong, 2000</xref>
) spanning many talker- and vowel-contexts and measured 24 cues for each. A subset was also presented to listeners in an 8AFC phoneme categorization task. We then trained a common classification model based on logistic regression to categorize the fricative from the cue values, and manipulated the information in the training set to contrast 1) models based on a small number of invariant cues; 2) models using all cues without compensation, and 3) models in which cues underwent compensation for contextual factors. Compensation was modeled by Computing Cues Relative to Expectations (C-CuRE), a new approach to compensation that preserves fine-grained detail in the signal. Only the compensation model achieved a similar accuracy to listeners, and showed the same effects of context. Thus, even simple categorization metrics can overcome the variability in speech when sufficient information is available and compensation schemes like C-CuRE are employed.</p>
</div>
</front>
</TEI>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/HapticV1/Data/Ncbi/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001949 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Ncbi/Curation/biblio.hfd -nk 001949 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    HapticV1
   |flux=    Ncbi
   |étape=   Curation
   |type=    RBID
   |clé=     PMC:3523696
   |texte=   What information is necessary for speech categorization? Harnessing variability in the speech signal by integrating cues computed relative to expectations
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Ncbi/Curation/RBID.i   -Sk "pubmed:21417542" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Ncbi/Curation/biblio.hfd   \
       | NlmPubMed2Wicri -a HapticV1 

Wicri

This area was generated with Dilib version V0.6.23.
Data generation: Mon Jun 13 01:09:46 2016. Site generation: Wed Mar 6 09:54:07 2024