Serveur d'exploration sur les dispositifs haptiques

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Multimodal perceptual organization of speech: Evidence from tone analogs of spoken utterances

Identifieur interne : 001004 ( Pmc/Curation ); précédent : 001003; suivant : 001005

Multimodal perceptual organization of speech: Evidence from tone analogs of spoken utterances

Auteurs : Robert E. Remez [États-Unis] ; Jennifer M. Fellowes [États-Unis] ; David B. Pisoni [États-Unis] ; Winston D. Goh [États-Unis] ; Philip E. Rubin [États-Unis]

Source :

RBID : PMC:3060793

Abstract

Theoretical and practical motives alike have prompted recent investigations of multimodal speech perception. Theoretically, multimodal studies have extended the conceptualization of perceptual organization beyond the familiar modality-bound accounts deriving from Gestalt psychology. Practically, such investigations have been driven by a need to understand the proficiency of multimodal speech perception using an electrocochlear prosthesis for hearing. In each domain, studies have shown that perceptual organization of speech can occur even when the perceiver’s auditory experience departs from natural speech qualities. Accordingly, our research examined auditor-visual multimodal integration of videotaped faces and selected acoustic constituents of speech signals, each realized as a single sinewave tone accompanying a video image of an articulating face. The single tone reproduced the frequency and amplitude of the phonatory cycle or of one of the lower three oral formants. Our results showed a distinct advantage for the condition pairing the video image of the face with a sinewave replicating the second formant, despite its unnatural timbre and its presentation in acoustic isolation from the rest of the speech signal. Perceptual coherence of multimodal speech in these circumstances is established when the two modalities concurrently specify the same underlying phonetic attributes.


Url:
DOI: 10.1016/S0167-6393(98)00050-8
PubMed: 21423823
PubMed Central: 3060793

Links toward previous steps (curation, corpus...)


Links to Exploration step

PMC:3060793

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Multimodal perceptual organization of speech: Evidence from tone analogs of spoken utterances</title>
<author>
<name sortKey="Remez, Robert E" sort="Remez, Robert E" uniqKey="Remez R" first="Robert E." last="Remez">Robert E. Remez</name>
<affiliation wicri:level="1">
<nlm:aff id="A1">Department of Psychology, Barnard College, 3009 Broadway, New York, NY 10027-6598, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Psychology, Barnard College, 3009 Broadway, New York, NY 10027-6598</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Fellowes, Jennifer M" sort="Fellowes, Jennifer M" uniqKey="Fellowes J" first="Jennifer M." last="Fellowes">Jennifer M. Fellowes</name>
<affiliation wicri:level="1">
<nlm:aff id="A2">Columbia University College of Physicians and Surgeons, 630 West 168th Street, New York, NY 10032, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Columbia University College of Physicians and Surgeons, 630 West 168th Street, New York, NY 10032</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Pisoni, David B" sort="Pisoni, David B" uniqKey="Pisoni D" first="David B." last="Pisoni">David B. Pisoni</name>
<affiliation wicri:level="1">
<nlm:aff id="A3">Speech Research Laboratory, Department of Psychology, Indiana University, Bloomington, IN 47405, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Speech Research Laboratory, Department of Psychology, Indiana University, Bloomington, IN 47405</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Goh, Winston D" sort="Goh, Winston D" uniqKey="Goh W" first="Winston D." last="Goh">Winston D. Goh</name>
<affiliation wicri:level="1">
<nlm:aff id="A3">Speech Research Laboratory, Department of Psychology, Indiana University, Bloomington, IN 47405, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Speech Research Laboratory, Department of Psychology, Indiana University, Bloomington, IN 47405</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Rubin, Philip E" sort="Rubin, Philip E" uniqKey="Rubin P" first="Philip E." last="Rubin">Philip E. Rubin</name>
<affiliation wicri:level="1">
<nlm:aff id="A4">Haskins Laboratories and Department of Surgery, Yale University School of Medicine, 270 Crown Street, New Haven, CT 06511, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Haskins Laboratories and Department of Surgery, Yale University School of Medicine, 270 Crown Street, New Haven, CT 06511</wicri:regionArea>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">21423823</idno>
<idno type="pmc">3060793</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3060793</idno>
<idno type="RBID">PMC:3060793</idno>
<idno type="doi">10.1016/S0167-6393(98)00050-8</idno>
<date when="1998">1998</date>
<idno type="wicri:Area/Pmc/Corpus">001004</idno>
<idno type="wicri:Area/Pmc/Curation">001004</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Multimodal perceptual organization of speech: Evidence from tone analogs of spoken utterances</title>
<author>
<name sortKey="Remez, Robert E" sort="Remez, Robert E" uniqKey="Remez R" first="Robert E." last="Remez">Robert E. Remez</name>
<affiliation wicri:level="1">
<nlm:aff id="A1">Department of Psychology, Barnard College, 3009 Broadway, New York, NY 10027-6598, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Psychology, Barnard College, 3009 Broadway, New York, NY 10027-6598</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Fellowes, Jennifer M" sort="Fellowes, Jennifer M" uniqKey="Fellowes J" first="Jennifer M." last="Fellowes">Jennifer M. Fellowes</name>
<affiliation wicri:level="1">
<nlm:aff id="A2">Columbia University College of Physicians and Surgeons, 630 West 168th Street, New York, NY 10032, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Columbia University College of Physicians and Surgeons, 630 West 168th Street, New York, NY 10032</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Pisoni, David B" sort="Pisoni, David B" uniqKey="Pisoni D" first="David B." last="Pisoni">David B. Pisoni</name>
<affiliation wicri:level="1">
<nlm:aff id="A3">Speech Research Laboratory, Department of Psychology, Indiana University, Bloomington, IN 47405, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Speech Research Laboratory, Department of Psychology, Indiana University, Bloomington, IN 47405</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Goh, Winston D" sort="Goh, Winston D" uniqKey="Goh W" first="Winston D." last="Goh">Winston D. Goh</name>
<affiliation wicri:level="1">
<nlm:aff id="A3">Speech Research Laboratory, Department of Psychology, Indiana University, Bloomington, IN 47405, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Speech Research Laboratory, Department of Psychology, Indiana University, Bloomington, IN 47405</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Rubin, Philip E" sort="Rubin, Philip E" uniqKey="Rubin P" first="Philip E." last="Rubin">Philip E. Rubin</name>
<affiliation wicri:level="1">
<nlm:aff id="A4">Haskins Laboratories and Department of Surgery, Yale University School of Medicine, 270 Crown Street, New Haven, CT 06511, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Haskins Laboratories and Department of Surgery, Yale University School of Medicine, 270 Crown Street, New Haven, CT 06511</wicri:regionArea>
</affiliation>
</author>
</analytic>
<series>
<title level="j">Speech communication</title>
<idno type="ISSN">0167-6393</idno>
<imprint>
<date when="1998">1998</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<p id="P1">Theoretical and practical motives alike have prompted recent investigations of multimodal speech perception. Theoretically, multimodal studies have extended the conceptualization of perceptual organization beyond the familiar modality-bound accounts deriving from Gestalt psychology. Practically, such investigations have been driven by a need to understand the proficiency of multimodal speech perception using an electrocochlear prosthesis for hearing. In each domain, studies have shown that perceptual organization of speech can occur even when the perceiver’s auditory experience departs from natural speech qualities. Accordingly, our research examined auditor-visual multimodal integration of videotaped faces and selected acoustic constituents of speech signals, each realized as a single sinewave tone accompanying a video image of an articulating face. The single tone reproduced the frequency and amplitude of the phonatory cycle or of one of the lower three oral formants. Our results showed a distinct advantage for the condition pairing the video image of the face with a sinewave replicating the second formant, despite its unnatural timbre and its presentation in acoustic isolation from the rest of the speech signal. Perceptual coherence of multimodal speech in these circumstances is established when the two modalities concurrently specify the same underlying phonetic attributes.</p>
</div>
</front>
</TEI>
<pmc article-type="research-article" xml:lang="EN">
<pmc-comment>The publisher of this article does not allow downloading of the full text in XML form.</pmc-comment>
<pmc-dir>properties manuscript</pmc-dir>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-journal-id">8309707</journal-id>
<journal-id journal-id-type="pubmed-jr-id">33841</journal-id>
<journal-id journal-id-type="nlm-ta">Speech Commun</journal-id>
<journal-title>Speech communication</journal-title>
<issn pub-type="ppub">0167-6393</issn>
</journal-meta>
<article-meta>
<article-id pub-id-type="pmid">21423823</article-id>
<article-id pub-id-type="pmc">3060793</article-id>
<article-id pub-id-type="doi">10.1016/S0167-6393(98)00050-8</article-id>
<article-id pub-id-type="manuscript">NIHMS278485</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Multimodal perceptual organization of speech: Evidence from tone analogs of spoken utterances</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Remez</surname>
<given-names>Robert E.</given-names>
</name>
<xref ref-type="aff" rid="A1">a</xref>
<xref ref-type="corresp" rid="cor1">*</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Fellowes</surname>
<given-names>Jennifer M.</given-names>
</name>
<xref ref-type="aff" rid="A2">b</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Pisoni</surname>
<given-names>David B.</given-names>
</name>
<xref ref-type="aff" rid="A3">c</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Goh</surname>
<given-names>Winston D.</given-names>
</name>
<xref ref-type="aff" rid="A3">c</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Rubin</surname>
<given-names>Philip E.</given-names>
</name>
<xref ref-type="aff" rid="A4">d</xref>
</contrib>
</contrib-group>
<aff id="A1">
<label>a</label>
Department of Psychology, Barnard College, 3009 Broadway, New York, NY 10027-6598, USA</aff>
<aff id="A2">
<label>b</label>
Columbia University College of Physicians and Surgeons, 630 West 168th Street, New York, NY 10032, USA</aff>
<aff id="A3">
<label>c</label>
Speech Research Laboratory, Department of Psychology, Indiana University, Bloomington, IN 47405, USA</aff>
<aff id="A4">
<label>d</label>
Haskins Laboratories and Department of Surgery, Yale University School of Medicine, 270 Crown Street, New Haven, CT 06511, USA</aff>
<author-notes>
<corresp id="cor1">
<label>*</label>
Corresponding author. Tel.: +1 212 854 4247;
<email>remez@paradise.barnard.columbia.edu</email>
</corresp>
</author-notes>
<pub-date pub-type="nihms-submitted">
<day>8</day>
<month>3</month>
<year>2011</year>
</pub-date>
<pub-date pub-type="ppub">
<day>1</day>
<month>10</month>
<year>1998</year>
</pub-date>
<pub-date pub-type="pmc-release">
<day>18</day>
<month>3</month>
<year>2011</year>
</pub-date>
<volume>26</volume>
<issue>1</issue>
<fpage>65</fpage>
<lpage>73</lpage>
<permissions>
<copyright-statement>© 1998 Published by Elsevier Science B.V. All rights reserved.</copyright-statement>
<copyright-year>1998</copyright-year>
</permissions>
<abstract>
<p id="P1">Theoretical and practical motives alike have prompted recent investigations of multimodal speech perception. Theoretically, multimodal studies have extended the conceptualization of perceptual organization beyond the familiar modality-bound accounts deriving from Gestalt psychology. Practically, such investigations have been driven by a need to understand the proficiency of multimodal speech perception using an electrocochlear prosthesis for hearing. In each domain, studies have shown that perceptual organization of speech can occur even when the perceiver’s auditory experience departs from natural speech qualities. Accordingly, our research examined auditor-visual multimodal integration of videotaped faces and selected acoustic constituents of speech signals, each realized as a single sinewave tone accompanying a video image of an articulating face. The single tone reproduced the frequency and amplitude of the phonatory cycle or of one of the lower three oral formants. Our results showed a distinct advantage for the condition pairing the video image of the face with a sinewave replicating the second formant, despite its unnatural timbre and its presentation in acoustic isolation from the rest of the speech signal. Perceptual coherence of multimodal speech in these circumstances is established when the two modalities concurrently specify the same underlying phonetic attributes.</p>
</abstract>
<kwd-group>
<kwd>Multimodal speech perception</kwd>
<kwd>Perceptual organization</kwd>
<kwd>Auditory-visual speech perception</kwd>
<kwd>Intersensory integration</kwd>
<kwd>Speechreading</kwd>
<kwd>Sinewave speech</kwd>
</kwd-group>
<contract-num rid="DC1">R01 DC000111-25 ||DC</contract-num>
<contract-sponsor id="DC1">National Institute on Deafness and Other Communication Disorders : NIDCD</contract-sponsor>
</article-meta>
</front>
</pmc>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/HapticV1/Data/Pmc/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001004 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Curation/biblio.hfd -nk 001004 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    HapticV1
   |flux=    Pmc
   |étape=   Curation
   |type=    RBID
   |clé=     PMC:3060793
   |texte=   Multimodal perceptual organization of speech: Evidence from tone analogs of spoken utterances
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Curation/RBID.i   -Sk "pubmed:21423823" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Curation/biblio.hfd   \
       | NlmPubMed2Wicri -a HapticV1 

Wicri

This area was generated with Dilib version V0.6.23.
Data generation: Mon Jun 13 01:09:46 2016. Site generation: Wed Mar 6 09:54:07 2024