Fundación Duques de Soria
Curso de Industrias de la Lengua
"Proyectos actuales en procesamiento
del lenguaje natural"
16 de julio de 1998
Corpus orales para la fonética y las tecnologías del habla
Joaquim Llisterri
CHURCH, K.W.- MERCER, R.L. (1993) "Introduction to the Special Issue on Computational Linguistics Using Large Corpora", Computational Linguistics 19,1: 1-24.
LLISTERRI, J. (1996)
Preliminary Recommendations on
Spoken Texts. EAGLES Document EAG-TCWG-STP/P,
May 1996. Publicación
electrónica en
http://www.ilc.cnr.it/EAGLES96/spokentx/spokentx.html
MOORE, R.K. (1991) "User Needs in Speech Research", Proceedings of the Workshop on European Textual Corpora. Pisa, Italy, 1991.
PAYNE, J. (1992) Speaking the Same Language? Listening to the Speech Community. Working Paper, Cobuild, Birmingham, NERC-132
POLS, L. C. W. (1987) "Speech Technology and Corpus Linguistics", in W. MEIJS (Ed.), Corpus Linguistics and Beyond. Proceedings of the Seventh International Conference on English Language Research on Computerized Corpora.Amsterdam: Rodopi.
SINCLAIR, J. (1996)
Preliminary Recommendations on
Corpus Typology. EAGLES Document
EAG-TCWG-CTYP/P, May 1996. Publicación
electrónica en
http://www.ilc.cnr.it/EAGLES96/corpustyp/corpustyp.html
CASTAGNERI, G. (Ed) (1991) Proceedings of the Workshop on International Cooperation and Standardization of Speech Databases and Speech i/O Assessment Methods. Chiavari 26-28 September 1991 (Italy). Organized by CSELT in cooperation with CEC DGXIII, ESCA, ESPRIT PROJECT 2589 (SAM)
GIBBON, D. - MOORE, R.- WINSKI,
R. (Eds.) (1998) Spoken
Language Systems and Corpus Design .Berlin:
Mouton De Gruyter. (Handbook
of Standards and Resources for Spoken Language
Systems, Volume I). Publicación
electrónica en
http://coral.lili.uni-bielefeld.de/EAGLES/
LAMEL, L.- COLE, R.A. (1997) "Spoken Language Corpora", in COLE, R.A.- MARIANI, J.- USZKOREIT, H.- ZAENEN, A.- ZUE, V. (Eds) Survey of the State of the Art in Human Language Technology. Cambridge: Cambridge University Press. pp. 450-454. Publicación electrónica en http://cslu.cse.ogi.edu/HLTsurvey/HLTsurvey.html
POLS, L.C.W. (Ed) (1990) Speech Input / Output Assessment and Speech Databases, Special Issue, Speech Communication 9,4.
ELRA Catalogue - Speech and
Related Resources
http://catalog.elra.info/
LDC, Linguistic Data
Consortium
http://www.ldc.upenn.edu/
LDC Catalogue - Speech
Corpora
http://www.ldc.upenn.edu/Catalog/byType.jsp#speech
COCOSDA, International
Coordination Committee on Speech
Databases and Speech I/O Assessment
http://www.cocosda.org/
NESCA, the ESCA Newsletter
http://archives.limsi.fr/WkG/Nesca//
EAGLES Spoken Language Working
Group
http://coral2.spectrum.uni-bielefeld.de:/~gibbon//gibbon_handbook_1997/index.html
GIBBON, D. - MOORE, R.- WINSKI,
R. (Eds.) (1998) Spoken
Language Systems and Corpus Design. Berlin:
Mouton De Gruyter. (Handbook
of Standards and Resources for Spoken Language
Systems, Volume I). Publicación
electrónica en
http://coral.lili.uni-bielefeld.de/EAGLES/
GIBBON, D. - MOORE, R.- WINSKI,
R. (Eds.) (1998) Spoken
Language Reference Materials. Berlin: Mouton
De Gruyter. (Handbook of
Standards and Resources for Spoken Language
Systems, Volume IV). Publicación
electrónica en
http://coral.lili.uni-bielefeld.de/EAGLES/
ESKÉNAZI, M. (1993) "Trends in Speaking Styles Research" inEurospeech'93. 3rd European Conference on Speech Communication and Technology. Berlin, Germany, 21-23 September 1993. Vol. 1 pp. 501-512
LIÉNARD, J.-S. (1995) "From speech variability to pattern processing: a non-reductive view of speech processing", in SORIN, C.- MARIANI, J.- MELONI, H.- SCHOENTGEN, J. (Eds.) Levels in Speech Communication. Relations and Interactions. A Tribute to Max Wajskop / Hommage à Max Wajskop. Amsterdam: Elsevier Science B.V. pp. 137-148
LLISTERRI, J. (1992) "Speaking Styles in Speech Research", ELSNET/SALT/ESCA Workshop Integrating Speech and Natural Language , University College Dublin, 15-17 July 1992. pp. 17-37.
POLS, L.C.W. (1986) "Variation and Interaction in Speech", in PERKELL, J.S.- KLATT, D.H. (Eds) Invariance and Variability in Speech Processes. Hillsdale: Lawrence Erlbaum. pp. 140-154
GIBBON, D. - MOORE, R.- WINSKI,
R. (Eds.) (1998) Spoken
Language Systems and Corpus Design. Berlin:
Mouton De Gruyter. (Handbook
of Standards and Resources for Spoken Language
Systems, Volume I). Publicación
electrónica en
http://coral.lili.uni-bielefeld.de/EAGLES/
HELFRICH, H. (1979) "Age markers in speech", in SCHERER, K.R. - GILES, H. (Eds) Social Markers in Speech. Cambridge- Paris: Cambridge University Press - Editions de la Maisons des Sciences de l'Homme. pp. 63-108
LAVER, J.- TRUDGILL, P. (1979) "Phonetic and linguistic markers in speech", in SCHERER, K.R. - GILES, H. (Eds) Social Markers in Speech. Cambridge- Paris: Cambridge University Press - Editions de la Maisons des Sciences de l'Homme. pp. 1-32; in LAVER, J. (1991) The Gift of Speech. Papers in the Analysis of Speech and Voice. Edinburgh: Edinburgh University Press. pp. 235-264
MILLAR, J.B.- HAWKINS, S.R. (1990) "Selecting representative speakers" Proceedings of the Tutorial and Research Workshop on Speaker Characterization in Speech Technology. Edinburgh, 26-28 June. Edinburgh: Center for Speech Technology Research.pp.161-166
ROBINSON, W.P. (1979) "Speech markers and social class", in SCHERER, K.R. - GILES, H. (Eds) Social Markers in Speech. Cambridge- Paris: Cambridge University Press - Editions de la Maisons des Sciences de l'Homme. pp. 211-250
SACHS, J.- LIEBERMAN, P.- ERIKSON, D. (1973) "Anatomical and cultural determinants of male and female speech" in R. SHUY - R. FASOLD (Eds) Language Attitudes: Current Trends and Prospects. Washington: Georgetown University Press.pp. 74-84.
SCHERER, K. R. (1979) "Personality markers in speech", in SCHERER, K.R. - GILES, H. (Eds) Social Markers in Speech. Cambridge- Paris: Cambridge University Press - Editions de la Maisons des Sciences de l'Homme. pp. 147-210
SMITH, P.M. (1979) "Sex markers in speech", in SCHERER, K.R. - GILES, H. (Eds) Social Markers in Speech. Cambridge- Paris: Cambridge University Press - Editions de la Maisons des Sciences de l'Homme. pp. 109-146
GIBBON, D. - MOORE, R.- WINSKI,
R. (Eds.) (1998) Spoken
Language Reference Materials. Berlin: Mouton
De Gruyter. (Handbook of
Standards and Resources for Spoken Language
Systems, Volume IV). Publicación
electrónica en
http://coral.lili.uni-bielefeld.de/EAGLES/
RIBEIRO, C.- TRANCOSO, I. - SERRALHEIRO, A. (1993) "A software tool for Speech Collection, Recognition and Reproduction" in Eurospeech'93. 3rd European Conference on Speech Communication and Technology. Berlin, Germany, 21-23 September 1993. Vol. 1 pp. 179-182
UCL (1992) "Speech acquisition and Annotation Protocols and Index of Mnemonics (SAM-UCL-018)" in SAM User Guide to ETR Tools. ESPRIT PROJECT 2589 (SAM) Multilingual Speech Input/Output Assessment, Methodology and Standardisation. Ref, SAM-UCL-G007.
ZANTEN, E. van- DAMEN, L.W.M. - HOUTEN, E. van "Collecting data for a speech database", in HEUVEN, V.J. van - POLS, L.C.W. (Eds) Analysis and synthesis of speech. Strategic research towards high quality text-to-speech generation. Berlin: Mouton de Gruyter (Speech Research Series)
ZEILIGER, J.- SERIGNAT, J.F. (1991) "Europec software V.4.1 User's Guide (SAM-ICP-045)" in SAM User Guide to ETR Tools .ESPRIT PROJECT 2589 (SAM) Multilingual Speech Input/Output Assessment, Methodology and Standardisation. Ref, SAM-UCL-G007, 1992
Map Task Corpus
http://www.hcrc.ed.ac.uk/maptask/
FRASER, N.- GILBERT, G.N. (1991) "Simulating speech systems", Computer Speech and Language 5,1: 81-99
PÉAN, V.- WILLIAMS, S.- ESKÉNAZI, M. (1993) "The Design and Recording of ICY, a Corpus for the Study of Intraspeaker Variability" in Eurospeech'93. 3rd European Conference on Speech Communication and Technology. Berlin, Germany, 21-23 September 1993. Vol. 1 pp. 627-630
SWERTS, M.- COLLIER, R. (1992) "On the controlled ellicitation of spontaneous speech", Speech Communication 11, 4-5: 463-468
LLISTERRI, J. (1997)
Transcripción, etiquetado
y codificación de corpus
orales. Fundación Duques de Soria,
Seminario de Industrias de la Lengua, 15 de julio
de 1997. Publicación
electrónica en
http://liceu.uab.cat/~joaquim/publicacions/FDS97.html
WELLS, J.C. (1989) "Computer-coded phonemic notation of individual languages of the European Community", Journal of the International Phonetic Association 19,1: 31-54
WELLS, J.C. (1994)
"Computer-coding the IPA: a proposed
extension of SAMPA", Speech, Hearing and
Language, Work in Progress,
1994 (University College London, Department
of Phonetics and Linguistics)
8: 271-289. Publicación electrónica
en
http://www.phon.ucl.ac.uk/home/sampa/x-sampa.htm
WELLS, J.C. (1995) SAMPA
Computer Readable Phonetic
Alphabet. Publicación
electrónica en
http://www.phon.ucl.ac.uk/home/sampa/home.htm
WELLS, J.C. (1995) SAMPROSA
(SAM Prosodic Transcription).
Publicación electrónica en
http://www.phon.ucl.ac.uk/home/sampa/samprosa.htm
BARRY, W.J.- FOURCIN, A.J. (1992) "Levels of Labelling", Computer Speech and Language 6: 1-14
MARCHAL, A.- NGUYEN, N.- HARDCASTLE, W. (1995) "Multitiered phonetic approach to speech labelling", in SORIN, C.- MARIANI, J.- MELONI, H.- SCHOENTGEN, J. (Eds.) Levels in Speech Communication. Relations and Interactions. A Tribute to Max Wajskop / Hommage à Max Wajskop. Amsterdam: Elsevier Science B.V. pp. 149-158
ROACH, P.- ROACH, H.- DEW, A.- ROWLANDS, P. (1990) "Phonetic analysis and the automatic segmentation and labeling of speech sounds", Journal of the International Phonetic Association 20,1: 15-21
TILLMANN, H.G.- POMPINO-MARSCHALL, B. (1993) "Theoretical Principles Concerning Segmentation, Labelling Strategies and Levels of Categorical Annotation for Spoken Language Database Systems" in Eurospeech'93. 3rd European Conference on Speech Communication and Technology. Berlin, Germany, 21-23 September 1993. Vol. 3 pp. 1691-1694
VORSTERMANS, A.- MARTENS, J.-P.-
VAN COILE, B. (1996)
"Automatic segmentation and labelling of
multilingual speech data", Speech
Communication 19,4: 271-294.
Guías de
transcripción y etiquetado
BECKMAN, M.E. - AYERS, G.M.
(1994) Guidelines for
ToBI Labelling. Version 2.0, February 1994.
Publicación electrónica
en
http://www.ling.ohio-state.edu/~tobi/
KEATING, P.- MacEACHERN, P.- SHRYOCK, A.- DOMINGUEZ, S. (1994) " A manual for phonetic transcription: Segmentation and labelling of words in spontaneous speech", UCLA Working Papers in Phonetics 88: 91-120.
KROT, C.- TAYLOR, B. (1995)
Criteria for Acoustic-Phonetic
Segmentation and Word Labelling in the Australian
National Database of
Spoken Language. Publicación
electrónica en
http://andosl.anu.edu.au/andosl/general_info/aue_criteria.html
LANDER, T. (1997) The CSLU
Labeling Guide. Center
for Spoken Language Understanding, Oregon
Graduate Institute. Publicación
electrónica en
http://ogi.edu/bme/cslu/corpora/docs/labeling.pdf
LLISTERRI, J.- GARRIDO
ALMIÑANA, J.M. (1998) "La
ingeniería lingüística en
España", in El
español en el mundo. Anuario del Instituto
Cervantes. 1988.
Madrid: Instituto Cervantes - Arco/Libros SL.,
1998. pp. 299-391. Publicación
electrónica en
http://cvc.cervantes.es/obref/anuario/parte2/cap3/indice.htm
SEPLN-OEIL (1997) Grupos de investigación en procesamiento del lenguaje y del habla en España, 1997. Sociedad Española para el Procesamiento del Lenguaje Natural - Observatorio Español de Industrias de la Lengua, Instituto Cervantes.
Departamento
de Señales, Sistemas y
Radiocomunicaciones, Escuela Técnica
Superior de Ingenieros de
Telecomunicación, Universidad
Politécnica
de Madrid
http://www.ssr.upm.es/
GPySC, Grupo
de Investigación en Procesamiento de
Señales y Comunicaciones,
Departamento de Electrónica y
Tecnología de Computadores,
Facultad de Ciencias, Universidad de
Granada
http://ceres.ugr.es
Grup de Fonètica,
Seminari de Filologia i Informàtica,
Departament de Filologia Espanyola,
Facultat de Filosofia i Lletres, Universitat
Autònoma de Barcelona
http://liceu.uab.cat
GTH, Grupo
de Tecnología del Habla, Departamento de
Ingeniería Electrónica,
Escuela Técnica Superior de Ingenieros de
Telecomunicación,
Universidad Politécnica de Madrid
http://www-gth.die.upm.es
RFIA, Grupo
de Reconocimiento de Formas e Inteligencia
Artificial, Departamento de
Sistemas Informáticos y
Computación, Facultad de
Informática,
Universidad Politécnica de Valencia
http://elirf.dsic.upv.es/elirf/
CASACUBERTA, F.- GARCIA, R.- LLISTERRI, J.- NADEU, C.- PARDO, J.M.- RUBIO, A. (1991) "Development of Spanish Corpora for Speech Research (Albayzin)", in CASTAGNERI, G. (Ed) Proceedings of the Workshop on International Cooperation and Standardization of Speech Databases and Speech i/O Assessment Methods. Chiavari 26-28 September 1991 (Italy).
CASACUBERTA, F.- GARCÍA, R.- LLISTERRI, J.- NADEU, C.- PARDO, J.M.- RUBIO, A. (1992) "Desarrollo de corpus para investigación en tecnologías del habla (Albayzín)", Procesamiento del Lenguaje Natural ,Boletín 12: 35-42
DÍAZ, J.- RUBIO, A.- PEINADO, A.- SEGARRA, E.- PRIETO, N.- CASACUBERTA, F. (1993) "Development of task-oriented Spanish speech corpora" in Eurospeech'93. 3rd European Conference on Speech Communication and Technology. Berlin, Germany, 21-23 September 1993.
DÍAZ, J.E.- PEINADO, A.M.- RUBIO, A.J.- SEGARRA, E.- PRIETO, N.- CASACUBERTA, F. (1998) "Albayzín: a task-oriented Spanish speech corpus", in Proceedings of the First International Conference on Language Resources and Evaluation. May 28 - 30, 1998, Granada, Spain.
LLISTERRI, J.- POCH, D. (1991) "Phonetic criteria for the development of a speech database in Spanish (the Albayzin project), in CASTAGNERI, G. (Ed) Proceedings of the Workshop on International Cooperation and Standardization of Speech Databases and Speech i/O Assessment Methods. Chiavari 26-28 September 1991 (Italy).
MORENO, A.- POCH, D.- BONAFONTE, A.- LLEIDA, E.- LLISTERRI, J.- MARIÑO, J.B.- NADEU, C. (1993) "ALBAYZIN Speech Database: Design of the Phonetic Corpus" in Eurospeech'93. 3rd European Conference on Speech Communication and Technology. Berlin, Germany, 21-23 September 1993. Vol. 1 pp. 175-178
NADEU, C. (1991) "Development of Spanish Databases for Speech Recognition", in CASTAGNERI, G. (Ed) Proceedings of the Workshop on International Cooperation and Standardization of Speech Databases and Speech i/O Assessment Methods. Chiavari 26-28 September 1991 (Italy).
Grup de Fonètica,
Seminari de Filologia i Informàtica,
Departament de Filologia Espanyola,
Facultat de Filosofia i Lletres, Universitat
Autònoma de Barcelona
http://liceu.uab.cat
http://www.phon.ucl.ac.uk/shop/eurom1.php
GIBBON, D. - MOORE, R.- WINSKI,
R. (Eds.) (1998) Spoken
Language Reference Materials. Berlin: Mouton
De Gruyter. (Handbook of
Standards and Resources for Spoken Language
Systems, Volume IV). Publicación
electrónica en
http://coral.lili.uni-bielefeld.de/EAGLES/
http://gps-tsc.upc.es/veu/LR/LR_EuromI.php3
MARIÑO, J.B. - LLISTERRI, J. (1993) Spanish adaptation of SAMPA and automatic phonetic transcription. SAM-A/UPC/001/v1 20th April 1993. ESPRIT PROJECT 6819 (SAM-A Speech Technology Assessment in Multilingual Applications).
MORENO, A. (1993) EUROM-1 Spanish Database. Report D6, SAM-A/UPC/003. September 1993
Grup de Fonètica,
Seminari de Filologia i Informàtica,
Departament de Filologia Espanyola,
Facultat de Filosofia i Lletres, Universitat
Autònoma de Barcelona
http://liceu.uab.cat
http://www.speechdat.org/SpeechDat.html
http://www.speechdat.org/SP-CAR/
WINSKI, R.- SENIA, F.- CONNER,
P.- HÄe&Szlig;-ÜMBACH,
R.- CONSTANTINESCU, A.- NIEDERMAIR, G.- MORENO,
A.- TRANCOSO, I. (1996)
Specification of Telephone Speech Data
Collection. LRE-63314 SPEECHDAT,
Deliverable D1.4.1. Publicación
electrónica en
http://www.speechdat.org/speechdt/speechdat_m/deliverables/D141.pdf
http://gps-tsc.upc.es/veu/sala2/
Grup de Processament de la Veu, Departament de Teoria del Senyal i Comunicacions, Escola Tècnica Superior d'Enginyers de Telecomunicació, Universitat Politènica de CatalunyaTelefónica I+D, Madrid
ENA Telecomunicaciones
http://www.sepln.org/revistaSEPLN/revista/10/10-7.pdf
Laboratori de Fonètica
Institut d'Estudis Catalans
http://www.iec.cat/coneixement/entrada_c.asp?c_epigraf_num=155
http://www.cstr.ed.ac.uk/research/projects/artic/accor.html
MARCHAL, A.- HARDCASTLE, W.J. (1993) "ACCOR: Instrumentation and database for the cross-language study of coarticulation", Language and Speech 36: 137-153
SCHMIDBAUER, O.- CASACUBERTA, F.- CASTRO, M.J.- HEGERL, G.- HÜGE, H.- SÁNCHEZ, J.A.- ZOLKARNIK, I. (1993) "Articulatory Representation and Speech Technology", Language and Speech 36, 2,3: 331-351.
http://aune.lpl.univ-aix.fr/projects/multext/index.html
LLISTERRI, J. (Ed.) (1996) Prosody Tools Efficiency and Failures. WP 4 Corpus. T4.6 Speech Markup and Validation. Deliverable 4.5.2. Final version. 15 October 1996. LRE Project 62-050 MULTEXT.
Telefónica I+D, Madrid
http://xml.coverpages.org/mate.html
TAPIAS, D.- ACERO, A.- ESTEVE, J.- TORRECILLA, C. (1993) "Dona tu voz a la ciencia", Factores Humanos de Telefónica I+D 3 (diciembre)
TAPIAS, A.- ACERO, A.- ESTEVE, J. - TORRECILLA, J.C. (1994) "The VESTEL Telephone Speech Database",in ICSLP'94. Proceedings of the International Conference on Spoken Language Processing 1994. pp. 1811-1814
Grupo de
Teoría de la Señal, Departamento de
Tecnologías de
las Comunicaciones, Universidade de Vigo
http://www.gts.tsc.uvigo.es/
VILLARRUBIA, L.- LEÓN, P.- HERNÁNDEZ, L.- NADEU, C,- ESQUERRA, I.- HERNANDO, J.- GARCÍA MATEO, C.- DOCIO, L. (1998) "VOCATEL and VOGATEL: Two Telephone Speech Databases of Spanish Minority Languages (Catalan and Galician)", Workshop on Language Resources for European Minority Languages , May 27 1998, Granada, Spain.
http://www-01.ibm.com/software/pervasive/embedded_viavoice/
LÓPEZ DE IPIÑA, K.- TORRES, I.- OÑEDERRA, L. (1998) "A speech database in Basque language", Workshop on Language Resources for European Minority Languages , May 27 1998, Granada, Spain.
http://elirf.dsic.upv.es/elirf/
Grup de Fonètica,
Seminari de Filologia i Informàtica,
Departament de Filologia Espanyola,
Facultat de Filosofia i Lletres, Universitat
Autònoma de Barcelona
http://liceu.uab.cat
Joaquim Llisterri, Universitat Autònoma
de Barcelona
Last modified: 6/11/11 19:21