Speechlab

Publications
 
All documents on this website are the intellectual property of their authors. They may be freely used for personal purposes, and may be used for public presentation only with the authors' permission.
 

2016

  • Malek J., Cerva P., Seps L., Nouza J.: Study on the use and adaptation of bottleneck features for robust speech recognition of nonlinearly distorted speech, In: 13th International Conference on Signal Processing and Multimedia Applications (SIGMAP 2016), Lisbon, Portugal, pp. 65-71, DOI: 10.5220/0005955500650071, WOS: 000391091400006, Scopus EID: 2-s2.0-85004090263, ISBN 978-989-758-196-0, 2016. SCOPUS ISI
  • Chaloupka J.: Automatic Symbol Processing for Language Model Building in Slavic Languages, In: Proc. of Information technologies Applications and Theory Conference - ITAT 2016, Slovak Republic, pp. 37-41, ISBN 978-1537016740, ISSN 1613-0073, 2016.
  • Nouza, J., Safarik, R., Cerva, P.: ASR for South Slavic Languages Developed in Almost Automated Way, In: Proc. of the 17th Annual Conference of the International Speech Communication Association (INTERSPEECH 2016), San Francisco, USA, pp. 3868-3872, DOI: 10.21437/Interspeech.2016-747, Scopus EID: 2-s2.0-84994385032, ISSN 2308-457X, 2016. SCOPUS
  • Šafařík, R., Matějů, L.: Impact of Phonetic Annotation Precision on Automatic Speech Recognition Systems, In: Proc. of the 39th International Conference on Telecommunications and Signal Processing (TSP 2016), Vienna, Austria, pp. 311-314, DOI: 10.1109/TSP.2016.7760886, WOS: 000390164000067, Scopus EID: 2-s2.0-85006826243, ISBN: 978-1-5090-1287-9, ISSN: 1805-5435, 2016. SCOPUS ISI
  • Boháč, M., Matějů, L., Rott, M., Šafařík, R.: Automatic Syllabification and Syllable Timing of Automatically Recognized Speech - for Czech, In: Proc. of the 19th International Conference of Text, Speech, and Dialogue - TSD 2016, Brno, Czech Republic, pp. 540-547, DOI: 10.1007/978-3-319-45510-5_62, WOS: 000389707400062, Scopus EID: 2-s2.0-85008368358, ISBN 978-3-319-45509-9, ISSN 0302-9743, 2016. SCOPUS ISI
  • Rott, M., Červa, P.: Speech-to-Text Summarization Using Automatic Phrase Extraction from Recognized Text, In: Proc. of the 19th International Conference of Text, Speech, and Dialogue - TSD 2016, Brno, Czech Republic, pp. 101-108, DOI: 10.1007/978-3-319-45510-5_12, WOS: 000389707400012, Scopus EID: 2-s2.0-85008414224, ISBN 978-3-319-45509-9, ISSN 0302-9743, 2016. SCOPUS ISI
  • Kovář, V., Machura, J., Zemková, K., Rott, M.: Evaluation and Improvements in Punctuation Detection for Czech, In: Proc. of the 19th International Conference of Text, Speech, and Dialogue - TSD 2016, Brno, Czech Republic, pp. 287-294, ISBN 978-3-319-45509-9, ISSN 0302-9743, 2016. SCOPUS ISI
  • Palecek, K., Chaloupka, J.: Depth-based Features in Audio-Visual Speech Recognition, In: Proc. of the 39th International Conference on Telecommunications and Signal Processing (TSP 2016), Vienna, Austria, pp. 303-306, DOI: 10.1109/TSP.2016.7760884, WOS: 000390164000065, Scopus EID: 2-s2.0-85006804636, ISBN: 978-1-5090-1287-9, ISSN: 1805-5435, 2016. SCOPUS ISI
  • Palecek, K.: Lipreading Using Spatiotemporal Histogram of Oriented Gradients, In: 24th European Signal Processing Conference (EUSIPCO 2016), Budapest, Hungary, pp. 1882-1885, DOI: 10.1109/EUSIPCO.2016.7760575, WOS: 000391891900359, Scopus EID: 2-s2.0-85005976126, ISBN 978-0-9928-6265-7, 2016. SCOPUS ISI
  • Mateju, L., Cerva, P., Zdansky, J.: Study on the Use of Deep Neural Networks for Speech Activity Detection in Broadcast Recordings, In: 13th International Conference on Signal Processing and Multimedia Applications (SIGMAP 2016), Lisbon, Portugal, pp. 45-51, DOI: 10.5220/0005952700450051, WOS: 000391091400004, Scopus EID: 2-s2.0-85004178567, ISBN 978-989-758-196-0, 2016. SCOPUS ISI

2015

  • Safarik, R., Nouza, J.: Methods for Rapid Development of Automatic Speech Recognition System for Russian, In: 2015 IEEE International Workshop of Electronics, Control, Measurement, Signals and their application to Mechatronics, Czech Republic, pp. 26-31, ISBN: 978-1-4799-6972-2, WOS: 000363814500011, 2015 SCOPUS ISI
  • Mateju, L., Cerva, P., Zdansky, J.: Investigation into the use of deep neural networks for LVCSR of Czech, In: 2015 IEEE International Workshop of Electronics, Control, Measurement, Signals and their application to Mechatronics, Czech Republic, pp. 38-41, ISBN: 978-1-4799-6972-2, WOS: 000363814500033, 2015 SCOPUS ISI
  • Bohac, M., Rott, M., Blavka, K.: On Automatic Cross-Lingual Subtitle Timing, In: 2015 IEEE International Workshop of Electronics, Control, Measurement, Signals and their application to Mechatronics, Czech Republic, pp. 32-37, ISBN: 978-1-4799-6972-2, WOS: 000363814500029, 2015 SCOPUS ISI
  • Nouza, J., Cerva, P., Safarik, R.: Cross-Lingual Adaptation of Broadcast Transcription System to Polish Language Using Public Data Sources, In: 7th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, Poland, pp. 181-185, ISBN 978-83-932640-8-7, 2015
  • Bohac, M., Rott, M.: Exploiting of the timing information in subtitle-like parallel multilingual data, In: 7th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, Poland, pp. 208-212, ISBN 978-83-932640-8-7, 2015
  • Rott, M., Cerva, P.: Study on Methods for Vector Representation of Text for Topic-based Clustering of News Articles, In: 7th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, Poland, pp. 530-534, ISBN 978-83-932640-8-7, 2015
  • Rott, M.: The Initial Study of Term Vector Generation Methods for News Summarization, in Proc. of Recent Advances in Slavonic Natural Language Processing RASLAN 2015, pp. 23-30, ISSN 2336-4289, ISBN 978-80-263-0974-1, 2015
  • Chuong N., T., Chaloupka, J., Nouza, J.: Study on Incorporating Tone into Speech Recognition of Vietnamese, In: 2015 IEEE International Workshop of Electronics, Control, Measurement, Signals and their application to Mechatronics, Czech Republic, pp. 42-47, ISBN: 978-1-4799-6972-2, WOS: 000363814500013, 2015 SCOPUS ISI

2014

  • Chaloupka, J., Nouza, J., Malek, J., Silovsky, J.: Phone Speech Detection and Recognition in the Task of Historical Radio Broadcast Transcription, In: Proc. of Telecommunications and Signal Processing (TSP) conference, Berlin, Germany, pp. 433 – 436, ISBN: 978-80-214-4983-1, ISSN 1805-5435, 2014 SCOPUS
  • Nouza, J., Blavka, K., Boháč, M., Červa, P., Málek, J.: System for Producing Subtitles to Internet Audio-Visual Documents, In: Proc. of Telecommunications and Signal Processing (TSP) conference, Berlin, Germany, pp. 437 – 441, ISBN: 978-80-214-4983-1, 2014 SCOPUS
  • Nouza, J., Cerva, P., Zdansky, J., Blavka, K., Bohac, M., Silovsky, J., Chaloupka, J., Kucharova, M., Seps, L., Malek, J., Rott, M.: Speech-To-Text Technology to Transcribe and Disclose 100,000+ Hours of Bilingual Documents from Historical Czech and Czechoslovak Radio Archive, In: Proceedings of the 15th Annual Conference of the International Speech Communication Association (INTERSPEECH 2014), Singapore, pp. 964-968, ISSN 2308-457X, 2014 SCOPUS
  • Seps, L., Malek, J., Cerva, P., Nouza, J.: Investigation of Deep Neural Networks for Robust Recognition of Nonlinearly Distorted Speech, In: Proceedings of the 15th Annual Conference of the International Speech Communication Association (INTERSPEECH 2014), Singapore, pp. 363-367, ISSN 2308-457X, 2014 SCOPUS
  • Malek, J., Silovsky, J., Cerva, P., Koldovsky, Z., Nouza, J., Zdansky, J.: Compensation of Nonlinear Distortions in Speech for Automatic Recognition, In: Proc. of Telecommunications and Signal Processing (TSP) conference, Berlin, Germany, pp. 419-423, ISBN: 978-80-214-4983-1, 2014 SCOPUS
  • Kucharova, M., Skodova, S., Seps, L., Bohac, M.: Study on Phrases Used for Semi-automatic Text-based Speakers’ Names Extraction in the Czech Radio Broadcasts News, In 17th International Conference, TSD 2014, Springer-Verlag Berlin Heidelberg, pp. 416-423, ISSN 0302-9743, ISBN 978-331910815-5, DOI: 10.1007/978-3-319-10816-2_50, 2014 SCOPUS
  • Rott, M., Cerva, P.: Investigation of Latent Semantic Analysis for Clustering of Czech News Articles, In: 25th International Workshop on Database and Expert Systems Applications, Munich, Germany, pp. 223-227, ISSN 1529-4188, ISBN 978-1-4799-5722-4, 2014 SCOPUS
  • Palecek, K.: Comparison of Depth-based Features for Lipreading, In: Proc. of Telecommunications and Signal Processing (TSP) conference, Berlin, Germany, pp. 658 – 651, ISBN: 978-80-214-4983-1, 2014 SCOPUS
  • Bohac, M., Blavka, K.: Using Suprasegmental Information in Recognized Speech Punctuation Completion, In 17th International Conference, TSD 2014, Springer-Verlag Berlin Heidelberg, pp. 555-562, ISSN 0302-9743, ISBN 978-331910815-5, DOI: 10.1007/978-3-319-10816-2_50, 2014 SCOPUS
  • Silovsky, J., Nouza, J., Kucharova, M.: Search for speaker identity in historical oral archives, In: An International Journal Multimedia Tools and Applications, pp. 1-20, ISSN 1380-7501, July 2014, DOI 10.1007/s11042-014-2067-2 SCOPUS
  • Paleček, K.: Extraction of Features for Lip-reading Using Autoencoders. In: Proceedings of the 16th International Conference on Speech and Computer (SPECOM), 2014, Novi Sad, Serbia, pp. 209-216, ISBN 978-3-319-11580-1, ISSN 0302-9743, 2014 SCOPUS ISI
  • Koldovský, Z., Tichavský, P.: A Homotopy Recursive-in-Model-Order Algorithm for Weighted Lasso, Proc. of the 41st IEEE International Conference on Audio, Speech, and Signal Processing (ICASSP 2014), Florence, Italy, pp. 4179-4183, May 2014
  • Málek, J., Koldovský, Z.: Sparse Target Cancellation Filters with Application to Semi-Blind Noise Extraction, Proc. of the 41st IEEE International Conference on Audio, Speech, and Signal Processing (ICASSP 2014), Florence, Italy, pp. 2128-2132, May 2014 SCOPUS ISI
  • Koldovský, Z., Málek, J., Müller, M., Tichavský, P.: On Semi-Blind Estimation of Echo Paths During Double-Talk Based on Nonstationarity, Proc. of the 14th International Workshop on Acoustic Signal Enhancement (IWAENC 2014), pp. 199-203, Antibes – Juan les Pins, France, Sept. 2014 SCOPUS ISI
  • Málek, J., Botka, D., Koldovský, Z., Gannot, S.: Methods to Learn Bank of Filters Steering Nulls toward Potential Positions of a Target Source, Proc. of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA 2014), Nancy, France, May 12-14, 2014 SCOPUS ISI
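  • Koldovský, Z., Tichavský, P.: Způsob potlačení šumu a zvýraznění řečového signálu pro mobilní telefon se dvěma nebo více mikrofony [Method of noise suppression and speech signal enhancement for a mobile phone with two or more microphones], Czech patent no. 304330, 2014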

2013

  • Nouza, J., Cerva, P., Silovsky, J.: Adding Controlled Amount of Noise to Improve Recognition of Compressed and Spectrally Distorted Speech, In International Conference on Acoustics, Speech, and Signal Processing Mobile App - ICASSP 2013, Vancouver, Canada, pp. 8046-8050, ISBN 978-1-4799-0356-6, 2013 SCOPUS ISI
  • Nouza, J., Cerva, P., Kucharova, M.: Cost-Efficient Development of Acoustic Models for Speech Recognition of Related Languages, In Radioengineering, vol. 22, no. 3, September 2013, pp. 866-873, ISSN 1210-2512, 2013 SCOPUS  ISI
  • Nouza, J., Cerva, P., Silovsky, J.: Automatic Transcription of Bilingual Historical Broadcast Archive of former Czechoslovak Radio, In ICIAP 2013 - International Workshop on Multimedia for Cultural Heritage MM4CH, Springer-Verlag Berlin Heidelberg, Italy, pp. 238-246, ISBN 978-3-642-41189-2, 2013
  • Chaloupka, J., Nouza, J., Kucharova, M.: Using Different Types of Multimedia Resources to Train System for Automatic Transcription of Czech Historical Oral Archives, In ICIAP 2013 - International Workshop on Multimedia for Cultural Heritage MM4CH, Springer-Verlag Berlin Heidelberg, Italy, pp. 228-237, ISBN 978-3-642-41189-2, 2013
  • Chaloupka, J., Nouza, J., Cerva, P., Malek, J.: Downdating lexicon and language model for automatic transcription of Czech historical spoken documents, In 16th International Conference, TSD 2013, Springer-Verlag Berlin Heidelberg, pp. 201-208, ISSN 0302-9743, 2013 SCOPUS ISI
  • Malek, J.: Blind Compensation of Memoryless Nonlinear Distortions in Sparse Signals, In Proc. EUSIPCO 2013, Marrakech, Morocco, 2013 SCOPUS ISI
  • Seps, L.: NanoTrans – Editor for Orthographic and Phonetic Transcriptions, In 36th International Conference on Telecommunications and Signal Processing (TSP), Italy, pp. 479-483, ISBN 978-1-4799-0403-7, 2013 SCOPUS  ISI
  • Kucharova, M., Nouza, J., Cerva, P.: Impact of Microphone on Computer Applications with Voice Input Modality, In 36th International Conference on Telecommunications and Signal Processing (TSP), Italy, pp. 469-473, ISBN 978-1-4799-0403-7, 2013  SCOPUS  ISI
  • Kucharova, M., Skodova, S., Seps, L., Labus, V., Nouza, J., Bohac, M.: On the Quantitative and Qualitative Speech Changes of the Czech Radio Broadcast News within Years 1969-2005, In 16th International Conference, TSD 2013, Springer-Verlag Berlin Heidelberg, pp. 360-368, ISSN 0302-9743, 2013 SCOPUS ISI
  • Bohac, M., Seps, L.: Comparison of Several Techniques for Detection of Key Slides in Lecture Support Materials, In 36th International Conference on Telecommunications and Signal Processing (TSP), Italy, pp. 783-787, ISBN 978-1-4799-0403-7, 2013  SCOPUS ISI
  • Bohac, M., Blavka, K.: Text-to-Speech Alignment for Imperfect Transcriptions, In 16th International Conference, TSD 2013, Springer-Verlag Berlin Heidelberg, pp. 536-543, ISSN 0302-9743, 2013 SCOPUS  ISI
  • Rott, M., Cerva, P.: SummEC: A Summarization Engine for Czech, In 16th International Conference, TSD 2013, Springer-Verlag Berlin Heidelberg, pp. 527-535, ISSN 0302-9743, 2013 SCOPUS  ISI
  • Cerva, P., Silovsky, J., Zdansky, J., Nouza, J., Seps, L.: Speaker-adaptive speech recognition using speaker diarization for improved transcription of large spoken archives. In: Speech Communication, vol. 55, no. 10, pp. 1033-1046, ISSN 0167-6393, 2013  SCOPUS ISI
  • Palecek, K., Chaloupka, J.: Audio-Visual Speech Recognition in Noisy Audio Environments, In 36th International Conference on Telecommunications and Signal Processing (TSP), Italy, pp. 484-487, ISBN 978-1-4799-0403-7, 2013 SCOPUS  ISI
  • Chuong, N. T., Chaloupka, J.: Visual Feature Extraction for Isolated Word Visual Only Speech Recognition of Vietnamese, In 36th International Conference on Telecommunications and Signal Processing (TSP), Italy, pp. 459-463, ISBN 978-1-4799-0403-7, 2013  SCOPUS ISI
  • Chuong, N. T., Chaloupka, J.: Developing Text and Speech Databases for Speech Recognition of Vietnamese, In IEEE 7th International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS), Berlin, Germany, pp. 163-166, ISBN 978-1-4799-1426-5, 2013 SCOPUS
  • Chuong, N. T., Chaloupka, J.: Phoneme Set and Pronouncing Dictionary Creation for Large Vocabulary Continuous Speech Recognition of Vietnamese, In 16th International Conference, TSD 2013, Springer-Verlag Berlin Heidelberg, pp. 394-401, ISSN 0302-9743, 2013 SCOPUS ISI
  • Z. Koldovský, P. Tichavský, A. H. Phan, and A. Cichocki, "A Two-Stage MMSE Beamformer for Underdetermined Signal Separation," IEEE Signal Processing Letters, Vol. 20, No. 12, pp. 1227-1230, Dec. 2013 SCOPUS  ISI
  • Z. Koldovský, J. Málek, P. Tichavský, and F. Nesta, "Semi-blind Noise Extraction Using Partially Known Position of the Target Source", IEEE Trans. on Speech, Audio and Language Processing, vol. 21, no. 10, pp. 2029-2041, Oct. 2013.  SCOPUS ISI
  • J. Málek, Z. Koldovský, S. Gannot, and P. Tichavský, "Informed Generalized Sidelobe Canceler Utilizing Sparsity of Speech Signals," Proc. of IEEE International Workshop on Machine Learning for Signal Processing, Southampton, UK, Sept. 2013. SCOPUS
  • Z. Koldovský, P. Tichavský, D. Botka, "Noise Reduction in Dual-Microphone Mobile Phones Using A Bank of Pre-Measured Target-Cancellation Filters," Proc. of ICASSP 2013, pp. 679-683, Vancouver, Canada, May 2013. SCOPUS  ISI

2012

  • Nouza, J., Blavka, K., Bohac, M., Cerva, P., Zdansky, J., Silovsky, J. and Prazak, J.: Voice Technology to Enable Sophisticated Access to Historical Audio Archive of the Czech Radio. In: Proc. of Multimedia for Cultural Heritage, vol. 247, Springer, Berlin Heidelberg, ISBN 978-3-642-27977-5, ISSN 1865-0929, pp. 27-38, 2012 SCOPUS ISI
  • Nouza, J., Blavka, K., Cerva, P., Zdansky, J., Silovsky, J., Bohac, M. and Prazak, J.: Making Czech Historical Radio Archive Accessible and Searchable for Wide Public. In: Journal of Multimedia, vol. 7, no. 2, Academy Publisher, pp. 159 – 169, ISSN 1796-2048, 2012 SCOPUS
  • Nouza, J., Blavka, K., Žďánský, J., Červa, P., Silovský, J., Boháč, M., Chaloupka, J., Kuchařová, M., Šeps, L.: Large-Scale Processing, Indexing and Search System for Czech Audio-Visual Cultural Heritage Archives. In: Proc. of IEEE conf. on Multimedia Signal Processing (MMSP), Banff, Canada, pp. 337-342, ISBN 978-146734572-9, 2012 ISI
  • Nouza, J., Cerva, P., Zdansky, J., Kucharova, M.: A Study on Adapting Czech Automatic Speech Recognition System to Croatian Language, In: proc. of 54th International Symposium ELMAR-2012, Croatia, pp. 227-230, ISBN 978-953704413-8, 2012 SCOPUS
  • Chaloupka, J., Červa, P., Silovský, J., Žďánský, J., Nouza, J.: Modification of the Speech Feature Extraction Module for the Improvement of the System for Automatic, In: proc. of 54th International Symposium ELMAR-2012, Croatia, pp. 223-226, ISBN 978-953704413-8, 2012 SCOPUS
  • Boháč, M., Blavka, K., Kuchařová, M., Škodová, S.: Post-processing of the Recognized Speech for Web Presentation of Large Audio Archive, In: Proc. of Telecommunications and Signal Processing (TSP) conference, Prague, pp. 441 – 445, ISBN: 978-1-4673-1117-5, 2012 SCOPUS ISI
  • Boháč, M., Nouza, J., Blavka K.: Investigation on Most Frequent Errors in Large-Scale Speech Recognition Applications. In: Proc. of Text, Speech and Dialogue (TSD). Springer Verlag Berlin Heidelberg, Series LNCS 7499, pp. 520-527, ISBN 978-3-642-32789-6, ISSN 0302-9743, 2012 SCOPUS
  • Bohac, M.: Performance Comparison of Several Techniques to Detect Keywords in Audio Streams and Audio Scene, In: proc. of 54th International Symposium ELMAR-2012, Croatia, pp. 215-218, ISBN 978-953704413-8, 2012 SCOPUS
  • Prazak, J., Bohac, M.: Speaker Diarization of Broadcast Audio Using Automatic Transcription, iVectors and Cosine Distance Scoring, In: proc. of 54th International Symposium ELMAR-2012, Croatia, pp. 211-214, ISBN 978-953704413-8, 2012 SCOPUS
  • Silovsky, J., Zdansky, J., Nouza, J., Cerva, P., Prazak, J.: Incorporation of the ASR output in speaker segmentation and clustering within the task of speaker diarization of broadcast streams, In: Proc. of IEEE conf. on Multimedia Signal Processing (MMSP), Banff, Canada, pp. 118-123, ISBN 978-146734572-9, 2012 SCOPUS ISI
  • Silovský, J., Červa, P., Žďánský, J., Nouza J.: Study on Integration of Speaker Diarization with Speaker Adaptive Speech Recognition for Broadcast Transcription. In: Proc. of Interspeech 2012, Portland, USA, 2012
  • Silovsky, J., Prazak, J.: Speaker Diarization of Broadcast Streams using Two-stage Clustering based on I-vectors and Cosine Distance Scoring, In: Proc. of International Conference on Acoustics, Speech, and Signal Processing - ICASSP 2012, Kyoto, Japan, pp. 4193-4196, ISBN: 978-146730046-9, 2012 SCOPUS ISI
  • Škodová, S., Kuchařová, M., Šeps, L.: Discretion of Speech Units for the Text Post-processing Phase of Automatic Transcription (in the Czech Language), In: Proc. of Text, Speech and Dialogue (TSD). Springer Verlag Berlin Heidelberg, Series LNCS 7499, pp. 446-455, ISBN 978-3-642-32789-6, ISSN 0302-9743, 2012 SCOPUS
  • Palecek, K.: Detection of Similar Advertisements in Media Databases, In: Lecture Notes in Computer Science, Springer-Verlag Berlin, vol. 6800, pp. 178-184, ISBN: 978-3-642-25774-2, 2012 SCOPUS ISI
  • Cerva, P., Silovsky, J., Zdansky, J., Nouza, J., Malek, J.: Real-Time Lecture Transcription using ASR for Czech Hearing Impaired or Deaf Students, In: Proc. of Interspeech 2012, Portland, USA, 2012
  • Cerva, P., Silovsky, J., Zdansky, J., Smola, O., Blavka, K., Palecek, K., Nouza, J.: Browsing, Indexing and Automatic Transcription of Lectures for Distance Learning, In: Proc. of IEEE conf. on Multimedia Signal Processing (MMSP), Banff, Canada, pp. 198-202, ISBN 978-146734572-9, 2012 ISI
  • Z. Koldovský, A. H. Phan, P. Tichavský, and A. Cichocki, "A Treatment of EEG Motor Imagery data by Underdetermined Blind Source Separation," Proc. of EUSIPCO, pp. 1484-1488, ISSN: 2076-1465, Bucharest, Romania, August 27-31, 2012. SCOPUS ISI
  • S. Araki, F. Nesta, E. Vincent, Z. Koldovský, G. Nolte, A. Ziehe and A. Benichoux, "The 2011 Signal Separation Evaluation Campaign (SiSEC2011): - Audio Source Separation -," Proc. of The 10th International Conference on Latent Variable Analysis and Source Separation (LVA/ICA 2012), pp. 414-422, ISBN: 978-3-642-28550-9, Tel-Aviv, Israel, March 2012. SCOPUS
  • G. Nolte, D. Lutter, A. Ziehe, F. Nesta, E. Vincent, Z. Koldovský, A. Benichoux and S. Araki, "The 2011 Signal Separation Evaluation Campaign (SiSEC2011): - Biomedical Data Analysis -," Proc. of The 10th International Conference on Latent Variable Analysis and Source Separation (LVA/ICA 2012), pp. 423-429, ISBN: 978-3-642-28550-9, Tel-Aviv, Israel, March 2012. SCOPUS
  • J. Málek, Z. Koldovský, and P. Tichavský, "Semi-Blind Source Separation Based on ICA and Overlapped Speech Detection," Proc. of The 10th International Conference on Latent Variable Analysis and Source Separation (LVA/ICA 2012), pp. 462-469, ISBN: 978-3-642-28550-9, Tel-Aviv, Israel, March 2012. SCOPUS

2011

  • Nouza J., Cerva P., Chaloupka J.: Rainbow Bridge: Training Center Based on Voice Technology for People with Physical Disabilities, In proc. of international conference on health informatics HEALTHINF 2011 (BIODEVICES 2011), January 26-29 2011, Rome, Italy, pp. 529 - 533, ISBN 978-989842534-8, 2011 SCOPUS ISI
  • Chaloupka, J.: Design of Audio-Visual TV Broadcast News Transcription System Prototype, In proc. of 53rd International IEEE Symposium ELMAR-2011, Zadar, Croatia, pp. 209-212, ISBN 978-953-7044-12-1, 2011 SCOPUS
  • Chaloupka, J.: Automatic Video Segmentation for Czech TV Broadcast Transcription, In proc. of 10th IEEE International workshop on Electronics, Control, Measurement and Signals (ECMS 2011), June 1-3 2011, Liberec, Czech Republic, pp. 71 - 75, ISBN 978-1-61284-395-7, 2011 SCOPUS
  • Bohac, M., Blavka, K.: Automatic segmentation and annotation of audio archive documents, In proc. of 10th IEEE International workshop on Electronics, Control, Measurement and Signals (ECMS 2011), June 1-3 2011, Liberec, Czech Republic, pp. 61 - 66, ISBN 978-1-61284-395-7, 2011 SCOPUS
  • Prazak, J., Silovsky, J.: Speaker Diarization Using PLDA-based Speaker Clustering, In proc. of the 6th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications, 15-17 September 2011, Prague, Czech Republic, pp. 347 - 350, ISBN 978-1-4577-1423-8, 2011 SCOPUS
  • Bohac, M., Nouza, J.: Direct Magnitude Spectrum Analysis Algorithm for Tone Identification in Polyphonic Music Transcription, In proc. of the 6th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications, 15-17 September 2011, Prague, Czech Republic, pp. 373 - 478, ISBN 978-1-4577-1423-8, 2011 SCOPUS
  • Silovsky, J., Cerva, P., Zdansky, J.: Assessment of Speaker Recognition on Lossy Codecs Used for Transmission of Speech, In proc. of 53rd International IEEE Symposium ELMAR-2011, Zadar, Croatia, pp. 205-208, ISBN 978-953-7044-12-1, 2011 SCOPUS
  • Cerva, P., Palecek, K., Silovsky, J., Nouza, J.: An Investigation into VTLN for Improved Transcription of Czech Broadcast Programs, In proc. of 53rd International IEEE Symposium ELMAR-2011, Zadar, Croatia, pp. 201-204, ISBN 978-953-7044-12-1, 2011  SCOPUS
  • Málek, J., Koldovský, Z.: Fuzzy Clustering of Independent Components within Time-Domain Blind Audio Source Separation Method, In proc. of 10th IEEE International workshop on Electronics, Control, Measurement and Signals (ECMS 2011), June 1-3 2011, Liberec, Czech Republic, pp. 44 - 49, ISBN 978-1-61284-395-7, 2011 SCOPUS
  • Koldovský, Z., Tichavský, P., Phan, A., H.: Stability Analysis and Fast Damped-Gauss-Newton Algorithm for INDSCAL Tensor Decomposition. In: IEEE Workshop on Statistical Signal Processing Proceedings, art. no. 5967765, France, pp. 581-584, ISBN: 978-145770570-0, 2011 SCOPUS ISI
  • Koldovský, Z., Tichavský, P.: Fast and accurate methods of independent component analysis: A Survey, In: Kybernetika, Vol. 47, No. 3, pp. 426-438, June 2011. SCOPUS ISI
  • Koldovský, Z., Tichavský, P.: Time-Domain Blind Separation of Audio Sources on the basis of a Complete ICA Decomposition of an Observation Space, IEEE Trans. on Speech, Audio and Language Processing, Vol. 19, No. 2, pp. 406-416, ISSN 1558-7916, February 2011 SCOPUS ISI
  • Chuong, N., T.: Selection of Sentence Set for Vietnamese Audio-Visual Corpus Design, In proc. of the 6th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications, 15-17 September 2011, Prague, Czech Republic, pp. 492 - 495, ISBN 978-1-4577-1423-8, 2011 SCOPUS
  • Tichavský, P., Koldovský, Z.: "Weight Adjusted Tensor Method for Blind Separation of Underdetermined Mixtures of Nonstationary Sources," IEEE Trans. on Signal Processing, Vol. 59, No. 3, pp. 1037-1047, ISSN: 1053-587X, March 2011 SCOPUS ISI
  • Tichavský, P., Koldovský, Z.: Stability of CANDECOMP-PARAFAC tensor decomposition, ICASSP 2011, pp. 4164-4167, ISBN: 978-1-4577-0537-3, Prague, Czech Republic, May 2011 SCOPUS  ISI
  • Delgado, R., L., C., Silovsky, J., Kroul, M.: Enhancement of Emotion Detection in Spoken Dialogue Systems by Combining Several Information Sources, In journal Speech Communication, Sensing Emotion and Affect - Facing Realism in Speech Processing, Volume 53, Issues 9-10, November-December 2011, pp. 1210-1228, ISSN 0167-6393, 2011 SCOPUS ISI
  • Prochazka, V., Pollak, P., Zdansky, J., Nouza, J.: Performance of Czech Speech Recognition with Language Models Created from Public Resources, In Radioengineering, vol. 18, no. 1, pp. 1005-1008, ISSN 1210-2512, 2011 SCOPUS ISI
  • Chaloupka, J.: Audio-Visual Isolated Words Recognition for Voice Dialogue System, In: Analysis of Verbal and Nonverbal Communication and Enactment, Lecture Notes in Computer Science LNCS 6800, Springer, pp. 88-94, ISBN 978-3-642-25774-2, 2011
  • Nouza, J., Blavka, K., Bohac, M., Cerva, P., Zdansky, J., Silovsky, J., Prazak, J.: Voice technology to enable sophisticated access to historical Czech Radio audio archive, In proc. of International Workshop on Multimedia for Cultural Heritage (MM4CH 2011), Springer-Verlag, volume CCIS 247, May 3 2011, Modena, Italy, pp. 27–38, 2011
  • Nouza J., Bohac M.: Using TTS for Fast Prototyping of Cross-Lingual ASR Applications, In: Analysis of Verbal and Nonverbal Communication and Enactment, Lecture Notes in Computer Science LNCS 6800, Springer, pp. 154-162, ISBN 978-3-642-25774-2, 2011
  • Prazak, J., Silovsky, J.: Comparison of Segmentation and Clustering Methods for Speaker Diarization of Broadcast Stream Audio, In: Analysis of Verbal and Nonverbal Communication and Enactment, Lecture Notes in Computer Science LNCS 6800, Springer, pp. 214-222, ISBN 978-3-642-25774-2, 2011
  • Silovsky, J., Prazak, J., Cerva, P., Zdansky, J., Nouza, J.: PLDA-based Clustering for Speaker Diarization of Broadcast Streams, In proc. of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy 2011, pp. 2909 - 2912, ISSN 1990-9772, 2011
  • Cerva, P., Nouza, J., Silovsky, J.: Study on Cross-lingual Adaptation of a Czech LVCSR System towards Slovak, In: Analysis of Verbal and Nonverbal Communication and Enactment, Lecture Notes in Computer Science LNCS 6800, Springer, pp. 81-87, ISBN 978-3-642-25774-2, 2011
  • Cerva, P., Palecek, K., Silovsky, J., Nouza, J.: Using Unsupervised Feature-Based Speaker Adaptation for Improved Transcription of Spoken Archives, In proc. of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy 2011, pp. 2565 - 2568, ISSN 1990-9772, 2011
  • Palecek, K.: Detection of similar advertisements in media databases, In: Analysis of Verbal and Nonverbal Communication and Enactment, Lecture Notes in Computer Science LNCS 6800, Springer, pp. 178-184, ISBN 978-3-642-25774-2, 2011
  • Nouza, J., Blavka, K., Bohac, M., Kucharova, M., Zdansky, J., Seps, L., Prazak, J.: System for Transcribing and Accessing Historical Archive of Czech Radio. In Proc. of 5th Language & Technology conference (LTC 2011), Poznan, Poland, pp. 585, November 2011
  • Koldovský, Z., Málek, J., Tichavský, P.: Blind Speech Separation in Time-Domain Using Block-Toeplitz Structure of Reconstructed Signal Matrices, In proc. of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy 2011, pp. 561 - 564, ISSN 1990-9772, 2011
  • Koldovský, Z., Málek, J., Balík, M., Nouza, J.: CHiME Data Separation Based on Target Signal Cancellation and Noise Masking, In International Workshop on Machine Listening in Multisource Environments (CHiME Workshop - a satellite event of Interspeech 2011), pp. 47-50, Florence, Italy, Aug. 2011

2010

  • Nouza, J., Zdansky, J., Cerva, P., Silovsky, J.: Challenges in speech processing of Slavic languages (case studies in speech recognition of Czech and Slovak). In: Lecture Notes in Computer Science, Springer Verlag Berlin, Volume 5967 LNCS, 2010, pp. 225-241, ISBN 978-3-642-12396-2 SCOPUS ISI
  • Nouza, J., Silovský, J.: Adapting Lexical and Language Models for Spontaneous Czech. Text, Speech and Dialogue: 13th International Conference, TSD 2010, Brno, Czech Republic, September 6-10, 2010, pp. 377-384, ISBN 978-3-642-15759-2 SCOPUS ISI
  • Nouza, J., Zdansky, J., Cerva, P.: System for automatic collection, annotation and indexing of Czech broadcast speech with full-text search. In: proc. of the 15th IEEE Mediterranean Electrotechnical Conference - MELECON 2010. Malta, 25.-28. April 2010, pp. 202-205, ISBN: 978-1-4244-5793-9 SCOPUS ISI
  • Málek, J., Koldovský, Z., Tichavský, P.: Adaptive Time-Domain Blind Separation of Speech Signals. In: Latent Variable Analysis and Signal Separation, Lecture Notes in Computer Science, Springer, Volume 6365, pp. 9-16, ISBN: 978-3-642-15994-7 SCOPUS ISI
  • Koldovský Z., Tichavský P., Málek J., "Subband Blind Audio Source Separation Using a Time-Domain Algorithm and Tree-Structured QMF Filter Bank," in Latent Variable Analysis and Signal Separation, Lecture Notes in Computer Science, Volume 6365, pp. 25-32, ISBN: 978-3-642-15994-7, Springer, 2010. SCOPUS ISI
  • Koldovský Z., Tichavský P., Málek J., "Time-Domain Blind Audio Source Separation Method Producing Separating Filters of Generalized Feedforward Structure," in Latent Variable Analysis and Signal Separation, Lecture Notes in Computer Science, Volume 6365, pp. 17-24, ISBN: 978-3-642-15994-7, Springer, 2010. SCOPUS ISI
  • Chaloupka, J., Nouza, J.: Audio-Visual Television Broadcast Programs Processing, Transcription, Indexing and Searching, In: The 9th International Conference on Auditory-Visual Speech Processing - AVSP 2010, Japan, September, 2010, pp. 14-18, ISBN 978-4-9905475-0-9
  • Chaloupka, J.: Use of the Visual Speech Part in the Voice Dialogue Systems, In: proc. of 20th Czech-German Workshop Speech Processing, September, 2010, Prague, Czech Republic, pp. 89-93, ISBN 978-80-86269-21-4
  • Delgado, R., L., C., Silovsky, J., Griol, D.: F2 - New Technique for Recognition of User Emotional States in Spoken Dialogue Systems. In: Proceedings of the SIGDIAL 2010 Conference, September 2010, Tokyo, Japan, pp. 281-288
  • Delgado, R., L., C., Silovsky, J., Griol, D.: Enhancement of spoken dialogue systems by Means of User Emotion Recognition. Spanish Journal on Natural Language Processing, 2010, vol. 45, pp. 193-200, ISSN 1135-5948
  • Silovsky, J.: TUL NIST 2010 SRE System Description, In: Proc. of NIST 2010 Speaker Recognition Evaluation, Brno, Czech Republic, 2010, 4 pages, CD Proceedings
  • Prazak, J.: Robust Speaker Diarization. In: proc. of 20th Czech-German Workshop Speech Processing, September, 2010, Prague, Czech Republic, pp. 77-80, ISBN 978-80-86269-21-4
  • Nouza, J., Cerva, P., Novy, J.: Hlasové ovládání počítače - nová alternativa pro osoby s motorickým postižením, In: Speciální pedagogika, No. 2, 2010, pp. 87-97, ISSN 1211-2720
  • Hnilička O., Málek J., Paleček K., Koldovský Z., "A Fast C++ Implementation of Time-domain Blind Speech Separation Algorithm", Proc. 20th Czech-German Workshop on Speech Processing, Prague, 2010.

2009

  • Chaloupka, J., Nouza, J., Zdansky, J., Cerva, P., Silovsky, J., Kroul, M.: Voice Technology Applied for Building a Prototype Smart Room, In Lecture Notes in Artificial Intelligence, LNAI 5398, Springer-Verlag Berlin, 2009, pp. 104-111, ISBN 978-3-642-00524-4 SCOPUS ISI
  • Chaloupka, J., Chaloupka, Z.: Czech Artificial Computerized Talking Head George, In Lecture Notes in Artificial Intelligence, LNAI 5641, Springer-Verlag Berlin, 2009, pp. 324-330, ISBN 978-3-642-03319-3 SCOPUS ISI
  • Silovsky, J., Cerva, P., Zdansky, J.: MLLR Transforms Based Speaker Recognition in Broadcast Streams, LNAI 5641, Springer-Verlag Berlin, 2009, pp. 423-431, ISBN 978-3-642-03319-3 SCOPUS ISI
  • Nouza, J., Cerva, P., Zdansky, J.: Very Large Vocabulary Voice Dictation for Mobile Devices, In proc. of Interspeech 2009 - Speech and Intelligence, 6-10 September 2009, Brighton, UK, pp. 995-998, ISSN 1990-9772 SCOPUS ISI
  • Silovský, J., Červa, P., Žďánský, J.: Comparison of Generative and Discriminative Approaches for Speaker Recognition with Limited Data. In RADIOENGINEERING, Vol. 18, No. 3, September 2009, pp. 307-316, ISSN 1210-2512, 2009 SCOPUS ISI
  • Nouza, J., Silovský, J.: Fast Keyword Spotting in Telephone Speech. In RADIOENGINEERING, Vol. 18, No. 4, December 2009, pp. 665-670, ISSN 1210-2512, 2009 SCOPUS ISI
  • P. Tichavský, A. Yeredor, and Z. Koldovský: "A Fast Asymptotically Efficient Algorithm for Blind Separation of a Linear Mixture of Block-Wise Stationary Autoregressive Processes," ICASSP 2009, pp. 3133-3136, ISBN: 978-1-4244-2354-5, ISSN: 1520-6149, Taipei, Taiwan, April 2009. SCOPUS ISI
  • J. Petkov and Z. Koldovský: "BSSGUI A Package for Interactive Control of Blind Source Separation Algorithms in MATLAB," in Cross-Modal Analysis of Speech, Gestures, Gaze and Facial Expressions (Eds.: A. Esposito and R. Vích), pp. 386-398, ISBN: 978-3-642-03319-3, ISSN: 0302-9743, Springer Berlin / Heidelberg, July 2009. SCOPUS ISI
  • Z. Koldovský, J. Málek, P. Tichavský, Y. Deville, and S. Hosseini: "Blind Separation of Piecewise Stationary NonGaussian Sources", Signal Processing, Volume 89, Issue 12, Pages 2570-2584, ISSN 0165-1684, December 2009 SCOPUS ISI
  • Kroul, M.: Automatic Detection of Emphasized Words for Performance Enhancement of a Czech ASR System, In: Proc. of 13th International Conference Speech and Computer (Specom 2009), St. Petersburg, Russia, 2009, pp. 470-473, ISBN 978-5-8088-0442-5
  • Chuong, N., T., Chaloupka, J.: Improvement of Constraint in Active Appearance Model Fitting Algorithm and Its Application in Face Tracking, In: Proc. of 9th International Workshop on Electronics, Control, Modelling, Measurement and Signals, Mondragon, Spain, 2009, pp. 35-40, ISBN 978-84-608-0941-8
  • Silovsky, J., Cerva, P.: Analysis of Eigenchannel Adaptation in Broadcast News Speaker Recognition System, In: Proc. of 9th International Workshop on Electronics, Control, Modelling, Measurement and Signals, Mondragon, Spain, 2009, pp. 165-171, ISBN 978-84-608-0941-8
  • Cerva, P., Zdansky, J., Silovsky, J.: Study on the Use of Speaker Adaptation Methods for Motor-handicapped Persons with a Speech Defect, In: Proc. of 9th International Workshop on Electronics, Control, Modelling, Measurement and Signals, Mondragon, Spain, 2009, pp. 67-73, ISBN 978-84-608-0941-8
  • Chaloupka, J.: Artificial Interpreter, In: proc. of 19th Czech-German Workshop Speech Processing, September, 2009, Prague, Czech Republic, pp. 48-51, ISBN 978-80-86269-18-4, 2009
  • Callejas, Z., Nouza, J., Červa, P., López-Cózar, R.: Cost-Efficient Cross-Lingual Adaptation of a Speech Recognition System. In Advances in Intelligent and Soft Computing 57, Computer Recognition Systems 3, Springer-Verlag Berlin, 2009, pp. 331-338, ISSN 1867-5662, 2009
  • Nouza, J.: Využití hlasových technologií v praxi, In book: Uživatelská přívětivá rozhraní, publisher: Horava and Associates, Prague, CZ, 2009, pp. 150-163, ISBN 978-80-254-5295-0
  • Nouza, J. et al.: Řeč a počítač: principy hlasové komunikace, úlohy, metody a aplikace, publisher: Technical University of Liberec, editors: Jan Nouza, Zbyněk Koldovský, Robert Vích, publication number: 55 110-09, first edition, 238 pages, ISBN 978-80-7372-548-8
  • Z. Koldovský and P. Tichavský: "A Comparison of Independent Component and Independent Subspace Analysis Algorithms," EUSIPCO 2009, pp. 1447-1451, Glasgow, Scotland, August 24-28, 2009.

2008

  • Zdansky, J., Chaloupka, J., Nouza, J.: Joint Audio-Visual Processing, Representation and Indexing of TV News Programmes, In Proceedings of IEEE 10th Workshop on Multimedia Signal Processing (MMSP 2008), 8-10 October 2008, Cairns, Australia, pp. 960-965, ISBN 978-1-4244-2295-1 SCOPUS ISI
  • Málek, J., Koldovský, Z., Žďánský, J., Nouza, J.: Enhancement of Noisy Speech Recordings via Blind Source Separation. In Proceedings of the 9th Annual Conference of the International Speech Communication Association, (Interspeech 2008), pp. 159-162, ISSN: 1990-9772, September 22-26, Brisbane, Australia, 2008 SCOPUS ISI
  • Nouza, J., Silovsky, J., Zdansky, J., Cerva, P., Kroul, M., Chaloupka, J.: Czech-to-Slovak Adapted Broadcast News Transcription System. In Proceedings of the 9th Annual Conference of the International Speech Communication Association, (Interspeech 2008), pp. 2683-2686, ISSN: 1990-9772, September 22-26, Brisbane, Australia, 2008 SCOPUS ISI
  • Cerva, P., Zdansky, J., Silovsky, J., Nouza, J.: Study on Speaker Adaptation Methods in the Broadcast News Transcription Task. In: Lecture Notes in Artificial Intelligence, Text, Speech and Dialogue, LNAI 5246, Springer-Verlag, 2008, pp. 277-284, ISSN 0302-9743 SCOPUS ISI
  • Lopez-Cozar, R., Callejas, Z., Kroul, M., Nouza, J., Silovsky, J.: Two-Level Fusion to Improve Emotion Classification in Spoken Dialogue Systems. In: Lecture Notes in Artificial Intelligence, Text, Speech and Dialogue, LNAI 5246, Springer-Verlag, 2008, pp. 617-624, ISSN 0302-9743 SCOPUS ISI
  • Vích, R., Nouza, J., Vondra, M.: Automatic Speech Recognition Used for Intelligibility Assessment of Text-to-Speech Systems, In LNAI 5042, Springer-Verlag Berlin, pp. 136-148, 2008 SCOPUS ISI
  • TICHAVSKY, P., KOLDOVSKY, Z., YEREDOR, A., HERRERO, G., G., DORON, E.: "A Hybrid Technique for Blind Non-Gaussian and Time-Correlated Sources Using a Multicomponent Approach", In IEEE Trans. on Neural Networks, pages: 421-430, ISSN: 1045-9227, 2008 SCOPUS ISI
  • Chaloupka, J.: Various Methods for Visual Speaker Identification for Automatic Continuous Speech Recognition in TV Broadcast Program, In the 6th International Conference on Informatics and Systems, IEEE, Egypt, pp. MM 1-5, ISBN 977-403-290-X, 2008
  • Nouza, J., Zdansky, J.: Automatic Alignment between Speech Records and Their Text Transcriptions for Audio Archive Indexing and Searching, In the 6th International Conference on Informatics and Systems, IEEE, Egypt, pp. MM 6-12, ISBN 977-403-290-X, 2008
  • Zdansky, J.: SDROLA: An Efficient Strategy for Distributed, Accurate Indexing of Spoken Documents, In the 6th International Conference on Informatics and Systems, IEEE, Egypt, pp. PAR 24-28, ISBN 977-403-290-X, 2008
  • Chaloupka, J., Nouza, J., Zdansky, J.: Audio-Visual Voice Command Recognition in Noisy Conditions, In Proceedings of International Conference on Auditory-Visual Speech Processing (AVSP 2008), 26-29 September 2008, Australia, pp. 25-30, ISBN 978-0-646-49504-0
  • Cerva, P., Nouza, J.: MyDictate - praktický program pro diktování do počítače, In: Internet a informační systémy pro osoby se specifickými potřebami - INSPO 2008, Prague, March 8, 2008
  • Nouza, J., Zdansky, J., Cerva, P.: Automatic collection, annotation and indexing of Czech broadcast speech, In Perspectives on Slavistics III Conference, Hamburg, pp. 45-46, August 28-31, 2008

2007

  • CERVA, P., NOUZA, J.: Design and Development of Voice Controlled Aids for Motor-Handicapped Persons, In: Conference of the International Speech Communication Association (Interspeech 2007), pp. 2521 - 2524, August 2007. ISSN: 1990-9772 SCOPUS ISI
  • KOLDOVSKY, Z., TICHAVSKY, P.: "Time-Domain Blind Audio Source Separation using Advanced ICA Methods", Proceedings of 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), pp. 846-849, August 2007. ISSN: 1990-9772 SCOPUS ISI
  • CHALOUPKA, J.: Extraction of the Visual Features from the Audio-Visual Speech Signal and the Utilization of these Features for the Speaker Identification, In: Advances in Soft Computing 45, Computer Recognition Systems 2, Springer-Verlag, pp. 413-420, ISSN 1615-3871 SCOPUS
  • KROUL, M.: Automatic Speech Segmentation Based on HMM. In: Radioengineering - June 2007, Volume 16, Nr. 2, ISSN 1210-2512 ISI
  • MALEK, J., NOUZA, J., KLIMOVIČ, T.: Automatic Classifiers for Medical Data from Doppler Unit, In: Radioengineering Vol. 16, No. 2, June 2007, pp. 62-66, ISSN 1210-2512 ISI
  • CHALOUPKA, J., ZDANSKY, J.: STV: An Efficient Tool for Fast Broadcast Programs Transcript Acquisition, In: 8th International Workshop on Electronics, Control, Modelling, Measurement and Signals - ECMS 2007, Liberec, Czech Republic, May 21-23, 2007, pp. 62-66, ISBN 978-80-7372-218-0
  • NOUZA, J., ZDANSKY, J., CHALOUPKA, J., CERVA, P., DRABKOVA, J., KOLDOVSKY, Z., NEJEDLOVA, D., KROUL, M., SILOVSKY, J.: Speech Technology Research and Development at Technical University of Liberec - State in 2007, In: 8th International Workshop on Electronics, Control, Modelling, Measurement and Signals - ECMS 2007, Liberec, Czech Republic, May 21-23, 2007, pp. 18-26, ISBN 978-80-7372-218-0
  • MALEK, J., KOLDOVSKY, Z., HOSSEINI, S., DEVILLE, Y.: A Variant of EFICA Algorithm With Adaptive Parametric Density Estimator, In: 8th International Workshop on Electronics, Control, Modelling, Measurement and Signals - ECMS 2007, Liberec, Czech Republic, May 21-23, 2007, pp. 79-84, ISBN 978-80-7372-218-0
  • ZDANSKY, J.: A Cluster-based System for Fast Automatic Transcription of Large Spoken Document Archives, In: 8th International Workshop on Electronics, Control, Modelling, Measurement and Signals - ECMS 2007, Liberec, Czech Republic, May 21-23, 2007, pp. 109-112, ISBN 978-80-7372-218-0
  • HOLADA, M., PELC, M., KOPETSCHKE, I., PIRKL, P., MATELA, L., STILEC, J.: Voice Interactive Control System for Robots with Distributed Components, In: 8th International Workshop on Electronics, Control, Modelling, Measurement and Signals - ECMS 2007, Liberec, Czech Republic, May 21-23, 2007, pp. 160-163, ISBN 978-80-7372-218-0
  • HOLADA, M., KOPETSCHKE, I., PIRKL, P., PELC, M., MATELA, L., HORCICKA, J., STILEC, J.: THE PROTOTYPE OF HUMAN – ROBOT INTERACTIVE VOICE CONTROL SYSTEM. In: Proc. of the Fourth International Conference on Informatics in Control, Automation and Robotics (ICINCO 2007), May, 2007, Angers, France, vol. RA-1, pp. 307-310, ISBN: 978-972-8865-83-2
  • MALEK, J.: BAYESIAN CLASSIFIER FOR MEDICAL DATA FROM DOPPLER UNIT, Acta Polytechnica Vol. 46, No. 4/2006, pp. 21-22, ISSN 1210-2709
  • TICHAVSKY, P., KOLDOVSKY, Z., OJA, E.: "Speed and Accuracy Enhancement of Linear ICA Techniques Using Rational Nonlinear Functions", Proceedings of 7th International Conference on Independent Component Analysis (ICA2007), pp. 285-292, Sept. 2007. ISBN 978-3-540-74493-1
  • KOLDOVSKY, Z., TICHAVSKY, P.: "Blind Instantaneous Noisy Mixture Separation with Best Interference-plus-noise Rejection", Proceedings of 7th International Conference on Independent Component Analysis (ICA2007), pp. 730-737, Sept. 2007. ISBN 978-3-540-74493-1
  • HERRERO, G., G., KOLDOVSKY, Z., TICHAVSKY, P., EGIAZARIAN, K.: "A Fast Algorithm for Blind Separation of Non-Gaussian and Time-Correlated Signals", Proceedings of 15th European Signal Processing Conference (EUSIPCO 2007), pp. 1731-1735, Sept 2007. ISBN 978-83-921340-2-2
  • NEJEDLOVÁ, D., ŽĎÁNSKY, J.: INITIAL RESEARCH IN TOPIC-DEPENDENT LANGUAGE MODEL FOR CZECH BROADCAST NEWS TRANSCRIPTION. In: 17th Czech-German Workshop Speech Processing, September, 2007, Prague, Czech Republic, pp. 24-30, ISBN 978-80-86269-00-9
  • CHALOUPKA, J.: New Version of Czech Computerized Talking Head, In: 17th Czech-German Workshop Speech Processing, September, 2007, Prague, Czech Republic, pp. 173-176, ISBN 978-80-86269-00-9
  • SILOVSKY, J., CERVA, P., ZDANSKY, J.: Text-Independent Speaker Verification Supported by ASR, In: 17th Czech-German Workshop Speech Processing, September, 2007, Prague, Czech Republic, pp. 31-36, ISBN 978-80-86269-00-9
  • NOUZA, J., VICH, R., VONDRA, M.: Can ASR Be Used for Evaluating Speech Quality? In: 17th Czech-German Workshop Speech Processing, September, 2007, Prague, Czech Republic, pp. 115-121, ISBN 978-80-86269-00-9
  • KROUL, M., NOUZA, J.: Voice Anonymization, In: 17th Czech-German Workshop Speech Processing, September, 2007, Prague, Czech Republic, pp. 134-137, ISBN 978-80-86269-00-9
  • DRABKOVA, J.: Voice Operated Testing of Knowledge Built in the Lotos System, In: 17th Czech-German Workshop Speech Processing, September, 2007, Prague, Czech Republic, pp. 19-23, ISBN 978-80-86269-00-9
  • HOLADA, M., PELC, M.: Optimizing Distributed Speech Recognition System for Multi-User Real-Time Usage, In: Speech and Computer International Conference - Specom 2007, October, 2007, Moscow, Russia, pp. 455-459, ISBN 6-7452-0110-X
  • CHALOUPKA, J.: Visual Speaker Identification for the Automatic TV Broadcast News Transcription, In: Speech and Computer International Conference - Specom 2007, October, 2007, Moscow, Russia, pp. 639-644, ISBN 6-7452-0110-X
  • NOUZA, J., CHALOUPKA, J., ZDANSKY, J., SILOVSKY, J., KROUL, M., MADER, Z.: Voice Controlled Center for Homes of Motor-Handicapped Persons, In: Speech and Computer International Conference - Specom 2007, October, 2007, Moscow, Russia, pp. 714-719, ISBN 6-7452-0110-X
  • ZDANSKY, J.: Acoustic Model Management Strategies for Improved Automatic Transcription of Broadcast Programs. In: Speech and Computer International Conference - Specom 2007, October, 2007, Moscow, Russia, pp. 503-508, ISBN 6-7452-0110-X
  • CALLEJAS, Z., NOUZA, J., CERVA, P., LOPEZ-COZAR, R.: "MyVoice goes Spanish. Cross-lingual adaptation of a voice controlled PC tool for handicapped people", Procesamiento del Lenguaje Natural vol. 39 (2007), pp. 277-278. ISSN 1135-5948


2006

  • KOLDOVSKÝ, Z., TICHAVSKÝ, P.: Methods of Fair Comparison of Performance of Linear ICA Techniques in Presence of Additive Noise, In: ICASSP 2006, May, 2006, Toulouse, France, no. V., pp. 873-876, ISBN 1-4244-0469-X SCOPUS ISI
  • CHALOUPKA, J.: Visual Speech Segmentation and Speaker Recognition for Transcription of TV News. In: International Conference on Spoken Language Processing Interspeech 2006 - ICSLP 2006, September, 2006, Pittsburgh, USA, pp. 1284-1287, ISSN 1990-9772 SCOPUS ISI
  • KOLDOVSKÝ, Z., NOUZA, J., KOLORENČ, J.: Continuous Time-Frequency Masking Method for Blind Speech Separation with Adaptive Choice of Threshold Parameter Using ICA. In: International Conference on Spoken Language Processing Interspeech 2006 - ICSLP 2006, September, 2006, Pittsburgh, USA, pp. 2578-2581, ISSN 1990-9772 SCOPUS ISI
  • ŽĎÁNSKY, J.: BINSEG: An Efficient Speaker-based Segmentation Technique. In: International Conference on Spoken Language Processing Interspeech 2006 - ICSLP 2006, September, 2006, Pittsburgh, USA, pp. 2182-2185, ISSN 1990-9772 SCOPUS ISI
  • ČERVA, P., NOUZA, J., SILOVSKÝ, J.: Two-Step Unsupervised Speaker Adaptation Based on Speaker and Gender Recognition and HMM Combination. In: International Conference on Spoken Language Processing Interspeech 2006 - ICSLP 2006, September, 2006, Pittsburgh, USA, pp. 2326-2329, ISSN 1990-9772 SCOPUS ISI
  • NOUZA, J., ŽĎÁNSKY, J., ČERVA, P., KOLORENČ, J.: Continual On-line Monitoring of Czech Spoken Broadcast Programs. In: International Conference on Spoken Language Processing Interspeech 2006 - ICSLP 2006, September, 2006, Pittsburgh, USA, pp. 1650-1653, ISSN 1990-9772 SCOPUS ISI
  • NOUZA, J., ŽĎÁNSKY, J., ČERVA, P., KOLORENČ, J.: A System for Information Retrieval from Large Records. In: Lecture Notes in Artificial Intelligence, Text, Speech and Dialogue, LNAI 3206, Springer-Verlag, pp. 485-492, ISBN 3-540-39090-1 SCOPUS ISI
  • SILOVSKÝ, J., NOUZA, J.: Speech, Speaker and Speaker's Gender Identification in Automatically Processed Broadcast Stream. In: Radioengineering, Proceedings of Czech and Slovak Technical Universities and URSI Committees, Volume 15, Number 3, September 2006, pp. 42-48, ISSN 1210-2512 ISI
  • CHALOUPKA, J.: Fast Recognition of the Visual Speech Signal with the Help of the HMMs. In: Proc. of Radioelektronika 2006, April 2006, Bratislava, Slovak Republic, pp. 165-168, ISBN 80-227-2388-6
  • MÁLEK, J.: Classification of Medical Data from Doppler Probe. In: 10th International Student Conference on Electrical Engineering - POSTER 2006, CD-ROM Proceedings, May 2006, Prague, Czech Republic
  • KROUL, M.: Prosody Analysis for Automatic Transcription of Speech. In: 10th International Student Conference on Electrical Engineering - POSTER 2006, CD-ROM Proceedings, May 2006, Prague, Czech Republic
  • CHALOUPKA, J.: CREATION AND SELECTION OF THE VISUAL FRONT END FEATURES AND THE AUDIO-VISUAL FEATURE FUSION FOR AUDIO-VISUAL SPEECH RECOGNITION. In: Speech and Computer International Conference - Specom 2006, June, 2006, St. Petersburg, Russia, pp. 499-502, ISBN 5-7452-0074-x
  • ŽĎÁNSKÝ, J.: SPEAKER CHANGE DETECTION VIA BINARY SEGMENTATION TECHNIQUE AND INFORMATIONAL APPROACH. In: Speech and Computer International Conference - Specom 2006, June, 2006, St. Petersburg, Russia, pp. 386-389, ISBN 5-7452-0074-x
  • KOLORENČ, J., NOUZA, J., ČERVA, P.: MULTI-WORDS IN THE CZECH TV/RADIO NEWS TRANSCRIPTION SYSTEM. In: Speech and Computer International Conference - Specom 2006, June, 2006, St. Petersburg, Russia, pp. 70-74, ISBN 5-7452-0074-x
  • ČERVA, P., NOUZA, J., KOLORENČ, J., DAVID, P.: IMPROVED TRANSCRIPTION OF CZECH PARLIAMENT SPEECHES BY ACOUSTIC AND LANGUAGE MODEL ADAPTATION. In: Speech and Computer International Conference - Specom 2006, June, 2006, St. Petersburg, Russia, pp. 103-106, ISBN 5-7452-0074-x
  • DRÁBKOVÁ, J., NEJEDLOVÁ, D.: Class-based language model application for Czech language. In: 16th Czech-German Workshop Speech Processing, September, 2006, Prague, Czech Republic, ISBN 80-86269-15-9
  • CHALOUPKA, J.: Multimodal speech processing and recognition for the creation of the communicative-interactive systems. In: 16th Czech-German Workshop Speech Processing, September, 2006, Prague, Czech Republic, ISBN 80-86269-15-9
  • NOUZA, J.: An introductory course on speech processing for undergraduate students. In: 16th Czech-German Workshop Speech Processing, September, 2006, Prague, Czech Republic, ISBN 80-86269-15-9
  • SILOVSKÝ, J., ČERVA, P.: Study on speaker recognition aided broadcast streams transcription. In: 16th Czech-German Workshop Speech Processing, September, 2006, Prague, Czech Republic, ISBN 80-86269-15-9
  • BOŘIL, H., ČERVA, P., ŽĎÁNSKÝ, J.: Lombard speech recognition: A comparative study. In: 16th Czech-German Workshop Speech Processing, September, 2006, Prague, Czech Republic, ISBN 80-86269-15-9
  • DRÁBKOVÁ, J.: Tvorba jazykového modelu založeného na třídách. Dissertation, FM, TUL, Liberec 2006
  • NEJEDLOVÁ, D.: Tvorba slovníků a jazykových modelů pro automatický přepis zpravodajských pořadů. Dissertation, FM, TUL, Liberec 2006
  • DAVID, P.: Identifikace audiosegmentů pro automatickou transkripci zpravodajských pořadů. Dissertation, FM, TUL, Liberec 2006


2005

  • ŽĎÁNSKÝ, J.: Novel Algorithm for Speaker Segmentation of TV Broadcast News. In: Proc. of Radioelektronika 2005, May 2005, Brno, Czech Republic, pp. 354-357, ISBN 80-214-2904-6
  • CHALOUPKA, J.: Extraction of the Visual Features by Discrete Cosine Transform for Audio-Visual Speech Recognition. In: Proc. of Radioelektronika 2005, May 2005, Brno, Czech Republic, pp. 467-470, ISBN 80-214-2904-6
  • ČERVA, P.: REDUCTION OF UNIMPORTANT GAUSSIAN COMPONENTS IN SPEAKER ADAPTED CONTINUOUS SPEECH RECOGNITION SYSTEMS. In: 7th International Workshop on Electronics, Control, Modelling, Measurement and Signals, CD proceedings, May 17-20, 2005, Toulouse, France
  • KOLORENČ, J.: Enhancing Czech Speech Recognizer with Morphological Analyzer. In: 7th International Workshop on Electronics, Control, Modelling, Measurement and Signals, CD proceedings, May 17-20, 2005, Toulouse, France
  • CHALOUPKA, J., NOUZA, T.: The Multimodal Project of the Artificial Conversation Agent Chatter Using the Graphic Designing and Developing Voice Dialog System LOTOS. In: 7th International Workshop on Electronics, Control, Modelling, Measurement and Signals, CD proceedings, May 17-20, 2005, Toulouse, France
  • DAVID, P., ČERVA, P., NOUZA, J.: OPTIMIZED CONFIGURATION OF SPEAKER RECOGNITION SYSTEM FOR BROADCAST NEWS TRANSCRIPTION. In: 7th International Workshop on Electronics, Control, Modelling, Measurement and Signals, CD proceedings, May 17-20, 2005, Toulouse, France
  • NOUZA, J., ŽĎÁNSKÝ, J., DAVID, P., ČERVA, P., KOLORENČ, J., NEJEDLOVÁ, D.: Fully Automated System for Czech Spoken Broadcast Transcription with Very Large (300K+) Lexicon. In: Interspeech 2005, September, 2005, Lisboa, Portugal, pp. 1681-1684, ISSN 1018-4074
  • ŽĎÁNSKÝ, J., NOUZA, J.: Detection of Acoustic Change-Points in Audio Records via Global BIC Maximization and Dynamic Programming. In: Interspeech 2005, September, 2005, Lisboa, Portugal, pp. 669-672, ISSN 1018-4074
  • ZIBERT, J., MIHELIC, F., MARTENS, J.-P., MEINEDO, H., NETO, J., DOCIO, L., GARCIA-MATEO, C., DAVID, P., ZDANSKY, J., PLEVA, M., CIZMAR, A., ZGANK, A., KACIC, Z., TELEKI, C., VICSI, K.: The COST278 Broadcast News Segmentation and Speaker Clustering Evaluation - Overview, Methodology, Systems, Results. In: Interspeech 2005, September, 2005, Lisboa, Portugal, pp. 629-632, ISSN 1018-4074
  • ČERVA, P., NOUZA, J.: Supervised and unsupervised speaker adaptation in large vocabulary continuous speech recognition of Czech. In: TSD 2005, September, 2005, Karlovy Vary, Czech Republic, pp. 203-210, ISBN 3-540-28789-2
  • NOUZA, J.: Discrete and Fluent Voice Dictation in Czech Language. In: TSD 2005, September, 2005, Karlovy Vary, Czech Republic, pp. 273-280, ISBN 3-540-28789-2
  • ČERVA, P., DAVID, P., NOUZA, J.: Acoustic Modeling Based on Speaker Recognition and Adaptation for Improved Transcription of Broadcast Programs. In: Specom 2005, October, 2005, Patras, Greece, pp. 183-186, ISBN 5-7452-0110-x
  • NOUZA, J., NOUZA, T., ČERVA, P.: A Multi-Functional Voice-Control Aid for Disabled Persons. In: Specom 2005, October, 2005, Patras, Greece, pp. 715-718, ISBN 5-7452-0110-x
  • CHALOUPKA, J.: Fast Method for Extraction of the Visual Speech Features for Audio-Visual Speech Recognition. In: Specom 2005, October, 2005, Patras, Greece, pp. 215-218, ISBN 5-7452-0110-x
  • KOLORENČ, J.: Automatic Punctuation of Automatically Recognized Speech. In: Electronic Speech Signal Processing 2005, September, 2005, Prague, Czech Republic, pp. 291-297, ISBN 3-938863-17-X
  • DRÁBKOVÁ, J.: Punctuation Effect on Classed-Based Language Model for Czech Language. In: Electronic Speech Signal Processing 2005, September, 2005, Prague, Czech Republic, pp. 267-272, ISBN 3-938863-17-X
  • NEJEDLOVÁ, D., DRÁBKOVÁ, J., KOLORENČ, J., NOUZA, J.: Lexical, Phonetic, and Grammatical Aspects of Very-Large-Vocabulary Continuous Speech Recognition of Czech Language. In: Electronic Speech Signal Processing 2005, September, 2005, Prague, Czech Republic, pp. 224-231, ISBN 3-938863-17-X
  • CHALOUPKA, J.: Czech collection of the Visemes for the Automatic Audio-Visual Speech Recognition. In: Electronic Speech Signal Processing 2005, September, 2005, Prague, Czech Republic, pp. 219-223, ISBN 3-938863-17-X
  • HOLADA, M., SILOVSKÝ, J.: The PDF Based Compression Methods for Features Vectors in DSR Systems. In: Electronic Speech Signal Processing 2005, September, 2005, Prague, Czech Republic, pp. 232-236, ISBN 3-938863-17-X
  • HOLADA, M., PELC, M.: Distributed Speech Recognition System Using Parallel Processing. In: Electronic Speech Signal Processing 2005, September, 2005, Prague, Czech Republic, pp. 273-276, ISBN 3-938863-17-X
  • NOUZA, J., ČERVA, P., ŽĎÁNSKÝ, J., KOLORENČ, J., DAVID, P.: Towards automatic transcription of parliament speech. In: Electronic Speech Signal Processing 2005, September, 2005, Prague, Czech Republic, pp. 237-244, ISBN 3-938863-17-X
  • ŽĎÁNSKÝ, J.: Detection of Acoustic Change-Points in Audio Streams and Signal Segmentation. In Radioengineering, vol. 14, no. 1, April 2005, pp. 37-40, ISSN 1210-2512
  • HOLADA, M., NOUZA, J., ČERVA, P., NOUZA, T., PELC, M.: Distributed Recognition Used as Platform for Public Testing of Speech Technology Applications. In ASIDE2005 ISCA ITRW and COST278 Final Workshop on Applied Spoken Language Interaction in Distributed Environments, November, 2005, Aalborg University, Aalborg, Denmark, ISBN: 87-90834-85-2
  • NOUZA, J., NOVOTNÝ, Z., KOLORENČ, J., SVAČINA, Š.: MOŽNOSTI POUŽITÍ MODERNÍCH HLASOVÝCH TECHNOLOGIÍ V LÉKAŘSKÉ PRAXI
  • CHALOUPKA, J.: Rozpoznávání akustického signálu řeči s podporou vizuální informace. Dissertation, FM, TUL, Liberec 2005
  • ŽĎÁNSKÝ, J.: Metody detekce změny mluvčího v akustickém signálu. Dissertation, FM, TUL, Liberec 2005


2004

  • ČERVA, P.: Study on Different Speaker Adaptation Approaches in Isolated-Word Speech Recognition of Czech. In: Proc. of 14th Czech-German Workshop "Speech Processing", September 2004, Prague, Czech Republic, pp. 61-65, ISBN 80-86269-11-6
  • DRÁBKOVÁ, J., HOLADA, M., NOUZA, J., HORÁK, P., NOUZA, T.: New Version of Phone Dialogue Information System InfoCity. In: Proc. of 14th Czech-German Workshop "Speech Processing", September 2004, Prague, Czech Republic, pp. 66-71, ISBN 80-86269-11-6
  • DAVID, P., ČERVA, P., NOUZA, J.: Speaker Recognition Applied for Enhanced Broadcast News Transcription. In: Proc. of 14th Czech-German Workshop "Speech Processing", September 2004, Prague, Czech Republic, pp. 72-76, ISBN 80-86269-11-6
  • CHALOUPKA, J.: Initial Experiments with Audio-Visual Isolated Words Recognition. In: Proc. of 14th Czech-German Workshop "Speech Processing", September 2004, Prague, Czech Republic, pp. 77-81, ISBN 80-86269-11-6
  • NEJEDLOVÁ, D.: Lexicon and Language Model Building for Czech Very-Large-Vocabulary Speech recognition. In: Proc. of 14th Czech-German Workshop "Speech Processing", September 2004, Prague, Czech Republic, pp. 82-92, ISBN 80-86269-11-6
  • KOLORENČ, J., KLIMOVIČ, T.: Cardiology Language Model for Voice Dictation. In: Proc. of 14th Czech-German Workshop "Speech Processing", September 2004, Prague, Czech Republic, pp. 93-97, ISBN 80-86269-11-6
  • KOLÁŘ, P.: An Extensible Morphology Module of Czech. In: Proc. of 14th Czech-German Workshop "Speech Processing", September 2004, Prague, Czech Republic, pp. 98-101, ISBN 80-86269-11-6
  • HOLADA, M.: The experiences and usability of distributed speech recognition system DUNDIS. In: Proc. of 14th Czech-German Workshop "Speech Processing", September 2004, Prague, Czech Republic, pp. 159-162, ISBN 80-86269-11-6
  • VANDECATSEYE, A., MARTENS, J., NETO, J., MEINEDO, H., MATEO, C., DIEGUEZ, J., MIHELIC, F., ZIBERT, J., NOUZA, J., DAVID, P., PLEVA, M., CIZMAR, A., PAPAGEORGIOU, H., ALEXANDRIS, C.: The COST278 pan-European Broadcast News Database. In Proc. of the LREC 2004, Lisbon, Portugal, May 2004, pp. 873-876, ISBN 2-9517408-1-6
  • NOUZA, J., NEJEDLOVÁ, D., ŽĎÁNSKÝ, J., KOLORENČ, J.: Very Large Vocabulary Speech Recognition System for Automatic Transcription of Czech Broadcast. In: Proc. of ICSLP 2004, October 2004, Jeju Island, Korea, pp. 409-412, ISSN 1225-441x
  • ŽĎÁNSKÝ, J., DAVID, P., NOUZA, J.: An Improved Preprocessor for the Automatic Transcription of Broadcast News Audio Stream. In: Proc. of ICSLP 2004, October 2004, Jeju Island, Korea, pp. 1065-1068, ISSN 1225-441x
  • CHALOUPKA, J.: Automatic Lips Reading for Audio-Visual Speech Processing and Recognition. In: Proc. of ICSLP 2004, October 2004, Jeju Island, Korea, pp. 2505-2508, ISSN 1225-441x
  • NOUZA, J., ŽĎÁNSKÝ, J., DAVID, P.: Fully Automated Approach to Broadcast News Transcription in Czech Language. In: Text, Speech and Dialogue. Lecture Notes in Artificial Intelligence. Springer-Verlag Berlin 2004, pp. 401-408, ISBN 3-540-23049-1, ISSN 0302-9743.
  • ČERVA, P., NOUZA, J.: MAP Based Speaker Adaptation in Very Large Vocabulary Speech Recognition of Czech. Radioengineering, September 2004, Vol. 13, No 3, pp. 42-46, ISSN 1210-2512
  • KOLORENČ, J.: Evolving Phonological Rules Using Grammatical Evolution. In: 8th International Student Conference on Electrical Engineering - POSTER 2004 [CD-ROM], May 2004, Prague, Czech Republic
  • ŽĎÁNSKÝ, J., KROUL, M.: Semi-Automatic Non-speech Events Database Formation. In: 8th International Student Conference on Electrical Engineering - POSTER 2004 [CD-ROM], May 2004, Prague, Czech Republic
  • ŽĎÁNSKÝ, J., DAVID, P.: Automatic Audio Segmentation of Tv Broadcast News. In: Proc. of Radioelektronika 2004, April 2004, Bratislava, Slovak Republic, pp. 358-361, ISBN 80-227-2017-8
  • PELOUCH, O., HOLADA, M.: The Compression of Recognition Feature Vectors for Distributed ASR. In: Proc. of Radioelektronika 2004, April 2004, Bratislava, Slovak Republic, pp. 382-385, ISBN 80-227-2017-8
  • CHALOUPKA, J.: Visual Signal Processing for Speech Recognition. In: Proc. of Radioelektronika 2004, April 2004, Bratislava, Slovak Republic, pp. 406-409, ISBN 80-227-2017-8
  • ČERVA, P., NOUZA, J.: Map Based Speaker Adaptation in Large Vocabulary Speech Recognition of Czech Language. In: Proc. of Radioelektronika 2004, April 2004, Bratislava, Slovak Republic, pp. 108-111, ISBN 80-227-2017-8
  • ČERVA, P., ŠKODA, J., NOUZA, J.: Building and Annotating Large Speech Databases for Automatic Speech Recognition. In: Proc. of Radioelektronika 2004, April 2004, Bratislava, Slovak Republic, pp. 386-389, ISBN 80-227-2017-8
  • CHALOUPKA, J., NOUZA, J.: Speech Recognition Supported by Camera Lips Reading. In: Proc. of ICCCT 2004, August 2004, Austin, USA, pp. 116-119, ISBN 980-6560-17-5
  • NOUZA, J., NOUZA, T.: A Voice Dictation System for a Million-Word Czech Vocabulary. In: Proc. of ICCCT 2004, August 2004, Austin, USA, pp. 149-152, ISBN 980-6560-17-5


2003

  • DRÁBKOVÁ, J.: Formation of Classes for Continuous Speech Language Model and Building the Large Tagging Vocabulary for Czech Language. In: Proc. of 13th Czech-German Workshop „Speech Processing", September 2003, Prague, Czech Republic, pp. 121-125, ISBN 80-86269-10-8
  • NEJEDLOVÁ, D.: Construction of a Dictation System for Czech Physicians. In: Proc. of 13th Czech-German Workshop „Speech Processing", September 2003, Prague, Czech Republic, pp. 113-115, ISBN 80-86269-10-8
  • ŽĎÁNSKÝ, J., NOUZA, J.: Experimental Optimization of the Continuous Speech Recognition System. In: Proc. of 13th Czech-German Workshop „Speech Processing", September 2003, Prague, Czech Republic, pp. 129-134, ISBN 80-86269-10-8
  • DAVID, P.: Using TRANSCRIBER Tool for Broadcast News Transcription. In: Proc. of 13th Czech-German Workshop „Speech Processing", September 2003, Prague, Czech Republic, pp. 116-120, ISBN 80-86269-10-8
  • CHALOUPKA, J.: The Face Detection and Lips Tracking for Audio-Visual Speech Recognition. In: Proc. of 13th Czech-German Workshop „Speech Processing", September 2003, Prague, Czech Republic, pp. 141-145, ISBN 80-86269-10-8
  • NEJEDLOVÁ, D., NOUZA, J.: Building of a Vocabulary for the Automatic Voice-Dictation System. In 6th International Conference TSD 2003. České Budějovice, September 2003, Springer-Verlag, Heidelberg, pp. 301-308, ISBN 3-540-20024-X, ISSN 0302-9743.
  • HOLADA, M., NOUZA, J.: VISPER II - Enhanced Version of the Educational Software for Speech Processing Courses. In Proc. of the 8th European Conference on Speech Communication and Technology EuroSpeech 2003. Geneva-Switzerland, September 2003. pp. 3169-3172. ISSN 1018-4074
  • CHALOUPKA, J.: The Czech Computerized Talking Head "Chatter". In Proc. of 7th World Multiconference on Systemics, Cybernetics and Informatics-SCI 2003. Orlando-USA, July 2003. Volume IV. pp. 320-323. ISBN 980-6560-01-9
  • DAVID, P.: Presentation of Real-time System for Automatic Speaker Identification and Verification. In Proc. of 7th World Multiconference on Systemics, Cybernetics and Informatics-SCI 2003. Orlando-USA, July 2003. Volume IV. pp. 372-376. ISBN 980-6560-01-9
  • HOLADA, M.: Internet Speech Recognition Server. In Proc. of 7th World Multiconference on Systemics, Cybernetics and Informatics-SCI 2003. Orlando-USA, July 2003. Volume IV. pp. 388-391. ISBN 980-6560-01-9
  • DRÁBKOVÁ, J.: How good is speech recognition performed by human and by machine? In Proc. of 6th International Workshop on Elektronics, Control, Measurment and Signals-ECMS 2003. Liberec, June 2003. pp. 79-83. ISBN 80-7083-708-X
  • NEJEDLOVÁ, D.: Building and Evaluation of a Large Vocabulary for a Czech Voice Dictation System. In Proc. of 6th International Workshop on Elektronics, Control, Measurment and Signals-ECMS 2003. Liberec, June 2003. pp. 74-78. ISBN 80-7083-708-X
  • NOUZA, J.: Voice Dictation into a PC: Recent Research State at TUL. In Proc. of 6th International Workshop on Elektronics, Control, Measurment and Signals-ECMS 2003. Liberec, June 2003. pp. 69-73. ISBN 80-7083-708-X
  • HOLADA, M.: Design a Prototype of Client – Server Speech Recognition System. In Proc. of 6th International Workshop on Elektronics, Control, Measurment and Signals-ECMS 2003. Liberec, June 2003. pp. 26-29. ISBN 80-7083-708-X
  • DAVID, P.: Unsupervised Segmentation of Audio Recordings. In Proc. of 6th International Workshop on Elektronics, Control, Measurment and Signals-ECMS 2003. Liberec, June 2003. pp. 17-20. ISBN 80-7083-708-X
  • CHALOUPKA, J.: The Czech Audio-Visual Speech Synthesizer System. In Proc. of 6th International Workshop on Elektronics, Control, Measurment and Signals-ECMS 2003. Liberec, June 2003. pp. 30-33. ISBN 80-7083-708-X
  • SEMENEC, P., HOLADA, M.: The Acoustic Model of Phone Line for ASR Databases Recorded by Microphone. In Proc. of Radioelektronika 2003. Brno, May 2003. pp. 364-367. ISBN 80-214-2383-8
  • CHALOUPKA, J.: Multimodal Signal Processing and Research. In Proc. of Radioelektronika 2003. Brno, May 2003. pp. 388-389. ISBN 80-214-2383-8


2002

  • DRÁBKOVÁ, J.: Language model based on the Czech morphology. In Proc. of 12th Czech-German Workshop „Speech Processing". Prague, September 2002. pp. 70-73. ISBN 80-86269-09-4
  • HOLADA, M.: Design of distributed recognition system via Internet. In Proc. of 12th Czech-German Workshop „Speech Processing". Prague, September 2002. pp. 79-83. ISBN 80-86269-09-4
  • DAVID, P.: Presentation of real-time system for automatic speaker identification. In Proc. of 12th Czech-German Workshop „Speech Processing". Prague, September 2002. pp. 74-78. ISBN 80-86269-09-4
  • CHALOUPKA, J.: Development of New Czech 3-D Talking Head. In Proc. of 12th Czech-German Workshop „Speech Processing". Prague, September 2002. pp. 54-58. ISBN 80-86269-09-4
  • NEJEDLOVÁ, D.: Building a 20K Vocabulary and Language Model for Czech Language. In Proc. of 12th Czech-German Workshop „Speech Processing". Prague, September 2002. pp. 66-69. ISBN 80-86269-09-4
  • NOUZA, J., DRÁBKOVÁ, J.: Combining Lexical and Morphological Knowledge in Language Model for Inflectional (Czech) Language. In Proc. of 6th Int. Conference on Spoken Language Processing. Denver USA, September 2002. ISBN 1876346418
  • HOLADA, M., NOUZA, J.: The Experimental Support for Speech Processing and Recognition Teaching - Experimentální podpora pro výuku metod zpracování a rozpoznávání řeči. Moderní směry výuky elektrotechniky a elektroniky. In Proc. of STO-8. Brno, September 2002. pp. 183-186. ISBN 80-214-2190-8
  • NOUZA, J., KOLÁŘ, P., CHALOUPKA, J.: Voice Chat with a Virtual Character: The Good Soldier Švejk Case Project. In Proc. of TSD 2002. Brno, September 2002. pp. 445-448. ISSN 0302-9743
  • NOUZA, J.: Strategies for Developing a Real-Time Continuous Speech Recognition System for Czech Language. In Proc. of TSD 2002. Brno, September 2002. pp. 189-196. ISSN 0302-9743
  • NEJEDLOVÁ, D.: Comparative Study on Bigram Language Models for Spoken Czech Recognition. In Proc. of TSD 2002. Brno, September 2002. pp. 197-204. ISSN 0302-9743
  • NEJEDLOVÁ, D., NOUZA, J.: Language Model Support for Continuous Speech Recognition in Czech Language. In Proc. of IASTED International Conference "SPPRA 2002". Greece, Crete June 2002. pp. 541 - 546. ISBN 0-88986-338-5
  • NOUZA, T., NOUZA, J., DRÁBKOVÁ, J.: An Efficient Graphic System for Developing Voice Operated Applications. In Proc. of SCI 2002. Orlando USA, July 2002, Volume I. pp. 239-244. ISBN 980-07-8150-1
  • CHALOUPKA, J., NOUZA, J., DRÁBKOVÁ, J.: Developing an Artificial Talking Head for Czech Language. In Proc. of SCI 2002. Orlando USA, July 2002, Volume III. pp. 232-236. ISBN 980-07-8150-1
  • CHALOUPKA, J., NOUZA, J., PŘIBIL, J.: Czech-Speaking Artificial Face. In Proc. of Biosignal 2002. Brno, June 2002. pp. 403-405. ISBN 80-214-2120-7
  • CHALOUPKA, J.: Talking Head: How Much Comprehensible Is It? In Proc. of Radioelektronika 2002. Bratislava, May 2002. pp. 202-205. ISBN 80-227-1700-2
  • NOUZA, J.: Building a System for Recognition of Fluently Spoken Czech. In Proc. of Radioelektronika 2002. Bratislava, May 2002. pp. 166-169. ISBN 80-227-1700-2
  • DAVID, P.: Experiments with Speaker Recognition using GMM. In Proc. of Radioelektronika 2002. Bratislava, May 2002. pp. 353-357. ISBN 80-227-1700-2


2001

  • Přibil J., Nouza J.: Application of Speech Synthesis into Automatic Voice Information System. Proc. of ELEKTRO 2001.
  • Nouza J.: Using LEGO Mindstorms in Mechatronics and Artificial Intelligence Projects. Proc. of Mechatronika 2001.
  • Nouza J., Volejník M.: Study on Phoneme Recognition in Spoken Czech. Proc. of Radioelektronika 2001. Brno, April 2001
  • Nouza J.: A Scheme for Improved Key-Phrase Detection and Recognition in the InfoCity System. Proc. of 5th ECM2S workshop. Toulouse, May 2001, pp.237-241.
  • Nouza T., Nouza J.: Graphic Design of Voice Dialogue Applications. Proc. INTERACT2001 conference. Tokyo, July 2001, pp.702-703.
  • Nouza T., Nouza J.: Graphic Platform for designing and developing practical voice interaction systems. Proc. of Eurospeech2001. Aalborg, Sept. 2001, pp.1287-1290. ISBN 87-90834-09-7. ISSN 1018-4074.
  • CHALOUPKA, J., NOUZA, J.: Baldi (talking head) speaking Czech. In Proc. of 11th Czech-German Workshop „Speech Processing". Prague, September 2001. pp. 53-56. ISBN 80-86269-07-8
  • Holada M.: Speech Processing for Duplex Communication. Proc. of 11th Czech-German Workshop „Speech Processing", Prague 2001. ISBN 80-86269-07-8.


2000

  • NEJEDLOVÁ D.: FONETICKÁ TRANSKRIPCE ČEŠTINY POMOCÍ TŘÍVRSTVÉ NEURONOVÉ SÍTĚ [Phonetic Transcription of Czech Using a Three-Layer Neural Network]. Research report No. ISRN-TUL-KES-T-PZ-00-005-C1-CZ, TU Liberec, 2000.
  • NOUZA J., Holada M.: A Voice-Operated Multi-Domain Telephone Information System. Proc. of 25th Int. Conference on Acoustics, Speech and Signal Processing (ICASSP2000), Istanbul, June 2000, vol.VI, pp.3755-3758 (ISBN 0 7803-6296-9)
  • NOUZA J.: Speech Processing Technology Applied in Public Telephone Information Services. Proc. of 4th World Conference on Systemics, Cybernetics and Informatics (SCI 2000), Orlando, July 2000, vol. IV, pp.308-313 (ISBN 980-07-6690-1)
  • NOUZA J., MYSLIVEC M.: Methods and Application of Phonetic Label Alignment in Speech Processing Tasks. Radioengineering, vol.9, no.4, pp. 1-7 (ISSN 1210-2512)
  • NOUZA J.: A Czech Large Vocabulary Recognition System for Real-Time Applications. In Text, Speech and Dialogue (eds. Sojka, Kopecek, Pala) Springer-Verlag, Heidelberg, 2000, pp. 217-222 (ISBN 3-540-66494-7)
  • NOUZA J., NOUZA T.: Improvements and Innovations in a Voice-Operated Telephone Dialogue System. Proc. of Radioelektronika 2000, Bratislava, Sept. 2000, pp.III/100-103 (ISBN 80-227-1389-9)
  • MYSLIVEC M., VOLEJNÍK M.: First experiments with phonetically oriented speech recognition of Czech. Proc. of Radioelektronika 2000, Bratislava, Sept. 2000, pp.III/100-103 (ISBN 80-227-1389-9)
  • NOUZA J.: System for visual and experimental introduction to basic speech recognition algorithms. Proc. of ACL2000 conference, Hong-Kong, October 2000. Companion volume, pp.11-12. (ISBN 1-55860-730-7)
  • NOUZA J.: Telephone Speech Recognition from Large Lists of Czech Words. Proc. of ICSLP2000, Beijing, October 2000, vol. IV, pp.394-397 (ISBN 7-80150-114-4)
  • NOUZA J.: Evaluation Report from Half-Year Trial Run of the Infocity System. In Proc. of 9th Czech-German Workshop „Speech Processing", Prague 2000, pp.45-47. (ISBN 80-86269-03-5)
  • NOUZA J., NOUZA T.: A Program for Computer-Aided Pronunciation Learning of English. In Proc. of 9th Czech-German Workshop „Speech Processing", Prague 2000, pp.48-51. (ISBN 80-86269-03-5)
  • MYSLIVEC M.: A Subjective Recognition Test on Different Types of Sub-Word Units. In Proc. of 9th Czech-German Workshop „Speech Processing", Prague 2000, pp.52-54. (ISBN 80-86269-03-5)
  • NEJEDLOVÁ D., NOUZA J.: Phonetic Transcription of Czech Language Using a NETtalk-type Neural Network. In Proc. of 9th Czech-German Workshop „Speech Processing", Prague 2000. (ISBN 80-86269-05-1)
  • NOUZA J., NOUZA T.: A New Flexible System for Fast Development of Voice Dialogue Applications. In Proc. of 9th Czech-German Workshop „Speech Processing", Prague 2000. (ISBN 80-86269-05-1)
  • VOLEJNÍK M.: Creating Vocabulary and Language Model for Speech Recognition from Czech Newspaper Corpus. In Proc. of 9th Czech-German Workshop „Speech Processing", Prague 2000. (ISBN 80-86269-05-1)
  • MYSLIVEC M.: Experience with Continuous Speech Recognition Based on Subword Units. In Proc. of 9th Czech-German Workshop „Speech Processing", Prague 2000. (ISBN 80-86269-05-1)
  • Nejedlová D., Nouza J.: Phonetic Transcription of Czech Language Using a NETtalk-type Neural Network. Proc. of 10th Czech-German Workshop „Speech Processing", Prague 2000, pp.37-40. (ISBN 80-86269-05-1)
  • Nouza J., Nouza T.: A New Flexible System for Fast Development of Voice Dialogue Applications. Proc. of 10th Czech-German Workshop „Speech Processing", Prague 2000, pp.41-43. (ISBN 80-86269-05-1)
  • Nouza J., Nouza T.: Rozpoznávání řeči již není pouhým snem (Nasazení systému pro rozpoznávání řeči v praxi). Computerworld, vol. 12, no. 1, pp.24-26 (ISSN 1210-9924)


1999

  • NOUZA J.: Teaching and Training through Visualized Speech Processing Experiments. Proc. of MATISSE Workshop, London, April 1999, pp.121-124.
  • MYSLIVEC M., NOUZA J.: Study on Signal Shift in Speech Recognition. Proc. of Radioelektronika'99, Brno, April 1999, pp.164-167.
  • NOUZA J.: Research and Development in Speech Processing at Technical University of Liberec. Proc. of 4th ECMS Workshop, Liberec, May 1999, pp.13-19.
  • NOUZA J., MYSLIVEC M.: Creating and Annotating Speech Database for Continuous Speech Recognition. Proc. of 4th ECMS Workshop, Liberec, May 1999, pp.147-151.
  • NOUZA J.: Computer-Aided Spoken-Language Training with Enhanced Visual and Auditory Feedback. Proc. of Eurospeech'99, Budapest, Sept. 1999, pp.183-186


1998

  • NOUZA J., NOUZA T.: A Spelling Module for a Czech Speech Recognition System. Proc. of Radioelektronika'98, Brno, April 1998, pp.208-211
  • HOLADA M., NOUZA J.: Searching for Methods and Parameters for More Reliable Recognition of Telephone Speech. Proc. of Radioelektronika'98, Brno, April 1998, pp.220-223
  • NOUZA J., MADLIKOVA J.: Evaluation Tests on Visual Feedback in Speech and Language Learning. Proc. of ESCA workshop on Speech Technology in Language Learning (STiLL), Stockholm, May, 1998, pp. 151-154.
  • NOUZA J., HOLADA M.: A City Information System Operating over the Telephone. Proc. of IVTTA'98 Workshop, Torino, Italy, September 1998, pp.141-144.
  • NOUZA J.: Training Speech through Visual Feedback Patterns. Proc. of ICSLP'98, Sydney, Australia, Dec. 1998, pp.3293-3296.


1997

  • HOLADA M., NOUZA J.: A New Version of a Voice Operated Information System. In Proc. of XXth Seminary on ASR, Ostrava, April 1997, p.15.
  • NOUZA J., HAJEK D.: A CDHMM Generator and Its Use in Speech Processing. In Proc. of 3rd ECMS'97 Workshop, Toulouse, France, June 1997, pp.70-79.
  • NOUZA J.: Visual Processing of Speech: Tools for Education, Aids for Handicapped. Proc. of ICSP'97, Seoul, Korea, pp.677-682.
  • HAJEK D., NOUZA J.: A Quasi-Triphone Model Created by Merging Context-Specific Phone Models. Proc. of 8. Konferenz Elektronische Sprachsignalverarbeitung, Cottbus, Germany, August 1997, pp. 85-92.
  • NOUZA J.: Visualization of Dynamic Programming Algorithms in Speech Processing Tasks. Proc. of Algoritmy'97, West Tatra Mountains, Slovakia, September, 1997, pp.182-190.
  • NOUZA J., HOLADA M., HAJEK D.: An Educational and Experimental Workbench for Visual Processing of Speech Data. Proc. of EUROSPEECH'97 Conference, Rhodes, Greece, September 1997, pp.661-664.
  • NOUZA J.: Spectral Variation Functions Applied to Acoustic-Phonetic Segmentation of Speech Signals. In: H.-W. Wodarz (Ed.), Speech Processing (Forum Phoneticum, 63), Frankfurt am Main, 1997, pp.43-58.
  • NOUZA J., PSUTKA J., UHLIR J.: Phonetic Alphabet for Speech Recognition of Czech. Radioengineering, vol.6, no.4, 1997, pp.16-20.


1996

  • NOUZA J.: A Two-level Classification Scheme for CDHMM-based Discrete-Utterance Recognition. Radioengineering, vol.5, no.1, 1996, pp.25-28.
  • NOUZA J., HAJEK D.: Channel Variability Compensation in Speech Recognition by Means of Feature Mean Subtraction. Proc. of RADIOELEKTRONIKA'96, Brno, April 1996, pp.156-159.
  • HAJEK D., NOUZA J.: Speaker Adaptation in HMM Based Speech Recognition. Proc. of RADIOELEKTRONIKA'96, Brno, April 1996, pp.328-331.
  • HAJEK D., NOUZA J.: Unhiding Hidden Markov Models by their Visualization. In Gobel M., David. J., Slavik P. and van Wijk J. (eds.) Virtual Environments and Scientific Visualization '96. Springer-Verlag, Wien - New York, 1996, pp.277-285.
  • NOUZA J.: Feature Selection Methods for Hidden Markov Model Based Speech Recognition. Proc. of 13th Int. Conference on Pattern Recognition (ICPR'96), Vienna, Austria, August 1996, Vol.II, pp.186-190
  • NOUZA J.: Discrete-Utterance Recognition with a Fast Match Based on Total Data Reduction. Proc. of 4th Int. Conference on Spoken Language Processing (ICSLP'96), Philadelphia, USA, October 1996, pp.2107-2110.
  • NOUZA J., HAJEK D.: Speech Training and Motivating Tools for Hearing-Impaired People. Proc. of 7. Konferenz Elektronische Sprachsignalverarbeitung, Berlin, Germany, November 1996, pp. 154-159.
  • HAJEK D., NOUZA J.: Visualization of Data and Procedures in Speech Processing Tasks. Proc. of 7. Konferenz Elektronische Sprachsignalverarbeitung, Berlin, Germany, November 1996, pp. 218-223.
  • NOUZA J.: An Interface for Voice Access to Information Systems. Proc. of 18th Int. Conference on Information Technology Interfaces ITI'96, Pula, Croatia, June 1996, pp.79-84.


1995

  • HAJEK D., NOUZA J.: Robust HMM Training for Speaker Independent Discrete-Utterance Recognition. Proc. of 32nd Conference on Acoustics, Prague, Czech Republic, Sept.1995, pp.41-44.
  • NOUZA J.: On the Speech Feature Selection Problem: Are Dynamic Features More Important Than the Static Ones? Proc. of EUROSPEECH'95 Conference, Madrid, Spain, Sept. 1995, pp. 919-923.
  • NOUZA J.: An Automatic Information System Operating on the Voice Dialogue Base. Proc. of 6. Konferenz Elektronische Sprachsignalverarbeitung, Wolfenbüttel, Germany, September, 1995, pp.145-150.
  • HAJEK D.: Optimized Implementation of Transcendental Functions for Real Time Tasks. In Proc. of int. ECMS workshop, Liberec, June 1995, pp.31-34.