Hynek Boril, Ph.D.


      Research Associate

      Center for Robust Speech Systems (CRSS)

      Erik Jonsson School of Engineering and Computer Science

      The University of Texas at Dallas

Selected Publications


Journal Articles

Boril, H., Hansen, J. H. L. (2010). “Unsupervised Equalization of Lombard Effect for Speech Recognition in Noisy Adverse Environments,” IEEE Transactions on Audio, Speech, and Language Processing, 18(6), 1379-1393. [pdf] [cited] [bib]

Boril, H. and Fousek, P. (2006). “Influence of different speech representations and HMM training strategies on ASR performance,” Acta Polytechnica, Journal on Advanced Engineering 46, 32-35. [pdf] [bib]

Book Reviews in Journals

Boril, H. (2011): “Pavel Machac and Radek Skarnitzl (2009). Foneticka segmentace hlasek. Prague: Epocha Publishing House” in press, Nase Rec (Our Speech), The Institute of Czech Language, Academy of Sciences of the Czech Republic, in Czech, Prague.

Boril, H. (2010): “Pavel Machac and Radek Skarnitzl (2009). Principles of Phonetic Segmentation. Prague: Epocha Publishing House” in: R. Skarnitzl (Ed.), Acta Universitatis Carolinae (AUC) Philologica 1/2009, Phonetica Pragensia XII, Karolinum Publishing House, Prague, pp. 63-64. [pdf] [bib]

Book Chapters

Boril, H., Boyraz, P., Hansen, J. H. L. (2011): DSP (Digital Signal Processing) for In-Vehicle Systems and Safety, chapter “Towards Multi-modal Driver's Stress Detection,” J. H. L. Hansen, P. Boyraz, K. Takeda, H. Abut (Eds.), in press, Springer, New York, 2011.

Conference/Workshop Proceedings

Boril, H., Grezl, F., Hansen, J. H. L. (2011). “Front-End Compensation Methods for LVCSR Under Lombard Effect,” in Proc. of Interspeech'11, 1257-1260, August 28-31 (Florence, Italy). [pdf] [bib]

Boril, H., Sadjadi, O., Hansen, J. H. L. (2011). “UTDrive: Emotion and Cognitive Load Classification for In-Vehicle Scenarios,” accepted to The 5th Biennial Workshop on Digital Signal Processing for In-Vehicle Systems, September 4-7 (Kiel, Germany). [pdf] [bib]

Boril, H., Hansen, J. H. L. (2011). “UT-Scope: Towards LVCSR under Lombard Effect Induced by Varying Types and Levels of Noisy Background,” IEEE ICASSP'11, 4472-4475, Prague, Czech Republic, May 2011. [pdf] [bib]

Boril, H., Hansen, J. H. L., et al. (2011). “A Longitudinal Study of Infant Speech Production Parameters: A Case Study,” LENA Users Conference, April 2011 (Denver, CO). [pdf] [bib]

Boril, H., Sangwan, A., Hasan, T., Hansen, J. H. L. (2010). “Automatic Excitement-Level Detection for Sports Highlights Generation,” in Proc. of Interspeech'10, 2202-2205 (Makuhari, Chiba, Japan). [pdf] [cited] [bib]

Boril, H., Sadjadi, O., Kleinschmidt, T., Hansen, J. H. L. (2010). “Analysis and Detection of Cognitive Load and Frustration in Drivers’ Speech,” in Proc. of Interspeech'10, 502-505 (Makuhari, Chiba, Japan). [pdf] [cited] [bib]

Amuda, S., Boril, H., Sangwan, A., Hansen, J. H. L. (2010). “Limited Resource Speech Recognition for Nigerian English,” in Proc. of IEEE ICASSP'10, 5090-5093 (Dallas, TX). [pdf] [bib]

Mehrabani, M., Boril, H., Hansen, J. H. L. (2010). “Dialect Distance Assessment Method Based on Comparison of Pitch Pattern Statistical Models,” in Proc. of IEEE ICASSP'10, 5158-5161 (Dallas, TX). [pdf] [cited] [bib]

Kleinschmidt, T., Boyraz, P., Boril, H., Sridharan, S., Hansen, J. H. L. (2009). “Assessment of Speech Dialog Systems using Multi-Modal Cognitive Load Analysis and Driving Performance Metrics,” IEEE International Conference on Vehicular Electronics and Safety ICVES'09, 162-167 (Pune, India). [pdf] [cited] [bib]

Boril, H., Hansen, J. H. L. (2009). “Reduced Complexity Equalization of Lombard Effect for Speech Recognition in Noisy Adverse Environments,” in Proc. of Interspeech'09, 1243-1246 (Brighton, UK). [pdf] [bib]

Boril, H., Boyraz, P., Hansen, J. H. L. (2009). “Towards Multi-Modal Driver's Stress Detection,” in Proc. of 4th Biennial Workshop on DSP for In-Vehicle Systems and Safety (Dallas, Texas). [pdf] [cited] [bib]

Boril, H., Krishnamurthy, N., Hansen, J. H. L. (2009). “Online Noise and Lombard Effect Compensation for In-Vehicle Automatic Speech Recognition,” in Proc. of 4th Biennial Workshop on DSP for In-Vehicle Systems and Safety (Dallas, Texas). [pdf] [bib]

Boril, H., Hansen, J. H. L. (2009). “Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environment,” in Proc. of IEEE ICASSP'09, 3937-3940 (Taipei, Taiwan). [pdf] [bib]

Boril, H., Fousek, P., and Höge, H. (2007). “Two-stage system for robust neutral/Lombard speech recognition”, in Proc. of Interspeech'07, 1074-1077 (Antwerp, Belgium). [pdf] [cited] [bib]

Boril, H., Boril, T., and Pollák, P. (2006). “Methodology of Lombard speech database acquisition: Experiences with CLSD”, in Proc. of LREC 2006 - 5th Conference on Language Resources and Evaluation, 1644-1647 (Genova, Italy). [pdf] [cited] [bib]

Boril, H., Fousek, P., and Pollák, P. (2006). “Data-driven design of front-end filter bank for Lombard speech recognition”, in Proc. of ICSLP'06, 381-384 (Pittsburgh, Pennsylvania). [pdf] [cited] [bib]

Boril, H., Fousek, P., Sündermann, D., Cerva, P., and Zdansky, J. (2006). “Lombard speech recognition: A comparative study”, in Proc. 16th Czech-German Workshop on Speech Processing, 141-148 (Prague, Czech Republic). [pdf] [cited] [bib]

Boril, H. (2005). “Automatic Reconstruction of Utterance Boundaries Time Marks in Speech Database Re-grabbed from DAT Recorder.” In Proc. of International Workshop on Digital Technologies 2005, vol. 1, 13-16 (Zilina, Slovakia). [pdf] [bib]

Boril, H. and Pollák, P. (2005). “Comparison of three Czech speech databases from the standpoint of Lombard effect appearance”, in COST278 and ISCA Tutorial and Research Workshop (ITRW) on Applied Spoken Language Interaction in Distributed Environments (ASIDE 2005), Aalborg, Denmark. [pdf] [bib]

Boril, H. and Pollák, P. (2005). “Design and collection of Czech Lombard Speech Database”, in Proc. of Interspeech'05, 1577-1580 (Lisboa, Portugal). [pdf] [cited] [bib]

Boril, H. and Pollák, P. (2004). “Direct time domain fundamental frequency estimation of speech in noisy conditions”, in Proc. EUSIPCO 2004, volume 2, 1003-1006 (Vienna, Austria). [pdf] [cited] [bib]

Lectures/Abstracts/Reports

Boril, H., Sangwan, A., Hasan, T., Hansen, J. H. L. (2010). “Automatic Excitement-Level Detection for Sports Highlights Generation,” in Wireless Long Term Evolution - The Connected World, UT Dallas Research and New Venture Showcase, Poster Presentation (Dallas, TX). [pdf] [bib]

Boril, H., Sadjadi, O., Kleinschmidt, T., Hansen, J. H. L. (2010). “Analysis and Detection of Cognitive Load and Frustration in Drivers’ Speech,” Wireless Long Term Evolution - The Connected World, UT Dallas Research and New Venture Showcase, Poster Presentation (Dallas, TX).

Lei, Y., Hasan, T., Suh, J.-W., Sangwan, A., Boril, H., Gang, L., Godin, K., Zhang, C., Hansen, J. H. L. (2010). “The CRSS Systems for the 2010 NIST Speaker Recognition Evaluation,” NIST 2010 Speaker Recognition Evaluation Workshop, Brno, Czech Republic, 24-25 June 2010. [pdf] [bib]

Boril, H., Kleinschmidt, T., Boyraz, P., Hansen, J. H. L. (2010). “Impact of Cognitive Load and Frustration on Drivers' Speech,” J. Acoust. Soc. Am., Volume 127, Issue 3, p. 1996 (March 2010). Presented at the Joint 159th ASA Meeting and Noise-Con 2010, Baltimore, Maryland, 19-23 April 2010. Invited Lecture. [pdf] [bib]

Boril, H. (2008). “Attributes and Recognition of Lombard Speech,” Invited Lecture, Sound to Sense (S2S) Workshop - Speech in Adverse Conditions (Prague, Czech Republic). [ppt] [bib]

Boril, H. (2007). “Normalization of Lombard effect”, Research Report No. R07-2, 52 pages, Czech Technical University in Prague & Siemens Corporate Technology (Munich, Germany). [bib]

Boril, H. and Pollák, P. (2006). “Czech Lombard Speech Database (CLSD'05)”, Technical Report No. R07-1, 24 pages, Czech Technical University in Prague. [pdf] [bib]

Boril, H. and Pollák, P. (2006). “Pitch-marking Based on the DFE Algorithm.” Lecture, 6th ECESS and TC-STAR WP3 Meeting (Berlin, Germany). [pdf] [bib]

Theses

Boril, H. (2008). “Robust speech recognition: Analysis and equalization of Lombard effect in Czech corpora,” Ph.D. dissertation, Czech Technical University in Prague, Czech Republic. [pdf] [cited] [bib]

Boril, H. (2003). “Guitar MIDI converter”, Master's thesis, Czech Technical University in Prague, in Czech. [pdf] [cited] [bib]

Other Publications


Proceedings

Boril, H. (2006). “Design of Speech Feedback; Comparison of Features for Lombard speech recognition,” Analysis and Processing of Speech and Biological Signals, CTU Publishing House, Prague, pp. 24-30, in Czech.

Boril, H., Fousek, P. (2006). “Influence of Different Speech Representations and HMM Training Strategies on ASR Performance,” In Proc. Intl. Student Conf. POSTER 2006, Prague.

Boril, H. and Pollák, P. (2005). “Analysis of Lombard Effect in Several Czech Databases,” In Proceedings of the Joint 16th Conference on Electronic Speech Signal Processing ESSP 2005 and 15th Czech-German Workshop on Speech Processing, vol. 1, pp. 253-259, Prague.

Boril, H., Boril, T., and Pollák, P. (2005). “Design of Lombard Effect Speech Database,” In Proc. RADIOELEKTRONIKA 2005, pp. 144-147, Brno, Czech Republic.

Boril, H. (2004). “Recognition of Speech under Lombard Effect,” In Proc. 14th Czech-German Workshop on Speech Processing, pp. 110-113, Prague 2004. [cited]

Boril, H. (2004). “Parameter Changes and Recognition of Speech under Stress,” Survey, Signal Analysis and Processing V, CTU Publishing House, Prague, pp. 54-65, in Czech.

Boril, H. (2003). “Direct Time Fundamental Frequency Estimation,” In Proc. Polish-Hungarian-Czech Workshop on Circuit Theory, Signal Processing and Applications, pp. 59-64, Prague.

Boril, H. (2003). “Pitch Detector for Guitar MIDI Converter,” In Proc. Intl. Student Conf. POSTER 2003, Prague. [cited]

Last Updated 9-14-2010

