Fernando de la Calle Silos

Researcher and Associate Professor
Signal Teory and Comunications Department
Universidad Carlos III de Madrid

fsilos [at] tsc [dot] uc3m [dot] es

Introduction

During my years as a researcher I have developed a deep passion for signal processing and machine learning, focusing on diverse areas such as Speech Recognition and Computer Vision. I am very passionate to be working on these topics as part of my research at University Carlos III of Madrid and Carnegie Mellon University, and would like to keep doing research on related areas.

Detailed information can be found on my curriculum vitae.

Publications

Dissertation

Bio-Motivated Features and Deep Learning for Robust Speech Recognition
F. de-la-Calle-Silos
Universidad Carlos III de Madrid, 2017

Journal

Synchrony-Based Feature Extraction for Robust Automatic Speech Recognition
F. de-la-Calle-Silos and Richard M. Stern
IEEE Signal Processing Letters, vol. 24, no. 8, pp. 1158, Aug. 2017

Morphologically- filtered power-normalized cochleograms as robust, biologically inspired features for ASR
F. de-la-Calle-Silos, F.J. Valverde-Albacete, A. Gallardo-Antolín, C. Pelaéz-Moreno.
IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 23, no. 11, pp. 2070-2080, Nov. 2015
Best indexed 2016 JCR journal publication by the Spanish Thematic Network on Speech Technology (RTTH)

International Conferences

Deep Residual Networks with Auditory Inspired Features for Robust Speech Recognition
F. de-la-Calle-Silos, A. Gallardo-Antolín, C. Pelaéz-Moreno.
Intespeech 2017

ASR Feature Extraction with Morphologically-Filtered Power-Normalized Cochleograms
F. de-la-Calle-Silos, F.J. Valverde-Albacete, A. Gallardo-Antolín, C. Pelaéz-Moreno.
International Speech Communication Association (Intespeech), Singapore, September 2014

Mid-Level Feature Set for Specific Event and Anomaly Detection in Crowded Scenes.
F. de-la-Calle-Silos, I. González-Díaz, F. Díaz-de-María
IEEE International Conference of Image Processing (ICIP), Melbourne, Australia. September, 2013.
Code, videos and poster

Local Conferences

An Analysis of Deep Neural Networks in Broad Phonetic Classes for Noisy Speech Recognition
F. de-la-Calle-Silos, A. Gallardo-Antolín, C. Pelaéz-Moreno.
Advances in Speech and Language Technologies for Iberian Languages: Third International Conference, Iberspeech 2016, Lisbon, Portugal, November 23-25, 2016, Proceedings. Lecture Notes in Computer Science

Preliminary experiments on the robustness of biologically motivated features for DNN-based ASR
F. de-la-Calle-Silos, F.J. Valverde-Albacete, A. Gallardo-Antolín, C. Pelaéz-Moreno.
4th International Work Conference on Bioinspired Intelligence (IWOBI 15), June 2015

Deep Maxout Networks applied to Noise-Robust Speech Recognition
F. de-la-Calle-Silos, F.J. Valverde-Albacete, A. Gallardo-Antolín, C. Pelaéz-Moreno.
Advances in Speech and Language Technologies for Iberian Languages. Lecture Notes in Computer Science, Springer 2014.

Education

Phd

Bio-Motivated Features and Deep Learning for Robust Speech Recognition
PhD in Multimedia and Communications
University Carlos III de Madrid
Jun 2013 - Sep 2017
Cum laude distinction and Ph.D. Outstanding Thesis Award.

MSc

Master in Multimedia and Communications.
University Carlos III de Madrid
Sep 2012 - Jun 2013

BSc

Bachelor in Telecommunication Technology Engineering
University Carlos III de Madrid
Sep 2008 - Jun 2012
Dissertation: Event Recognition in Crowded Scenes
Best Academic Record Graduation Award

Teaching

Speech, Audio, Image, and Video Processing Applications
Master in Telecommunications Engineering
University Carlos III de Madrid
Fall 2017

Algorithms for Multimedia Information Management
Bachelor in Telecommunication Technology Engineering
University Carlos III de Madrid
Fall 2016, Fall 2017

Multimedia Information Coding in Communications
Bachelor in Communication System Engineering
University Carlos III de Madrid
Spring 2018

Acoustical Instrumentation and Noise Control
Bachelor in Audiovisual System Engineering
University Carlos III de Madrid
Fall 2016, Fall 2017

Digital Audio Processing
Bachelor in Audiovisual System Engineering
University Carlos III de Madrid
Fall 2017

Code

fsilosSpeechToolbox: Implementation of all the feature extraction methods presented in my PhD thesis.

ResNet-Kaldi-Tensorflow-ASR: ResNet and other CNN implementations in Tensorflow presented in the paper: Deep Residual Networks with Auditory Inspired Features for Robust Speech Recognition.

MI_Feature_Selection: Matlab code that implements the feature selection algorithm using mutual information described in the ICIP 2013 paper