Malcolm Slaney

Publications by Year (See also Google Scholar)


2024
Malcolm Slaney and Matthew Fitzgeral. Comparing human and machine speech recognition in noise with QuickSIN. JASA Express Letters 4, 095202 (2024). HTML

Richard F. Lyon, Rob Schonberger, Malcolm Slaney, Mihajlo Velimirovic, Honglin Yu. The CARFAC v2 Cochlear Model in Matlab, NumPy, and JAX. https://arxiv.org/abs/2404.17490

Jens Hjortkjaer, Daniel DE Wong, Alessandro Catania, Jonatan Marcher-Rorsted, Enea Ceolini, Soren Asp Fuglsang, Ilya Kiselev, Giovanni Di Liberto, Shih-Chii Liu, Torsten Dau, Malcolm Slaney, Alain de Cheveigne, Real-time control of a hearing instrument with EEG-based attention decoding. bioRxiv, 2024.03. 01.582668. PDF

2023
C Bregler, M Covell, Malcolm Slaney. Video rewrite: Driving visual speech with audio. Seminal Graphics Papers: Pushing the Boundaries, Volume 2, ACM SigGraph, pp. 715-722, 2023. PDF

A Omran, N Zeghidour, Z Borsos, F de Chaumont Quitry, Malcolm Slaney, M. Tagliasacchi. Disentangling speech from surroundings with neural embeddings. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes, Greece, 2023. PDF

DT Speckhard, K Misiunas, S Perel, T Zhu, S Carlile, Malcolm Slaney. Neural architecture search for energy-efficient always-on audio machine learning. Neural Computing and Applications 35 (16), 12133-12144, 2023. PDF

Malcolm Slaney. Machine Learning for Audition. Keynote at the Virtual Conference on Computational Audition, 2023.

2022
C Han, EM Kaya, K Hoefer, Malcolm Slaney, S Carlile. Multi-channel speech denoising for machine ears. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022. PDF

2021

Malcolm Slaney, Richard F. Lyon. Speech and hearing for the next billion users. Invited presentation at the 181st Meeting Acoustical Society of America, Seattle, Washington, December 2021.

Artem Dementyev, Pascal Getreuer, Dimitri Kanevsky, Malcolm Slaney, Richard F Lyon. VHP: Vibrotactile Haptics Platform for On-body Applications. UIST '21: The 34th Annual ACM Symposium on User Interface Software and Technology, October 2021 Pages 598-612. PDF

Alain de Cheveigné, Malcolm Slaney, Søren A Fuglsang and Jens Hjortkjaer. Auditory stimulus-response modeling with a match-mismatch task. Journal of Neural Engineering, Volume 18, Number 4, 2021. PDF

2020
Malcolm Slaney, Richard F Lyon, Ricardo Garcia, Brian Kemler, Chet Gnegy, Kevin Wilson, Dimitri Kanevsky, Sagar Savla, Vinton G Cerf. Auditory measures for the next billion users. Ear and Hearing, 41, pp. 131S-139S, 2020. PDF

Gitte Keidser, Graham Naylor, Douglas S. Brungart, Andreas Caduff, Jennifer Campos, Simon Carlile, Mark G. Carpenter, Giso Grimm, Volker Hohmann, Inga Holube, Stefan Launer, Thomas Lunner, Ravish Mehra, Frances Rapport, Malcolm Slaney, and Karolina Smeds. The Quest for Ecological Validity in Hearing Science: What It Is, Why It Matters, and How to Advance It. Ear and Hearing, 41. pp. 5S-19S, 2020. PDF

Jaswanth Reddy Katthi, Sriram Ganapathy, Sandeep Kothinti, Malcolm Slaney. Deep Canonical Correlation Analysis For Decoding The Auditory Brain. Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), 2020. PDF

2019
Sijia Zhao, Nga Wai Yum, Lucas Benjamin, Elia Benhamou, Makoto Yoneya, Shigeto Furukawa, Fred Dick, Malcolm Slaney, Maria Chait. Rapid ocular responses are modulated by bottom-up driven auditory salience. Journal of Neuroscience, 0776-19, 2019. PDF

SC Liu, JG Harris, M Elhilali, M Slaney. Bio-inspired Audio Processing, Models and System, Frontiers in Neuroscience, 13, 2019. HTML

Ariel Goldstein, Aren Jansen, Malcolm Slaney, Amy Price, Zaid Kokaja Zada, Gina Choe, Bobbi Aubrey, Aditi Rao, Lora Fanda, Kenneth Norman, Adeen Flinker, Orrin Devinsky, Michael Brenner, Uri Hasson. Temporal Dynamics of Meaning. 2019 Conference on Cognitive Computational Neuroscience, Berlin, Germany, 13-16 September 2019. PDF

2018
Nicholas Huang, Malcolm Slaney, Mounya Elhilali. Connecting Deep Neural Networks to Physical, Perceptual, and Electrophysiological Auditory Signals. Frontiers in Neuroscience, Volume 12, pages 532, 2018. PDF

Daniel D. E. Wong, Soren A. Fuglsang, Jens Hjortkaer, Enea Ceolini, Malcolm Slaney, Alain de Cheveigne. A Comparison of Regularization Methods in Forward and Backward Models for Auditory Attention Decoding. Frontiers in Neuroscience, Volume 12, 531, 2018.
PDF

Ken Hoover, Sourish Chaudhuri, Caroline Pantofaru, Ian Sturdy, Malcolm Slaney. Using audio-visual information to understand speaker activity: Tracking active speakers on and off screen. In ICASSP 2018, Banff, Canada, April 2018. PDF

MH Anderson, BW Yazel, MPF Stickle, Iniguez FD Espinosa, NS Gutierrez, Malcolm Slaney, SS Joshi, LM Miller. Towards mobile gaze-directed beamforming: a novel neuro-technology for hearing loss. Proc, IEEE Engineering in Medicine and Biology Society Conference, 2018. PDF

Alain de Cheveigne, Daniel D.E. Wong, Giovanni M. Di Liberto, Jens Hjortkaer, Malcolm Slaney, Edmund Lalor. Decoding the auditory brain with canonical component analysis. NeuroImage, Volume 172, pp. 206-216, 15 May 2018.
PDF

2017
Shawn Hershey, Sourish Chaudhuri, Daniel Ellis, Jort Gemmeke, Aren Jansen, Channing Moore, Manoj Plakal, Devin Platt, Rif Saurous, Bryan Seybold, Malcolm Slaney, Ron Weiss, Kevin Wilson. CNN Architectures for Large-scale Audio Classification. In ICASSP 2017, New Orleans, March 2017. [arXiv PDF]

2016
Daniel D. E. Wong, Ulrich Pomper, Emina Alickovic, Jens Hjortkaer, Malcolm Slaney, Shihab Shamma, Alain de Cheveigne. Decoding Speech Sound Source Direction from Electroencephalography Data. ARO winter meeting (abstract), February 2016.

2015
T. J. Tsai, Andreas Stolcke, and Malcolm Slaney. A Study of Multimodal Addressee Detection in Human-Human-Computer Interaction. IEEE Transactions on Multimedia, 17(9), September 2015. [PDF]

TJ Tsai, Andreas Stolcke, and Malcolm Slaney. Multimodal addressee detection in multiparty dialogue systems. In Proc. IEEE ICASSP, IEEE SPS, Brisbane, Australia. April 2015. [PDF]

Anna Prokofieva, Malcolm Slaney, Dilek Hakkani-Tür. Probabilistic features for conecting eye gaze to spoken language understanding. In Proc. IEEE ICASSP, IEEE SPS, Brisbane, Australia. April 2015. [PDF]

2014
Malcolm Slaney and Dilek Hakkani-Tür. Eye gaze for speech recognition and understanding. Demonstration at SLT 2014, South Lake Tahoe, CA, December 2014.  [PDF]

Anna Prokofieva, Dilek Hakkani-Tür, Malcolm Slaney. Eye gaze for understanding conversational speech. IEEE Workshop on Spoken Language Technology, South Lake Tahoe, NV, December 2014. [PDF]

Sree Harsha Yella, Andreas Stolcke, Malcolm Slaney. Artificial neural network features for speaker diarization. IEEE Workshop on Spoken Language Technology, South Lake Tahoe, NV, December 2014. [PDF]

Malcolm Slaney, Andreas Stolcke, Dilek Hakkani-Tür. "The relation of eye gaze and face pose: Potential impact on speech recognition." ACM International Conference on Multimodal Interactions (ICMI), Istanbul, Turkey, November 2014. [PDF]

Dilek Hakkani-Tür, Malcolm Slaney, Asli Celikyilmaz, Larry Heck. "Eye gaze for spoken language understanding in multi-modal conversational interactions." ACM International Conference on Multimodal Interactions (ICMI), Istanbul, Turkey, November 2014. [PDF]

Malcolm Slaney. Spectrogram Inversion Toolkit for Matlab. IEEE Signal Processing Society SLTC Newsletter, November 2014. [PDF]

Phil Pitts, Arrigo Benedetti, Malcolm Slaney, and Phil Chou. Time of Flight Tracer. Microsoft Research Technical Report MSR-TR-2014-142, November 2014. [Technical report or link to code]

Malcolm Slaney, Michael L. Seltzer. "The influence of pitch and noise on the discriminability of filterbank features." Proceedings of Interspeech 2014, Singapore, 2014. [PDF]

Dong Yu, Adam Eversole, Michael L. Seltzer, Kaisheng Yao, Zhiheng Huang, Brian Guenter, Oleksii Kuchaiev, Yu Zhang, Frank Seide, Huaming Wang, Jasha Droppo, Geoffrey Zweig, Chris Rossbach, Jon Currey, Jie Gao, Avner May, Baolin Peng, Andreas Stolcke, Malcolm Slaney. "An Introduction to Computational Networks and the Computational Network Toolkit." Microsoft Research, MSR-TR-2014-112, October 2014. [Technical report or link to code]

Yan Huang, Malcolm Slaney, Michael L. Seltzer, and Yifan Gong. "Towards better performance with heterogeneous training data in acoustic modeling using deep neural networks."  Proceedings of Interspeech 2014, Singapore, 2014. [PDF]

Malcolm Slaney. "Spectrogram Inversion Toolbox." Microsoft Research, September 2014. [Software]

Sook Young Won, Jonathan Berger, and Malcolm Slaney. "Simulation of one's own voice in a two-parameter model." Proceedings of the International Conference on Music Perception and Cognition (ICMPC), Seoul, South Korea, August 2014. [PDF]

Neville Ryant, Malcolm Slaney, Mark Liberman, Elizabeth Shriberg, and Jiahong Yuan. "Highly accurate mandarin tone classification in the absence of pitch information." In the Proceedings of Speech Prosody, Dublin, May 2014. [PDF]

Malcolm Slaney, Rahul Rajan, Andreas Stolcke, and Partha Parthasarathy. "Gaze-enhanced speech recognition." In Proc. IEEE ICASSP, IEEE SPS, Florence, Italy. May 2014. [PDF]

James A. O'Sullivan, Alan J. Power,  Nima Mesgarani,  Siddharth Rajaram,  John J. Foxe, Barbara G. Shinn-Cunningham,  Malcolm Slaney,  Shihab A. Shamma and Edmund C. Lalor. "Attentional Selection in a Cocktail Party Environment Can Be Decoded from Single-Trial EEG." Cerebral Cortex, January 2014. [PDF]

2013
Blair Kaneshiro, Hyung-Suk Kim, Jorge Herrera, Jieun Oh, Jonathan Berger and Malcolm Slaney. "QBT-Extended: An annotated dataset of melodically contoured tapped queries," in Proceedings of the International Society of Music Information Retrieval (ISMIR), Curitiba, PR, Brazil, November 2013. [PDF]

Seyed Omid Sadjadi, Malcolm Slaney, and Larry Heck. "MSR Identity Toolbox v1.0: A MATLAB Toolbox for Speaker-Recognition Research." IEEE Signal Processing Society SLTC Newsletter, November 2013. [Link]

Malcolm Slaney, Elizabeth Shriberg, Jui-Ting Huang. "Pitch-Gesture Modeling Using Subband Autocorrelation Change Detection," in Proceedings of InterSpeech 2013, Lyon, France, August 2013. [PDF and software]

Jieun Oh, Eunjoon Cho, Malcolm Slaney. "Contours of Syllabic-Level Units in Laughter," in Proceedings of InterSpeech 2013, Lyon, France, August 2013. [PDF]

Seyed Omid Sadjadi, Malcolm Slaney, and Larry Heck. "MSR Identity Toolbox v1.0: A MATLAB Toolbox for Speaker-Recognition Research."" Microsoft Research, November 2013. [Software]

Edmund Lalor, Nima Mesgarani, Siddharth Rajaram, Adam O'Donovan, James Wright, Inyong Choi, Jonathan Brumberg, Nai Ding, Adrian KC Lee, Nils Peters, Sudarshan Ramenahalli, Jeffrey Pompe, Barbara Shinn-Cunningham, Malcolm Slaney, and Shihab Shamma. "Decoding Auditory Attention (in Real Time) with EEG," in Proceedings of the 37th ARO MidWinter Meeting, Association for Research in Otolaryngology (ARO), 17 February 2013. [PDF]

Ivan Tashev and Malcolm Slaney. "Data Driven Suppression Rule for Speech Enhancement." In Information Theory and Applications Workshop, University of California - San Diego, 14 February 2013. [PDF]

Ramesh Jain and Malcolm Slaney. "Micro Stories and Mega Stories." Visions and Views Column, IEEE Multimedia Magazine, January 2013. [PDF]

2012
Malcolm Slaney and Chris Bregler. "Image-based Facial Synthesis." In Audio-Visual Speech Processing, Eric Bateson, Gérard Bailly and Pascal Perrier, eds., Cambridge University Press, 2012.
 
Malcolm Slaney. "Pay Attention Please: Attention at the Telluride Neuromorphic Cognition Workshop." IEEE Signal Processing Society SLTC Newsletter, November 2012. [Link]

Aisling Kelliher and Malcolm Slaney. "Tell me a Story." Visions and Views Column, IEEE Multimedia Magazine, Winter 2012. [PDF]

Juhan Nam, Jorge Herrera, Malcolm Slaney, Julius Smith. "Learning Sparse Feature Representations for Music Annotation and Retrieval." Proceedings of the International Society of Music-Information Retrieval, Porto, Portugal, October 2012. [PDF]

Klara Nahrstedt and Malcolm Slaney, Malcolm. "Coulda, woulda, shoulda: 20 years of multimedia opportunities." Proceedings of the 20th ACM International Conference on Multimedia, Nara, Japan, 2012. [PDF]
 
Ajay Divakaran, Malcolm Slaney, and Martha Larson. "Audio analysis for consumer and other industrial applications." Proceedings of the 2012 ACM international workshop on Audio and multimedia methods for large-scale video analysis, AMVA '12, Nara, Japan, pp. 33-34, 2012. [PDF]

Malcolm Slaney, Yury Lifshits, Junfeng He. "Optimal Parameters for Locality-Sensitive Hashing." In a special issue of the Proceedings of the IEEE on Web-Scale Multimedia, September 2012. [PDF and Code]

David Ayman Shamma and Malcolm Slaney. "Don’t Click Here." Visions and Views Column, IEEE Multimedia Magazine, Summer 2012. [PDF]

Malcolm Slaney, Trevor Agus, Shih-Chii Liu, Merve Kaya, Mounya Elhilali. "A Model of Attention-Driven Scene Analysis." Proceedings of the International Conference on Acoustics, Speech and Signal Processing, Kyoto, Japan, March, 2012. [PDF]

2011
Dulce Ponceleon and Malcolm Slaney. "Multimedia Information Retrieval." In Modern Information Retrieval, Ricardo Baeza-Yates and Berthier Ribeiro-Neto, Second Edition, 2011.

Lyndon Kennedy, Malcolm Slaney. "Identifying Authoritative Sources of Multimedia Content." Proceedings of the ACM Conference on Multimedia, Scottsdale, AZ, November 2011. [PDF]

Malcolm Slaney. "Precision-Recall is Wrong for Multimedia." Visions and Views Column, IEEE Multimedia Magazine, Fall 2011. [PDF]

Juhan Nam, Jiquan Ngiam, Honglak Lee and Malcolm Slaney. "A Classification-based Polyphonic Piano Transcription Approach using Learned Feature Representations." In Proceedings of the International Symposium on Music Information Retrieval (ISMIR), October 24, 2011. [PDF]

Malcolm Slaney. "Does Content Matter?" Visions and Views Column, IEEE Multimedia Magazine, Summer 2011. [PDF]

Benjamin M. Marlin, Richard S. Zemel, Sam Roweis, and Malcolm Slaney. "Recommender Systems: Missing Data and Statistical Model Estimation." In IJCAI Best Paper Session, Barcelona, Spain, July 2011. [PDF]

Vidhya Navalpakkam, Justin Rao, Malcolm Slaney. "Using Gaze Patterns to Measure and Detect Distraction-induced Struggles while Reading." In Proceedings of the ACM SIGCHI Conference on Human Factors in Computing Systems, Vancouver, Canada, 2011. [PDF]

Malcolm Slaney and Patrick Naylor. Invited presentation for "Trends Expert Summary: Audio and Acoustic Signal Processing," ICASSP, Prague, CZ, 2011.

Malcolm Slaney. "Web-Scale Multimedia Indexing and Retrieval." Invited talk at Cal IT Conference, UCSD, February 2011.

Malcolm Slaney. "Web-Scale Multimedia Indexing and Retrieval." Invited keynote at IS&T and SPIE International Conference on Multimedia Content Access: Algorithms and Systems V, San Francisco, CA, January 2011.

Cees G.M. Snoek, Malcolm Slaney. "Academia Meets Industry at the Multimedia Grand Challenge." In IEEE Multimedia Magazine, pp. 4-7, January 2011. [PDF]

2010
Gregory Sell and Malcolm Slaney. "Solving Demodulation as an Optimization Problem." IEEE Transactions on Audio, Speech and Language Processing, pp. 2051-2066, Nov. 2010. [PDF and code]

Dhruv Kumar Mahajan and Malcolm Slaney. "Image classification using the web graph." In Proceedings of the International Conference on Multimedia (MM '10). ACM, Florence, Italy, 991-994, 2010. [PDF]

Greg Sell and Malcolm Slaney. "The Information Content of Demodulated Speech," Proceedings of the International Conference on Acoustics, Speech and Signal Processing, Dallas, Texas, March, 2010. [PDF]

Malcolm Slaney. "Multimodal Retrieval and Ranking: More than Waveforms." Invited talk at 11th ACM SIGMM International Conference on Multimedia Information Retrieval, Philadelphia, PA, 241-242, March 2010.

2009
Eva Hörster, Malcolm Slaney, Marc’Aurelio Ranzato, Kilian Weinberger. "Unsupervised Image Ranking." Proceedings of the 2009 ACM Multimedia: Workshop on Large-Scale Multimedia Retrieval and Mining, Beijing, China, October 2009. [PDF]

Lyndon Kennedy, Malcolm Slaney, Kilian Weinberger. "Reliable Tags Using Image Similarity: Mining Specificity and Expertise from Large-Scale Multimedia Databases." Proceedings of the 2009 ACM Multimedia: Workshop on Web-Scale Multimedia Corpus, Beijing, China, October 2009. [PDF]

Theodore Yu, Andrew Schwartz, John Harris, Malcolm Slaney, and Shih-Chii Liu. "Periodicity Detection and Localization using Spike Timing from the AER EAR." Proceedings of the 2009 IEEE International Symposium on Circuits and Systems, Taipei, Taiwan, May 2009. [PDF]

Misha Pavel, Malcolm Slaney, Hynek Hermansky. "Reconciliation of Human and Machine Speech Recognition Performance." Proceedings of the 2008 International Conference on Acoustics, Speech and Signal Processing, Taipei, Taiwan, April 2009. [PDF]


2008
Kilian Weinberger, Malcolm Slaney, Roelof van Zwol. "Resolving Tag Ambiguity." Proceedings of the 16th ACM international Conference on Multimedia, Vancouver, British Columbia, Canada, pp. 111-120, October 26-31, 2008. [PDF]

Malcolm Slaney, Kilian Weinberger, William White. "Learning a Metric for Music Similarity." Proceedings of the International Society of Music-Information Retrieval, pp. 313-318, Philadelphia, PA, September, 2008. [PDF]

Eva Hörster, Rainer Lienhart, Malcolm Slaney. "Continuous Visual Vocabulary Models for pLSA-Based Scene Recognition." ACM International Conference on Image and Video Retrieval (CIVR) 2008, pp. 319-328, Niagara Falls, Canada, 2008. [PDF]

Eva Hörster, Thomas Greif, Rainer Lienhart, Malcolm Slaney. "Comparing Local Feature Descriptors in pLSA-Based Image Models." 30th Annual Symposium of the German Association for Pattern Recognition (DAGM), G. Rigoll, Ed. Lecture Notes In Computer Science, vol. 5096. Springer-Verlag, pp. 446-455, Munich, Germany, 2008. [PDF]

Michael Casey, Christophe Rhodes, Malcolm Slaney. "Analysis of Minimum Distances in High-Dimensional Musical Spaces." IEEE Transactions on Audio, Speech, and Language Processing, vol.16, no.5, pp.1015-1028, July 2008. [PDF]

Michael A. Casey, R. Veltkamp, M. Goto, M. Leman, C. Rhodes, Malcolm Slaney. "Content-Based Music Information Retrieval: Current Directions and Future Challenges." Proceedings of the IEEE, vol.96, no.4, pp.668-696, April 2008. [PDF]

Malcolm Slaney, Michael Casey. "Locality-Sensitive Hashing for Finding Nearest Neighbors." IEEE Signal Processing Magazine, vol.25, no.2, pp.128-131, March 2008. [PDF]

Malcolm Slaney, D. P. W. Ellis, M. Sandler, M. Goto, M. Goodwin. "Introduction to the Special Issue on Music Information Retrieval." IEEE Transactions on Audio, Speech, and Language Processing, vol.16, no.2, pp.253-254, Feb. 2008 [PDF]

Kyogu Lee, Malcolm Slaney. "Acoustic Chord Transcription and Key Extraction from Audio using Key-dependent HMMs Trained on Synthesized Audio." IEEE Transactions on Audio, Speech, and Language Processing, vol.16, no.2, pp.291-301, Feb. 2008. [PDF]

2007
Malcolm Slaney, William White. "Similarity Based on Rating Data." Proceedings on the International Society of Music-Information Retrieval, pp. 479-484, Vienna, Austria, Sept. 2007. [PDF]

Kyogu Lee and Malcolm Slaney. "A Unified System for Chord Transcription and Key Extraction Using Hidden Markov Models." Proceedings on the International Society of Music-Information Retrieval, Vienna, Austria, September 2007. [PDF]

Eva Hörster, Rainer Lienhart, Malcolm Slaney. "Image Retrieval on Large-scale Image Databases." Proceedings of the 6th ACM International Conference on Image and video retrieval CIVR ’07, July 2007. [PDF]

Benjamin M. Marlin, Richard S. Zemel, Sam Roweis, and Malcolm Slaney. "Collaborative Filtering and the Missing at Random Assumption." In the Proceedings of the 23rd Conference on Uncertainty in Artificial Intelligence (UAI2007), IJCAI, July 2007. [PDF]

Rainer Lienhart, Malcolm Slaney. "pLSA on Large-scale Image Databases." Proceedings of the 2007 International Conference on Acoustics, Speech and Signal Processing, Honolulu, Hawaii, April 2007. [PDF]

David Anderson, Sourabh Ravindran, Malcolm Slaney. "Varying Time Constants and Gain Adaptation in Feature Extraction for Speech Processing." Proceedings of the 2007 International Conference on Acoustics, Speech and Signal Processing, Honolulu, Hawaii, April 2007. [PDF]

Michael Casey, Malcolm Slaney. "Fast Recognition of Remixed Music Audio." Proceedings of the 2007 International Conference on Acoustics, Speech and Signal Processing, Honolulu, Hawaii, April 2007. [PDF]

S. H. Srinivasan and M. Slaney. "A Bipartite Graph Model for Associating Images and Text." IJCAI-2007 Workshop on Multimodal Information Retrieval, Hyderabad, India, January 6, 2007. [PDF]

2006
Malcolm Slaney and William White. "Measuring Playlist Diversity for Recommendation Systems." Proceedings of the Audio and Music Computing for Multimedia Workshop in Conjunction with ACM Multimedia, October 2006. [PDF]

Song Chon, Malcolm Slaney and Jonathan Berger. "Predicting Success from Music Sales Data: A Statistical and Adaptive Approach." Proceedings of the Audio and Music Computing for Multimedia Workshop in Conjunction with ACM Multimedia, October 2006. [PDF]

Kyogu Lee, Malcolm Slaney. "Automatic Chord Recognition from Audio Using a Supervised HMM Trained with Audio-from-Symbolic Data." Proceedings of the Audio and Music Computing for Multimedia Workshop in conjunction with ACM Multimedia, October 2006. [PDF]

Kyogu Lee, Malcolm Slaney. "Automatic Chord Recognition Using an HMM with Supervised Learning." in Proceedings of International Conference in Music Information Retrieval, Victoria, BC, October 2006. [PDF]

Michael Casey and Malcolm Slaney. "Song Intersection by Approximate Nearest Neighbor Search." Proceedings on the International Society of Music-Information Retrieval, Victoria, BC, October 2006. [PDF]

Hiroko Terasawa, Malcolm Slaney and Jonathan Berger. "A Statistical Model of Timbre Perception." Proceedings of ISCA Tutorial and Research Workshop on Statistical And Perceptual Audition (SAPA2006), Pittsburgh, PA, September 2006. [PDF]

Hiroko Terasawa, Malcolm Slaney and Jonathan Berger. "Determining the Euclidean Distance between Two Steady State Sounds." Proceedings of 9th International Conference on Music Perception and Cognition (ICMPC9), Bologna, Italy, August 2006. [PDF]

Nima Mesgarani, Malcolm Slaney, Shihab Shamma. "Discrimination of Speech from Non-speech Based on Multiscale Spectro-temporal Modulations." IEEE Transactions on Audio, Speech and Language Processing, 14(6), 920-930, May 2006. [PDF]

Michael Casey and Malcolm Slaney. "The Importance of Sequences in Musical Similarity." IEEE International Conference on Acoustics, Speech and Signal Processing, Toulouse, France, May 2006. [PDF]

Daniel M. Russell, Malcolm Slaney, Yan Qu, Mave Houston. "Being Literate with Large Document Collections: Observational Studies and Cost Structure Tradeoffs." Proceedings of the 39th Annual Hawaii International Conference on System Sciences (HICSS ’06), Volume 3, 04-07 Jan. 2006, Page(s):55. [PDF]

2005
Malcolm Slaney. "The History and Future of CASA." In Speech Separation by Humans and Machines, Editor: P. Divenyi, Kluwer, pp 199-211, 2005. [PDF]

Hiroko Terasawa, Malcolm Slaney, Jonathan Berger. "The thirteen colors of timbre." Proceedings of the 2005 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, October 16-19, 2005. [PDF]

Grace Crowder, Sterling Foster, Daniel M. Russell, Malcolm Slaney, Lisa Yanguas. "Analytic worksheets: A framework to support human analysis of large streaming data volumes." Proceedings of Interact 2005, Rome, Italy, September 12-16, 2005. [PDF]

A. Ihlefeld, and Malcolm Slaney. "The Story of AudioSapiana." The Neuromorphic Engineer, http://www.ine-news.org/view.php?source=0029-2005-12-1, 2005. [Link]

Hiroko Terasawa, Malcolm Slaney, Jonathan Berger. "A Timbre Space for Speech." Proceedings of InterSpeech 2005, Lisbon, Portugal, September 4-8, 2005. [PDF]

Hiroko Terasawa, Malcolm Slaney, Jonathan Berger. "Perceptual Distance in Timbre Space." Japan Acoustical Society, August, 2005, (in Japanese.) [PDF]

Hiroko Terasawa, Malcolm Slaney, Jonathan Berger. "Perceptual distance in timbre space." Proceedings of International Conference on Auditory Display (ICAD) 2005, Limerick, Ireland, July 6-9, 2005. [PDF]

Daniel M. Russell, Malcolm Slaney, Yan Qu, Mave Houston. "A Cost Structure Analysis of Manual and Computer-supported Sensemaking Behavior." Proceedings of Intelligence Analysis 2005, McLean, VA, May 2-6, 2005. [PDF]

Malcolm Slaney and D. M. Russell. "Measuring information understanding in large document collections." HICSS ’05. Proceedings of the 38th Annual Hawaii International Conference on System Sciences, Page(s):105-105, Jan. 03-06, 2005. [PDF]

2004
Nima Mesgarani, Malcolm Slaney, Shihab Shamma. "Speech discrimination based on multiscale spectro-temporal modulations." Proceedings IEEE International Conference on Acoustics, Speech, and Signal Processing. (ICASSP '04), Page(s):I-601-604, May 17-21, 2004. [PDF]

Sourabh Ravindran, David Anderson, Malcolm Slaney. "Low-power Audio Classification for Ubiquitous Sensor Networks." Proceedings IEEE International Conference on Acoustics, Speech, and Signal Processing. (ICASSP '04), Page(s):iv-337-340, May 17-21 2004. [PDF]

2003
Malcolm Slaney, Dulce Ponceleon, James Kaufman. "Understanding the Semantics of Media." In Video Mining, A. Rosenfeld, D. Doermann, D. DeMenthon (editors). Kluwer Academic Publishers, Boston, 2003. [PDF]

Malcolm Slaney, Jayashree Subrahmonia, Paul Maglio. "Modeling Multitasking Users." Published in Spring-Verlag Lecture Notes in Artificial Intelligence, UM2003 User Modeling: Proceedings of the Ninth International Conference, July 2003. [PDF]

2002
Malcolm Slaney, Gerald McRoberts. "BabyEars: A Recognition System for Affective Vocalizations." Speech Communication, 2002. [PDF]

Malcolm Slaney. "Mixtures of Probability Experts for Audio Retrieval and Indexing." Proceedings of the International Conference on Multimedia and Expo, Lausanne, Switzerland, August 2002. [PDF]

Malcolm Slaney. "Semantic-Audio retrieval." Invited paper in Proceedings of 2002 International Conference on Acoustics, Speech and Signal Processing, Orlando, CA, May 2002. [PDF]

2001
Malcolm Slaney, Dulce Ponceleon, James Kaufmann. "Multimedia edges: Finding Hierarchy in all Dimensions." Proceedings ACM Multimedia Conference, Los Angeles, CA, October 2001. [Link]

Malcolm Slaney and Dulce Ponceleon. "Hierarchical segmentation using latent semantic indexing in scale space." Proceedings of the 2001 International Conference on Acoustics, Speech and Signal Processing, Salt Lake City, UT, May 2001. [PDF]

Michele Covell, Malcolm Slaney and Art Rothstein. "FastMPEG: Time-scale Modification of Bit-compressed Audio Information." Proceedings of the 2001 International Conference on Acoustics, Speech and Signal Processing, Salt Lake City, UT, May 2001. [PDF]

Malcolm Slaney, Dulce Ponceleon. "Hierarchical segmentation: Finding changes in a Text Signal." Proceedings of the SIAM Text Mining 2001 Workshop, Chicago, IL, pp. 6-13, April 7, 2001 [PDF]

Malcolm Slaney, Michele Covell. "FaceSync: A Linear Operator for Measuring Synchronization of Video Facial Images and Audio Tracks." In Advances in Neural Information Processing Systems 13, edited by Leen, Todd K., Dietterich, Thomas G. and Tresp, Volker, MIT Press, 2001. [PDF]

Steve Greenberg, Malcolm Slaney (editors). Computational Models of Auditory Function. IOS Press, Amsterdam, 2001. [Full Text as PDF]

2000
Malcolm Slaney, Better CHI with Signal Computation, Talk at Stanford PCD Seminar, CS 547, January 7, 2000. [link]

Malcolm Slaney and Michele Covell, "Matlab Multidimensional Scaling Tools," Interval Technical Report #2000-025, 2000. [link]


1998
Malcolm Slaney. "Auditory Toolbox," Interval Research Technical Report 1998-010, 1998. [Web site for software]

Michele Covell, Margaret Withgott, and Malcolm Slaney. "Mach1: Nonuniform Time-scale Modification of Speech," Proceedings IEEE International Conference on Acoustics, Speech, and Signal Processing, Seattle WA, vol. 1, pp. 349-352, May 12-15 1998. [PDF, Technical Report with audio]

Christoph Bregler, Michele Covell, Malcolm Slaney. "Video Rewrite: Photorealistic Synthetic Lip Sync." Proceedings INA Imagina, Monte Carlo, Monaco, pp.193-203, March 4-6 1998, (invited). [PDF]

Malcolm Slaney and Gerald McRoberts. "BabyEars: A Recognition System for Affective Vocalizations." Proceedings of the 1998 International Conference on Acoustics, Speech, and Signal Processing, Seattle, WA, vol. 2, pp. 985-988, May 12-15, 1998. [PDF]

Malcolm Slaney. "Connecting Correlograms to Neurophysiology and Psychoacoustics." In Psychophysical and Physiological Advances in Hearing, A.R. Palmer, A. Rees, A.Q. Summerfield and R. Meddis (editors). Whurr Publishers, London, 1998. [PDF]

Malcolm Slaney. "A Critique of Pure Audition." In Computational Auditory Scene Analysis, David F. Rosenthal , Hiroshi G. Okuno (editors). Erlbaum, Mahwah, N.J. 1998. [ PDF or Web site]

1997
Christoph Bregler, Michele Covell, Malcolm Slaney. "Video Rewrite: Visual speech Synthesis from Video." Proceedings the 1997 ACM SIGGRAPH, Los Angeles, pp. 353-360, August 1997. [PDF and paper with samples]

Eric Scheirer, Malcolm Slaney. "Construction and Evaluation of a Robust Multifeature Speech/music Discriminator." Proceedings of the 1997 IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 2, pp. 1331-1334, April 1997. [PDF]

1996
Christoph Bregler, Stephen Omohundro, Michele Covell, Malcolm Slaney, Subutai Ahmad, David A.Forsyth, Jerry A. Feldman. "Probabilistic Models of Verbal and Body Gestures." In Computer Vision in Man-Machine Interfaces, R. Cipolla, A.Pentland (editors). Cambridge University Press, Cambridge, UK, 1996. [PDF]

1995
Malcolm Slaney. "Pattern Playback in the ’90s." in Advances in Neural Information Processing Systems 7, Gerald Tesauro, David Touretzky,Todd Leen (editors). MIT Press, Cambridge, MA, pp. 827-834, 1995. [PDF]

Malcolm Slaney. "Pattern Playback from 1950 to 1995." 1995 IEEE International Conference on Systems, Man and Cybernetics, Seattle, Vol. 4, pp. 3519-3524, Oct. 1995. [Link]

Chris Bregler and Malcolm Slaney, "Snakes-A MatLab MEX file to demonstrate snake contour-following," Interval Technical Report #1995-017, 1995. [Link to code]

Malcolm Slaney. "A Critique of Pure Audition." Proceedings of the Computational Auditory Scene Analysis Workshop, 1995 International Joint Conference on Artificial Intelligence, Montreal, Canada, August 19-20, 1995. [Link]

Malcolm Slaney, Michele Covell, Bud Lassiter. "Automatic Audio Morphing." Proceedings of the 1995 International Conference on Acoustics Speech and Signal Processing, Atlanta, GA, vol. 2, pp. 1001-1004, May 1995. [PDF or paper with sound examples]

1994
Malcolm Slaney, Daniel Naar, Richard F. Lyon. "Auditory Model Inversion for Sound Separation." Proceedings of the 1994 International Conference on Acoustics Speech and Signal Processing, Adelaide, SA, Australia, vol. II, pp. 77-80, April 1994. [PDF]

Malcolm Slaney. "An Introduction to Auditory Model Inversion." Invited talk at the 1994 ATR Workshop on a Biological Framework for Speech Perception and Production, Kyoto Japan, September 16-17, 1994. [Link to paper and sound examples]

1993
Malcolm Slaney, Richard F. Lyon. "On the Importance of Time: A Temporal Representation of Sound." In Visual Representations of Speech, Martin Cooke, Steve Beet, Malcolm Crawford (editors). J. Wiley, New York, pp. 95-116, 1993. [PDF]

Malcolm Slaney. "An Efficient Implementation of the Patterson-Holdsworth Auditory Filter Bank." Apple Computer Technical Report #35, Apple Computer, Inc., Cupertino, CA, 1993. [Mathematica notebook and PDF]

Malcolm Slaney. "A Review of Filter Design." Apple Computer Technical Report #34, Apple Computer, Inc., Cupertino, CA, 1993. [Mathematica notebook and PDF]

1992
Malcolm Slaney. "Interactive Signal-processing Documents." In Symbolic and Knowledge-Based Signal Processing, Alan V. Oppenheim, S. Hamid Nawab (editors) Prentice Hall, Englewood Cliffs, NJ, pp. 173-204, 1992. [PDF]

Malcolm Slaney. "On the Importance of Time." Invited talk at the 1992 Workshop on Music Representations, Capri, Italy, October 1992.

Malcolm Slaney. "On the Importance of Time---A Temporal Representation of Sound." Invited talk at the ESCA Workshop on Visual Representations of Speech, Sheffield, England, April 1992.

Malcolm Slaney. "Ear." Published as the SPECfp92 benchmark "ear," 1992. [Link]

Malcolm Slaney. "MacEar: A program that implements a cochlear model." [Code]

1991
Mark Fanty, Ron Cole, Malcolm Slaney. "A Comparison of DFT, PLP and Cochleagram for Alphabet Recognition." Conference Record of the Twenty-Fifth Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, vol.1, pp. 326-329, November 1991. [PDF]

Malcolm Slaney, Richard. F. Lyon. "Apple Hearing Demo Reel." Apple Computer Technical Report #25, Apple Computer, Inc., Cupertino, CA, 1991. [New web site or original PDF]

1990

Malcolm Slaney. "Interactive Signal Processing Documents." IEEE ASSP Magazine, pp. 8-20, April 1990. [PDF]

Yeshwant K. Muthusamy, Ron Cole, Malcolm Slaney. "Speaker-independent Vowel Recognition: Spectrograms versus Cochleagrams." Proceedings of the 1990 International Conference on Acoustics, Speech, and Signal Processing, Albuquerque, NM, vol. 5, pp. 533-536, April 1990. [PDF]

Richard Duda, Richard Lyon, Malcolm Slaney. "Correlograms and the Separation of Sounds." Conference Record: Twenty-Fourth Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, pp. 457-461, November 1990. [PDF]

Malcolm Slaney, Richard F. Lyon. "A Perceptual Pitch Detector." Proceedings of the 1990 International Conference on Acoustics, Speech, and Signal Processing, Albuquerque, NM, vol. 1, pp. 357-360, April 1990. [PDF]

1989
Mike Obermier, Gus Pabon, Malcolm Slaney, Larry Yaeger, Steve Nowlan. "Supercomputer research and engineering applications at Apple." Cray Channels, vol.11, no.3, p. 6-11, Fall 1989. [Large PDF, 100M]

Malcolm Slaney. "Implementing Cochlear Models." Invited talk at 1989 IEEE Asilomar Microprocessor Workshop, Asilomar Conference Center, Pacific Grove, CA, April 27, 1989.

Malcolm Slaney. "How Apple uses a CRAY Supercomputer to Design Personal Computers." Invited talk at Cray Research Japan Ltd. Technical Seminar, Tokyo, Japan, April 18, 1989.

1988
A. C. Kak, Malcolm Slaney. Principles of Computerized Tomographic Imaging. IEEE Press, New York, 1988. Republished by the Society of Industrial and Applied Mathematics (SIAM) in their series "Classics in Applied Mathematics," 2001. This book is available on the Internet at http://www.slaney.org/pct. [Code]

Malcolm Slaney. "Lyon's Cochlear Model." Apple Computer Technical Report #13, Apple Computer, Inc., Cupertino, CA, November 1988. [Mathematica notebook or PDF]

Malcolm Slaney. "SunTroff: A program for displaying device independent TROFF on a Sun Workstation."" Published on the 1988 Sun Users Group Tape. Basis for the Linux gxditview tool.

1986
Malcolm Slaney, Mani Azimi, A. C. Kak, Lawrence. E. Larson. "Microwave Imaging with First-order Diffraction Tomography." In Medical Applications of Microwave Imaging, L. E. Larsen, J. H. Jacobi (editors). IEEE Press, New York, pp. 184-212, 1986. [PDF]

1985

Malcolm Slaney, A. C. Kak. "Imaging with Higher Order Diffraction Tomography." Proceedings of the IEEE 1985 Ultrasonics Symposium, San Francisco, vol. 2, pp. 808-813, 1985. [PDF]

Malcolm Slaney. "Imaging with Diffraction Tomography." PhD Dissertation, Purdue University, 1985. [PDF or code]

1984
Malcolm Slaney, A. C. Kak, L.E. Larson. "Limitations of Imaging with First-order Diffraction Tomography." IEEE Transactions on Microwave Theory and Techniques, vol. MTT-32, pp. 860-873, August 1984. [PDF]

1983

Malcolm Slaney, A. C. Kak. "Diffraction Tomography." Inverse Optics: Proceedings of the SPIE, Vol. 412, Arlington, VA, pp. 2-19, April 1983. [PDF]
Carl Crawford, Mani Azimi, Malcolm Slaney. "The CRC Plotting Package." Department of Electrical and Computer Engineering Technical Report #527, 10-1-1984. [PDF>]
A. C. Kak, Robert. J. Safranek, Malcolm Slaney, Marc Andersen. "Depth Perception for Robot Vision: A Survey of Competing Technologies." Conference on Artificial Intelligence, Rochester, MI, 1983.

1982
A. C. Kak, Mani Azimi, Malcolm Slaney. "Estimation of Porosity in Composites." Review of Progress in Quantitative NDE Conference, San Diego, CA, vol. 2a, pp. 851-866, August 1982. [PDF]