Malcolm Slaney
Publications by Year
(See also
Google Scholar)
2024
Malcolm Slaney and Matthew Fitzgeral.
Comparing human and machine speech recognition in noise with QuickSIN.
JASA Express Letters 4, 095202 (2024).
HTML
Richard F. Lyon, Rob Schonberger, Malcolm Slaney, Mihajlo Velimirovic, Honglin Yu.
The CARFAC v2 Cochlear Model in Matlab, NumPy, and JAX.
https://arxiv.org/abs/2404.17490
Jens Hjortkjaer, Daniel DE Wong, Alessandro Catania, Jonatan Marcher-Rorsted, Enea Ceolini, Soren Asp Fuglsang, Ilya Kiselev,
Giovanni Di Liberto, Shih-Chii Liu, Torsten Dau, Malcolm Slaney, Alain de Cheveigne,
Real-time control of a hearing instrument with EEG-based attention decoding.
bioRxiv, 2024.03. 01.582668. PDF
2023
C Bregler, M Covell, Malcolm Slaney.
Video rewrite: Driving visual speech with audio.
Seminal Graphics Papers: Pushing the Boundaries, Volume 2, ACM SigGraph, pp. 715-722, 2023.
PDF
A Omran, N Zeghidour, Z Borsos, F de Chaumont Quitry, Malcolm Slaney, M. Tagliasacchi.
Disentangling speech from surroundings with neural embeddings.
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes, Greece, 2023.
PDF
DT Speckhard, K Misiunas, S Perel, T Zhu, S Carlile, Malcolm Slaney.
Neural architecture search for energy-efficient always-on audio machine learning.
Neural Computing and Applications 35 (16), 12133-12144, 2023.
PDF
Malcolm Slaney.
Machine Learning for Audition.
Keynote at the Virtual Conference on Computational Audition, 2023.
2022
C Han, EM Kaya, K Hoefer, Malcolm Slaney, S Carlile.
Multi-channel speech denoising for machine ears.
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022.
PDF
2021
Malcolm Slaney, Richard F. Lyon.
Speech and hearing for the next billion users.
Invited presentation at the 181st Meeting
Acoustical Society of America,
Seattle, Washington, December 2021.
Artem Dementyev, Pascal Getreuer, Dimitri Kanevsky, Malcolm Slaney,
Richard F Lyon.
VHP: Vibrotactile Haptics Platform for On-body Applications.
UIST '21: The 34th Annual ACM Symposium on User Interface
Software and Technology,
October 2021 Pages 598-612.
PDF
Alain de Cheveigné, Malcolm Slaney, Søren A Fuglsang and Jens Hjortkjaer. Auditory stimulus-response modeling with a match-mismatch task.
Journal of Neural Engineering, Volume 18, Number 4, 2021.
PDF
2020
Malcolm Slaney, Richard F Lyon, Ricardo Garcia, Brian Kemler, Chet Gnegy, Kevin Wilson, Dimitri Kanevsky, Sagar Savla, Vinton G Cerf. Auditory measures for the next billion users.
Ear and Hearing, 41, pp. 131S-139S, 2020.
PDF
Gitte Keidser, Graham Naylor, Douglas S. Brungart, Andreas Caduff, Jennifer Campos, Simon Carlile, Mark G. Carpenter, Giso Grimm, Volker Hohmann, Inga Holube, Stefan Launer, Thomas Lunner, Ravish Mehra, Frances Rapport, Malcolm Slaney, and Karolina Smeds.
The Quest for Ecological Validity in Hearing Science: What It Is, Why It Matters, and How to Advance It.
Ear and Hearing, 41. pp. 5S-19S, 2020.
PDF
Jaswanth Reddy Katthi, Sriram Ganapathy, Sandeep Kothinti, Malcolm Slaney.
Deep Canonical Correlation Analysis For Decoding The Auditory Brain.
Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), 2020.
PDF
2019
Sijia Zhao, Nga Wai Yum, Lucas Benjamin, Elia Benhamou, Makoto Yoneya, Shigeto Furukawa, Fred Dick, Malcolm Slaney, Maria Chait.
Rapid ocular responses are modulated by bottom-up driven auditory salience.
Journal of Neuroscience, 0776-19, 2019.
PDF
SC Liu, JG Harris, M Elhilali, M Slaney.
Bio-inspired Audio Processing, Models and System,
Frontiers in Neuroscience, 13, 2019.
HTML
Ariel Goldstein, Aren Jansen, Malcolm Slaney, Amy Price, Zaid Kokaja Zada, Gina Choe, Bobbi Aubrey, Aditi Rao, Lora Fanda, Kenneth Norman, Adeen Flinker, Orrin Devinsky, Michael Brenner, Uri Hasson.
Temporal Dynamics of Meaning. 2019 Conference on Cognitive Computational Neuroscience, Berlin, Germany, 13-16 September 2019.
PDF
2018
Nicholas Huang, Malcolm Slaney, Mounya Elhilali.
Connecting Deep Neural Networks to Physical, Perceptual, and Electrophysiological Auditory Signals.
Frontiers in Neuroscience, Volume 12, pages 532, 2018.
PDF
Daniel D. E. Wong, Soren A. Fuglsang, Jens Hjortkaer, Enea Ceolini, Malcolm Slaney, Alain de Cheveigne.
A Comparison of Regularization Methods in Forward and Backward Models for Auditory Attention Decoding.
Frontiers in Neuroscience, Volume 12, 531, 2018.
PDF
Ken Hoover, Sourish Chaudhuri, Caroline Pantofaru, Ian Sturdy, Malcolm Slaney.
Using audio-visual information to understand speaker activity: Tracking active speakers on and off screen.
In ICASSP 2018, Banff, Canada, April 2018.
PDF
MH Anderson, BW Yazel, MPF Stickle, Iniguez FD Espinosa, NS Gutierrez, Malcolm Slaney, SS Joshi, LM Miller.
Towards mobile gaze-directed beamforming: a novel neuro-technology for hearing loss.
Proc, IEEE Engineering in Medicine and Biology Society Conference, 2018.
PDF
Alain de Cheveigne, Daniel D.E. Wong, Giovanni M. Di Liberto, Jens Hjortkaer, Malcolm Slaney, Edmund Lalor.
Decoding the auditory brain with canonical component analysis.
NeuroImage, Volume 172, pp. 206-216, 15 May 2018.
PDF
2017
Shawn Hershey, Sourish Chaudhuri, Daniel Ellis, Jort Gemmeke, Aren Jansen, Channing Moore,
Manoj Plakal, Devin Platt, Rif Saurous, Bryan Seybold, Malcolm Slaney, Ron Weiss, Kevin Wilson.
CNN Architectures for Large-scale Audio Classification.
In ICASSP 2017, New Orleans, March 2017.
[arXiv PDF]
2016
Daniel D. E. Wong, Ulrich Pomper, Emina Alickovic, Jens Hjortkaer, Malcolm Slaney, Shihab Shamma, Alain de Cheveigne.
Decoding Speech Sound Source Direction from Electroencephalography Data. ARO winter meeting (abstract), February 2016.
2015
T. J. Tsai, Andreas Stolcke, and Malcolm Slaney.
A Study of Multimodal Addressee Detection in
Human-Human-Computer Interaction.
IEEE Transactions on Multimedia,
17(9), September 2015.
[PDF]
TJ Tsai, Andreas Stolcke, and Malcolm Slaney.
Multimodal addressee detection in multiparty dialogue systems.
In Proc. IEEE ICASSP, IEEE SPS, Brisbane, Australia. April 2015.
[PDF]
Anna Prokofieva, Malcolm Slaney, Dilek Hakkani-Tür.
Probabilistic features for conecting eye gaze to spoken language
understanding.
In Proc. IEEE ICASSP, IEEE SPS, Brisbane, Australia. April 2015.
[PDF]
2014
Malcolm Slaney and Dilek Hakkani-Tür. Eye gaze for speech
recognition and understanding. Demonstration at SLT 2014, South Lake
Tahoe, CA, December 2014. [PDF]
Anna Prokofieva, Dilek Hakkani-Tür, Malcolm Slaney. Eye gaze for
understanding conversational speech. IEEE Workshop on Spoken
Language Technology, South Lake Tahoe, NV, December 2014. [PDF]
Sree Harsha Yella, Andreas Stolcke, Malcolm Slaney. Artificial
neural network features for speaker diarization. IEEE Workshop
on Spoken Language Technology, South Lake Tahoe, NV, December
2014. [PDF]
Malcolm Slaney, Andreas Stolcke, Dilek Hakkani-Tür. "The relation of
eye gaze and face pose: Potential impact on speech recognition." ACM
International Conference on Multimodal Interactions (ICMI),
Istanbul, Turkey, November 2014. [PDF]
Dilek Hakkani-Tür, Malcolm Slaney, Asli Celikyilmaz, Larry Heck.
"Eye gaze for spoken language understanding in multi-modal
conversational interactions." ACM International Conference on
Multimodal Interactions (ICMI), Istanbul, Turkey, November
2014. [PDF]
Malcolm Slaney. Spectrogram Inversion Toolkit for Matlab. IEEE
Signal Processing Society SLTC Newsletter, November 2014. [PDF]
Phil Pitts, Arrigo Benedetti, Malcolm Slaney, and Phil Chou. Time of
Flight Tracer. Microsoft Research Technical Report MSR-TR-2014-142,
November 2014. [Technical
report or link to
code]
Malcolm Slaney, Michael L. Seltzer. "The influence of pitch and
noise on the discriminability of filterbank features." Proceedings
of Interspeech 2014, Singapore, 2014. [PDF]
Dong Yu, Adam Eversole, Michael L. Seltzer, Kaisheng Yao, Zhiheng
Huang, Brian Guenter, Oleksii Kuchaiev, Yu Zhang, Frank Seide,
Huaming Wang, Jasha Droppo, Geoffrey Zweig, Chris Rossbach, Jon
Currey, Jie Gao, Avner May, Baolin Peng, Andreas Stolcke, Malcolm
Slaney. "An Introduction to Computational Networks and the
Computational Network Toolkit." Microsoft Research, MSR-TR-2014-112,
October 2014. [Technical
report or link to code]
Yan Huang, Malcolm Slaney, Michael L. Seltzer, and Yifan Gong.
"Towards better performance with heterogeneous training data in
acoustic modeling using deep neural networks." Proceedings
of Interspeech 2014, Singapore, 2014. [PDF]
Malcolm Slaney. "Spectrogram Inversion Toolbox." Microsoft Research,
September 2014. [Software]
Sook Young Won, Jonathan Berger, and Malcolm Slaney. "Simulation of
one's own voice in a two-parameter model." Proceedings of the
International Conference on Music Perception and Cognition
(ICMPC), Seoul, South Korea, August 2014. [PDF]
Neville Ryant, Malcolm Slaney, Mark Liberman, Elizabeth Shriberg,
and Jiahong Yuan. "Highly accurate mandarin tone classification in
the absence of pitch information." In the Proceedings of Speech
Prosody, Dublin, May 2014. [PDF]
Malcolm Slaney, Rahul Rajan, Andreas Stolcke, and Partha
Parthasarathy. "Gaze-enhanced speech recognition."
In Proc. IEEE
ICASSP, IEEE SPS, Florence, Italy. May 2014. [PDF]
James A. O'Sullivan, Alan J. Power, Nima Mesgarani,
Siddharth Rajaram, John J. Foxe, Barbara G.
Shinn-Cunningham, Malcolm Slaney, Shihab A. Shamma and
Edmund C. Lalor. "Attentional Selection in a Cocktail Party
Environment Can Be Decoded from Single-Trial EEG." Cerebral
Cortex, January 2014. [PDF]
2013
Blair Kaneshiro, Hyung-Suk Kim, Jorge Herrera, Jieun Oh, Jonathan
Berger and Malcolm Slaney. "QBT-Extended: An annotated dataset of
melodically contoured tapped queries," in Proceedings of the
International Society of Music Information Retrieval (ISMIR),
Curitiba, PR, Brazil, November 2013. [PDF]
Seyed Omid Sadjadi, Malcolm Slaney, and Larry Heck. "MSR Identity
Toolbox v1.0: A MATLAB Toolbox for Speaker-Recognition Research." IEEE
Signal Processing Society SLTC Newsletter, November 2013. [Link]
Malcolm Slaney, Elizabeth Shriberg, Jui-Ting Huang. "Pitch-Gesture
Modeling Using Subband Autocorrelation Change Detection," in Proceedings
of InterSpeech 2013, Lyon, France, August 2013. [PDF and
software]
Jieun Oh, Eunjoon Cho, Malcolm Slaney. "Contours of Syllabic-Level
Units in Laughter," in Proceedings of InterSpeech 2013,
Lyon, France, August 2013. [PDF]
Seyed Omid Sadjadi,
Malcolm Slaney, and Larry Heck. "MSR Identity Toolbox v1.0: A MATLAB
Toolbox for Speaker-Recognition Research."" Microsoft Research,
November 2013. [Software]
Edmund Lalor, Nima Mesgarani, Siddharth Rajaram, Adam O'Donovan,
James Wright, Inyong Choi, Jonathan Brumberg, Nai Ding, Adrian KC
Lee, Nils Peters, Sudarshan Ramenahalli, Jeffrey Pompe, Barbara
Shinn-Cunningham, Malcolm Slaney, and Shihab Shamma. "Decoding
Auditory Attention (in Real Time) with EEG," in Proceedings of
the 37th ARO MidWinter Meeting, Association for Research in
Otolaryngology (ARO), 17 February 2013. [PDF]
Ivan Tashev and Malcolm Slaney. "Data Driven Suppression Rule for
Speech Enhancement." In Information Theory and Applications
Workshop, University of California - San Diego, 14 February
2013. [PDF]
Ramesh Jain and Malcolm Slaney. "Micro Stories and Mega Stories."
Visions and Views Column, IEEE Multimedia Magazine, January
2013. [PDF]
2012
Malcolm Slaney and Chris Bregler. "Image-based Facial Synthesis." In
Audio-Visual Speech Processing, Eric Bateson, Gérard Bailly
and Pascal Perrier, eds., Cambridge University Press, 2012.
Malcolm Slaney. "Pay Attention Please: Attention at the Telluride
Neuromorphic Cognition Workshop." IEEE Signal Processing Society
SLTC Newsletter, November 2012. [Link]
Aisling Kelliher and Malcolm Slaney. "Tell me a Story." Visions and
Views Column, IEEE Multimedia Magazine, Winter 2012. [PDF]
Juhan Nam, Jorge Herrera, Malcolm Slaney, Julius Smith. "Learning
Sparse Feature Representations for Music Annotation and Retrieval."
Proceedings of the International Society of Music-Information
Retrieval, Porto, Portugal, October 2012. [PDF]
Klara Nahrstedt and Malcolm Slaney, Malcolm. "Coulda, woulda,
shoulda: 20 years of multimedia opportunities." Proceedings of
the 20th ACM International Conference on Multimedia, Nara,
Japan, 2012. [PDF]
Ajay Divakaran, Malcolm Slaney, and Martha Larson. "Audio analysis
for consumer and other industrial applications." Proceedings of the
2012 ACM international workshop on Audio and multimedia methods for
large-scale video analysis, AMVA '12, Nara, Japan, pp.
33-34, 2012. [PDF]
Malcolm Slaney, Yury Lifshits, Junfeng He. "Optimal Parameters for
Locality-Sensitive Hashing." In a special issue of the Proceedings
of the IEEE on Web-Scale Multimedia, September 2012. [PDF
and Code]
David Ayman Shamma and Malcolm Slaney. "Don’t Click Here." Visions
and Views Column, IEEE Multimedia Magazine, Summer 2012. [PDF]
Malcolm Slaney, Trevor Agus, Shih-Chii Liu, Merve Kaya, Mounya
Elhilali. "A Model of Attention-Driven Scene Analysis." Proceedings
of the International Conference on Acoustics, Speech and Signal
Processing, Kyoto, Japan, March, 2012. [PDF]
2011
Dulce Ponceleon and Malcolm Slaney. "Multimedia Information
Retrieval." In Modern Information Retrieval, Ricardo
Baeza-Yates and Berthier Ribeiro-Neto, Second Edition, 2011.
Lyndon Kennedy, Malcolm Slaney. "Identifying Authoritative Sources
of Multimedia Content." Proceedings of the ACM Conference on
Multimedia, Scottsdale, AZ, November 2011. [PDF]
Malcolm Slaney. "Precision-Recall is Wrong for Multimedia." Visions
and Views Column, IEEE Multimedia Magazine, Fall 2011. [PDF]
Juhan Nam, Jiquan Ngiam, Honglak Lee and Malcolm Slaney. "A
Classification-based Polyphonic Piano Transcription Approach using
Learned Feature Representations." In Proceedings of the
International Symposium on Music Information Retrieval (ISMIR),
October 24, 2011. [PDF]
Malcolm Slaney. "Does Content Matter?" Visions and Views Column, IEEE
Multimedia Magazine, Summer 2011. [PDF]
Benjamin M. Marlin, Richard S. Zemel, Sam Roweis, and Malcolm
Slaney. "Recommender Systems: Missing Data and Statistical Model
Estimation." In IJCAI Best Paper Session, Barcelona, Spain,
July 2011. [PDF]
Vidhya Navalpakkam, Justin Rao, Malcolm Slaney. "Using Gaze Patterns
to Measure and Detect Distraction-induced Struggles while Reading."
In Proceedings of the ACM SIGCHI Conference on Human Factors in
Computing Systems, Vancouver, Canada, 2011. [PDF]
Malcolm Slaney and Patrick Naylor. Invited presentation for "Trends
Expert Summary: Audio and Acoustic Signal Processing," ICASSP,
Prague, CZ, 2011.
Malcolm Slaney. "Web-Scale Multimedia Indexing and Retrieval."
Invited talk at Cal IT Conference, UCSD, February 2011.
Malcolm Slaney. "Web-Scale Multimedia Indexing and Retrieval."
Invited keynote at IS&T and SPIE International Conference on
Multimedia Content Access: Algorithms and Systems V, San Francisco,
CA, January 2011.
Cees G.M. Snoek, Malcolm Slaney. "Academia Meets Industry at the
Multimedia Grand Challenge." In IEEE Multimedia Magazine,
pp. 4-7, January 2011. [PDF]
2010
Gregory Sell and Malcolm Slaney. "Solving Demodulation as an
Optimization Problem." IEEE Transactions on Audio, Speech and
Language Processing, pp. 2051-2066, Nov. 2010. [PDF
and code]
Dhruv Kumar Mahajan and Malcolm Slaney. "Image classification using
the web graph." In Proceedings of the International Conference
on Multimedia (MM '10). ACM, Florence, Italy, 991-994, 2010. [PDF]
Greg Sell and Malcolm Slaney. "The Information Content of
Demodulated Speech," Proceedings of the International Conference
on Acoustics, Speech and Signal Processing, Dallas, Texas,
March, 2010. [PDF]
Malcolm Slaney. "Multimodal Retrieval and Ranking: More than
Waveforms." Invited talk at 11th ACM SIGMM International
Conference on Multimedia Information Retrieval, Philadelphia,
PA, 241-242, March 2010.
2009
Eva Hörster, Malcolm Slaney, Marc’Aurelio Ranzato, Kilian
Weinberger. "Unsupervised Image Ranking." Proceedings of the
2009 ACM Multimedia: Workshop on Large-Scale Multimedia Retrieval
and Mining, Beijing, China, October 2009. [PDF]
Lyndon Kennedy, Malcolm Slaney, Kilian Weinberger. "Reliable Tags
Using Image Similarity: Mining Specificity and Expertise from
Large-Scale Multimedia Databases." Proceedings of the 2009 ACM
Multimedia: Workshop on Web-Scale Multimedia Corpus, Beijing,
China, October 2009. [PDF]
Theodore Yu, Andrew Schwartz, John Harris, Malcolm Slaney, and
Shih-Chii Liu. "Periodicity Detection and Localization using Spike
Timing from the AER EAR." Proceedings of the 2009 IEEE
International Symposium on Circuits and Systems, Taipei,
Taiwan, May 2009. [PDF]
Misha Pavel, Malcolm Slaney, Hynek Hermansky. "Reconciliation of
Human and Machine Speech Recognition Performance." Proceedings
of the 2008 International Conference on Acoustics, Speech and
Signal Processing, Taipei, Taiwan, April 2009. [PDF]
2008
Kilian Weinberger, Malcolm Slaney, Roelof van Zwol. "Resolving Tag
Ambiguity." Proceedings of the 16th ACM international Conference
on Multimedia, Vancouver, British Columbia, Canada, pp.
111-120, October 26-31, 2008. [PDF]
Malcolm Slaney, Kilian Weinberger, William White. "Learning a Metric
for Music Similarity." Proceedings of the International Society
of Music-Information Retrieval, pp. 313-318, Philadelphia, PA,
September, 2008. [PDF]
Eva Hörster, Rainer Lienhart, Malcolm Slaney. "Continuous Visual
Vocabulary Models for pLSA-Based Scene Recognition." ACM
International Conference on Image and Video Retrieval (CIVR) 2008,
pp. 319-328, Niagara Falls, Canada, 2008. [PDF]
Eva Hörster, Thomas Greif, Rainer Lienhart, Malcolm Slaney.
"Comparing Local Feature Descriptors in pLSA-Based Image Models." 30th
Annual Symposium of the German Association for Pattern Recognition
(DAGM), G. Rigoll, Ed. Lecture Notes In Computer Science, vol.
5096. Springer-Verlag, pp. 446-455, Munich, Germany, 2008. [PDF]
Michael Casey, Christophe Rhodes, Malcolm Slaney. "Analysis of
Minimum Distances in High-Dimensional Musical Spaces." IEEE
Transactions on Audio, Speech, and Language Processing, vol.16,
no.5, pp.1015-1028, July 2008. [PDF]
Michael A. Casey, R. Veltkamp, M. Goto, M. Leman, C. Rhodes, Malcolm
Slaney. "Content-Based Music Information Retrieval: Current
Directions and Future Challenges." Proceedings of the IEEE,
vol.96, no.4, pp.668-696, April 2008. [PDF]
Malcolm Slaney, Michael Casey. "Locality-Sensitive Hashing for
Finding Nearest Neighbors." IEEE Signal Processing Magazine,
vol.25, no.2, pp.128-131, March 2008. [PDF]
Malcolm Slaney, D. P. W. Ellis, M. Sandler, M. Goto, M. Goodwin.
"Introduction to the Special Issue on Music Information Retrieval."
IEEE Transactions on Audio, Speech, and Language Processing,
vol.16, no.2, pp.253-254, Feb. 2008 [PDF]
Kyogu Lee, Malcolm Slaney. "Acoustic Chord Transcription and Key
Extraction from Audio using Key-dependent HMMs Trained on
Synthesized Audio." IEEE Transactions on Audio, Speech, and
Language Processing, vol.16, no.2, pp.291-301, Feb. 2008. [PDF]
2007
Malcolm Slaney, William White. "Similarity Based on Rating Data." Proceedings
on the International Society of Music-Information Retrieval,
pp. 479-484, Vienna, Austria, Sept. 2007. [PDF]
Kyogu Lee and Malcolm Slaney. "A Unified System for Chord
Transcription and Key Extraction Using Hidden Markov Models." Proceedings
on the International Society of Music-Information Retrieval, Vienna,
Austria, September 2007. [PDF]
Eva Hörster, Rainer Lienhart, Malcolm Slaney. "Image Retrieval on
Large-scale Image Databases." Proceedings of the 6th ACM
International Conference on Image and video retrieval CIVR ’07, July
2007. [PDF]
Benjamin M. Marlin, Richard S. Zemel, Sam Roweis, and Malcolm
Slaney. "Collaborative Filtering and the Missing at Random
Assumption." In the Proceedings of the 23rd Conference on
Uncertainty in Artificial Intelligence (UAI2007), IJCAI, July
2007. [PDF]
Rainer Lienhart, Malcolm Slaney. "pLSA on Large-scale Image
Databases." Proceedings of the 2007 International Conference on
Acoustics, Speech and Signal Processing, Honolulu, Hawaii,
April 2007. [PDF]
David Anderson, Sourabh Ravindran, Malcolm Slaney. "Varying Time
Constants and Gain Adaptation in Feature Extraction for Speech
Processing." Proceedings of the 2007 International Conference on
Acoustics, Speech and Signal Processing, Honolulu, Hawaii,
April 2007. [PDF]
Michael Casey, Malcolm Slaney. "Fast Recognition of Remixed Music
Audio." Proceedings of the 2007 International Conference on
Acoustics, Speech and Signal Processing, Honolulu, Hawaii,
April 2007. [PDF]
S. H. Srinivasan and M. Slaney. "A Bipartite Graph Model for
Associating Images and Text." IJCAI-2007 Workshop on Multimodal
Information Retrieval, Hyderabad, India, January 6, 2007. [PDF]
2006
Malcolm Slaney and William White. "Measuring Playlist Diversity for
Recommendation Systems." Proceedings of the Audio and Music
Computing for Multimedia Workshop in Conjunction with ACM
Multimedia, October 2006. [PDF]
Song Chon, Malcolm Slaney and Jonathan Berger. "Predicting Success
from Music Sales Data: A Statistical and Adaptive Approach." Proceedings
of the Audio and Music Computing for Multimedia Workshop in
Conjunction with ACM Multimedia, October 2006. [PDF]
Kyogu Lee, Malcolm Slaney. "Automatic Chord Recognition from Audio
Using a Supervised HMM Trained with Audio-from-Symbolic Data."
Proceedings of the Audio and Music Computing for Multimedia
Workshop in conjunction with ACM Multimedia, October 2006. [PDF]
Kyogu Lee, Malcolm Slaney. "Automatic Chord Recognition Using an HMM
with Supervised Learning." in Proceedings of International
Conference in Music Information Retrieval, Victoria, BC,
October 2006. [PDF]
Michael Casey and Malcolm Slaney. "Song Intersection by Approximate
Nearest Neighbor Search." Proceedings on the International
Society of Music-Information Retrieval, Victoria, BC, October
2006. [PDF]
Hiroko Terasawa, Malcolm Slaney and Jonathan Berger. "A Statistical
Model of Timbre Perception." Proceedings of ISCA Tutorial and
Research Workshop on Statistical And Perceptual Audition
(SAPA2006), Pittsburgh, PA, September 2006. [PDF]
Hiroko Terasawa, Malcolm Slaney and Jonathan Berger. "Determining
the Euclidean Distance between Two Steady State Sounds." Proceedings
of 9th International Conference on Music Perception and Cognition
(ICMPC9), Bologna, Italy, August 2006. [PDF]
Nima Mesgarani, Malcolm Slaney, Shihab Shamma. "Discrimination of
Speech from Non-speech Based on Multiscale Spectro-temporal
Modulations." IEEE Transactions on Audio, Speech and Language
Processing, 14(6), 920-930, May 2006. [PDF]
Michael Casey and Malcolm Slaney. "The Importance of Sequences in
Musical Similarity." IEEE International Conference on Acoustics,
Speech and Signal Processing, Toulouse, France, May 2006. [PDF]
Daniel M. Russell, Malcolm Slaney, Yan Qu, Mave Houston. "Being
Literate with Large Document Collections: Observational Studies and
Cost Structure Tradeoffs." Proceedings of the 39th Annual Hawaii
International Conference on System Sciences (HICSS ’06),
Volume 3, 04-07 Jan. 2006, Page(s):55. [PDF]
2005
Malcolm Slaney. "The History and Future of CASA."
In Speech Separation by Humans and Machines, Editor: P. Divenyi,
Kluwer, pp 199-211, 2005. [PDF]
Hiroko Terasawa, Malcolm Slaney, Jonathan Berger. "The thirteen
colors of timbre." Proceedings of the 2005 IEEE Workshop on
Applications of Signal Processing to Audio and Acoustics, New
Paltz, NY, October 16-19, 2005. [PDF]
Grace Crowder, Sterling Foster, Daniel M. Russell, Malcolm Slaney,
Lisa Yanguas. "Analytic worksheets: A framework to support human
analysis of large streaming data volumes." Proceedings of
Interact 2005, Rome, Italy, September 12-16, 2005. [PDF]
A. Ihlefeld, and Malcolm Slaney. "The Story of AudioSapiana." The
Neuromorphic Engineer,
http://www.ine-news.org/view.php?source=0029-2005-12-1, 2005. [Link]
Hiroko Terasawa, Malcolm Slaney, Jonathan Berger. "A Timbre Space
for Speech." Proceedings of InterSpeech 2005, Lisbon,
Portugal, September 4-8, 2005. [PDF]
Hiroko Terasawa, Malcolm Slaney, Jonathan Berger. "Perceptual
Distance in Timbre Space." Japan Acoustical Society, August,
2005, (in Japanese.) [PDF]
Hiroko Terasawa, Malcolm Slaney, Jonathan Berger. "Perceptual
distance in timbre space." Proceedings of International
Conference on Auditory Display (ICAD) 2005, Limerick,
Ireland, July 6-9, 2005. [PDF]
Daniel M. Russell, Malcolm Slaney, Yan Qu, Mave Houston. "A Cost
Structure Analysis of Manual and Computer-supported Sensemaking
Behavior." Proceedings of Intelligence Analysis 2005,
McLean, VA, May 2-6, 2005. [PDF]
Malcolm Slaney and D. M. Russell. "Measuring information
understanding in large document collections." HICSS ’05.
Proceedings of the 38th Annual Hawaii International Conference on
System Sciences, Page(s):105-105, Jan. 03-06, 2005. [PDF]
2004
Nima Mesgarani, Malcolm Slaney, Shihab Shamma. "Speech
discrimination based on multiscale spectro-temporal modulations." Proceedings
IEEE International Conference on Acoustics, Speech, and Signal
Processing. (ICASSP '04), Page(s):I-601-604, May 17-21, 2004.
[PDF]
Sourabh Ravindran, David Anderson, Malcolm Slaney. "Low-power Audio
Classification for Ubiquitous Sensor Networks." Proceedings IEEE
International Conference on Acoustics, Speech, and Signal
Processing. (ICASSP '04), Page(s):iv-337-340, May 17-21 2004.
[PDF]
2003
Malcolm Slaney, Dulce Ponceleon, James Kaufman. "Understanding the
Semantics of Media." In Video Mining, A. Rosenfeld, D.
Doermann, D. DeMenthon (editors). Kluwer Academic Publishers,
Boston, 2003. [PDF]
Malcolm Slaney, Jayashree Subrahmonia, Paul Maglio. "Modeling
Multitasking Users." Published in Spring-Verlag Lecture Notes in
Artificial Intelligence, UM2003 User Modeling: Proceedings of
the Ninth International Conference, July 2003. [PDF]
2002
Malcolm Slaney, Gerald McRoberts. "BabyEars: A Recognition System
for Affective Vocalizations." Speech Communication, 2002. [PDF]
Malcolm Slaney. "Mixtures of Probability Experts for Audio Retrieval
and Indexing." Proceedings of the International Conference on
Multimedia and Expo, Lausanne, Switzerland, August 2002. [PDF]
Malcolm Slaney. "Semantic-Audio retrieval." Invited paper in Proceedings
of 2002 International Conference on Acoustics, Speech and Signal
Processing, Orlando, CA, May 2002. [PDF]
2001
Malcolm Slaney, Dulce Ponceleon, James Kaufmann. "Multimedia edges:
Finding Hierarchy in all Dimensions." Proceedings ACM Multimedia
Conference, Los Angeles, CA, October 2001. [Link]
Malcolm Slaney and Dulce Ponceleon. "Hierarchical segmentation using
latent semantic indexing in scale space." Proceedings of the
2001 International Conference on Acoustics, Speech and Signal
Processing, Salt Lake City, UT, May 2001. [PDF]
Michele Covell, Malcolm Slaney and Art Rothstein. "FastMPEG:
Time-scale Modification of Bit-compressed Audio Information." Proceedings
of the 2001 International Conference on Acoustics, Speech and
Signal Processing, Salt Lake City, UT, May 2001. [PDF]
Malcolm Slaney, Dulce Ponceleon. "Hierarchical segmentation: Finding
changes in a Text Signal." Proceedings of the SIAM Text Mining
2001 Workshop, Chicago, IL, pp. 6-13, April 7, 2001 [PDF]
Malcolm Slaney, Michele Covell. "FaceSync: A Linear Operator for
Measuring Synchronization of Video Facial Images and Audio Tracks."
In Advances in Neural Information Processing Systems 13,
edited by Leen, Todd K., Dietterich, Thomas G. and Tresp, Volker,
MIT Press, 2001. [PDF]
Steve Greenberg, Malcolm Slaney (editors). Computational Models
of Auditory Function. IOS Press, Amsterdam, 2001.
[Full Text as PDF]
2000
Malcolm Slaney, Better CHI with Signal Computation, Talk at
Stanford PCD Seminar, CS 547, January 7, 2000. [link]
Malcolm Slaney and Michele Covell, "Matlab Multidimensional Scaling
Tools," Interval Technical Report #2000-025, 2000. [link]
1998
Malcolm Slaney. "Auditory Toolbox," Interval Research Technical
Report 1998-010, 1998. [Web
site for software]
Michele Covell, Margaret Withgott, and Malcolm Slaney. "Mach1:
Nonuniform Time-scale Modification of Speech," Proceedings IEEE
International Conference on Acoustics, Speech, and Signal
Processing, Seattle WA, vol. 1, pp. 349-352, May 12-15 1998.
[PDF,
Technical Report with audio]
Christoph Bregler, Michele Covell, Malcolm Slaney. "Video Rewrite:
Photorealistic Synthetic Lip Sync." Proceedings INA Imagina,
Monte Carlo, Monaco, pp.193-203, March 4-6 1998, (invited). [PDF]
Malcolm Slaney and Gerald McRoberts. "BabyEars: A Recognition System
for Affective Vocalizations." Proceedings of the 1998
International Conference on Acoustics, Speech, and Signal
Processing, Seattle, WA, vol. 2, pp. 985-988, May 12-15,
1998. [PDF]
Malcolm Slaney. "Connecting Correlograms to Neurophysiology and
Psychoacoustics." In Psychophysical and Physiological Advances
in Hearing, A.R. Palmer, A. Rees, A.Q. Summerfield and R.
Meddis (editors). Whurr Publishers, London, 1998. [PDF]
Malcolm Slaney. "A Critique of Pure Audition." In Computational
Auditory Scene Analysis, David F. Rosenthal , Hiroshi G.
Okuno (editors). Erlbaum, Mahwah, N.J. 1998. [ PDF
or Web
site]
1997
Christoph Bregler, Michele Covell, Malcolm Slaney. "Video Rewrite:
Visual speech Synthesis from Video." Proceedings the 1997 ACM
SIGGRAPH, Los Angeles, pp. 353-360, August 1997. [PDF
and paper
with samples]
Eric Scheirer, Malcolm Slaney. "Construction and Evaluation of a
Robust Multifeature Speech/music Discriminator." Proceedings of
the 1997 IEEE International Conference on Acoustics, Speech and
Signal Processing, vol. 2, pp. 1331-1334, April 1997. [PDF]
1996
Christoph Bregler, Stephen Omohundro, Michele Covell, Malcolm
Slaney, Subutai Ahmad, David A.Forsyth, Jerry A. Feldman.
"Probabilistic Models of Verbal and Body Gestures." In Computer
Vision in Man-Machine Interfaces, R. Cipolla, A.Pentland
(editors). Cambridge University Press, Cambridge, UK, 1996. [PDF]
1995
Malcolm Slaney. "Pattern Playback in the ’90s." in Advances in
Neural Information Processing Systems 7, Gerald Tesauro, David
Touretzky,Todd Leen (editors). MIT Press, Cambridge, MA, pp.
827-834, 1995. [PDF]
Malcolm Slaney. "Pattern Playback from 1950 to 1995." 1995 IEEE
International Conference on Systems, Man and Cybernetics,
Seattle, Vol. 4, pp. 3519-3524, Oct. 1995. [Link]
Chris Bregler and Malcolm Slaney, "Snakes-A MatLab MEX file to
demonstrate snake contour-following," Interval Technical Report
#1995-017, 1995. [Link
to code]
Malcolm Slaney. "A Critique of Pure Audition." Proceedings of the
Computational Auditory Scene Analysis Workshop, 1995
International Joint Conference on Artificial Intelligence, Montreal,
Canada, August 19-20, 1995. [Link]
Malcolm Slaney, Michele Covell, Bud Lassiter. "Automatic Audio
Morphing." Proceedings of the 1995 International Conference on
Acoustics Speech and Signal Processing, Atlanta, GA, vol. 2, pp.
1001-1004, May 1995. [PDF or paper
with sound examples]
1994
Malcolm Slaney, Daniel Naar, Richard F. Lyon. "Auditory Model
Inversion for Sound Separation." Proceedings of the 1994
International Conference on Acoustics Speech and Signal
Processing, Adelaide, SA, Australia, vol. II, pp. 77-80, April
1994. [PDF]
Malcolm Slaney. "An Introduction to Auditory Model Inversion."
Invited talk at the 1994 ATR Workshop on a Biological Framework for
Speech Perception and Production, Kyoto Japan, September 16-17,
1994. [Link
to paper and sound examples]
1993
Malcolm Slaney, Richard F. Lyon. "On the Importance of Time: A
Temporal Representation of Sound." In Visual Representations of
Speech, Martin Cooke, Steve Beet, Malcolm Crawford (editors).
J. Wiley, New York, pp. 95-116, 1993. [PDF]
Malcolm Slaney. "An Efficient Implementation of the
Patterson-Holdsworth Auditory Filter Bank." Apple Computer Technical
Report #35, Apple Computer, Inc., Cupertino, CA, 1993. [Mathematica
notebook and PDF]
Malcolm Slaney. "A Review of Filter Design." Apple Computer
Technical Report #34, Apple Computer, Inc., Cupertino, CA, 1993. [Mathematica
notebook and PDF]
1992
Malcolm Slaney. "Interactive Signal-processing Documents." In Symbolic
and Knowledge-Based Signal Processing, Alan V. Oppenheim, S.
Hamid Nawab (editors) Prentice Hall, Englewood Cliffs, NJ, pp.
173-204, 1992. [PDF]
Malcolm Slaney. "On the Importance of Time." Invited talk at the
1992 Workshop on Music Representations, Capri, Italy, October 1992.
Malcolm Slaney. "On the Importance of Time---A Temporal Representation
of Sound." Invited talk at the ESCA Workshop on Visual
Representations of Speech, Sheffield, England, April 1992.
Malcolm Slaney. "Ear." Published as the SPECfp92 benchmark "ear,"
1992. [Link]
Malcolm Slaney. "MacEar: A program that implements a cochlear
model." [Code]
1991
Mark Fanty, Ron Cole, Malcolm Slaney. "A Comparison of DFT, PLP and
Cochleagram for Alphabet Recognition." Conference Record of the
Twenty-Fifth Asilomar Conference on Signals, Systems and
Computers, Pacific Grove, CA, vol.1, pp. 326-329, November
1991. [PDF]
Malcolm Slaney, Richard. F. Lyon. "Apple Hearing Demo Reel." Apple
Computer Technical Report #25, Apple Computer, Inc., Cupertino, CA,
1991. [New
web site or original
PDF]
1990
Malcolm Slaney. "Interactive Signal Processing Documents." IEEE
ASSP Magazine, pp. 8-20, April 1990. [PDF]
Yeshwant K. Muthusamy, Ron Cole, Malcolm Slaney.
"Speaker-independent Vowel Recognition: Spectrograms versus
Cochleagrams." Proceedings of the 1990 International Conference
on Acoustics, Speech, and Signal Processing, Albuquerque, NM,
vol. 5, pp. 533-536, April 1990. [PDF]
Richard Duda, Richard Lyon, Malcolm Slaney. "Correlograms and the
Separation of Sounds." Conference Record: Twenty-Fourth Asilomar
Conference on Signals, Systems, and Computers, Pacific Grove,
CA, pp. 457-461, November 1990. [PDF]
Malcolm Slaney, Richard F. Lyon. "A Perceptual Pitch Detector." Proceedings
of the 1990 International Conference on Acoustics, Speech, and
Signal Processing, Albuquerque, NM, vol. 1, pp. 357-360, April
1990. [PDF]
1989
Mike Obermier, Gus Pabon, Malcolm Slaney, Larry Yaeger, Steve Nowlan.
"Supercomputer research and engineering applications at Apple." Cray
Channels, vol.11, no.3, p. 6-11, Fall 1989.
[Large PDF, 100M]
Malcolm Slaney. "Implementing Cochlear Models." Invited talk at 1989
IEEE Asilomar Microprocessor Workshop, Asilomar Conference Center,
Pacific Grove, CA, April 27, 1989.
Malcolm Slaney. "How Apple uses a CRAY Supercomputer to Design
Personal Computers." Invited talk at Cray Research Japan Ltd.
Technical Seminar, Tokyo, Japan, April 18, 1989.
1988
A. C. Kak, Malcolm Slaney. Principles of Computerized
Tomographic Imaging. IEEE Press, New York, 1988. Republished
by the Society of Industrial and Applied Mathematics (SIAM) in their
series "Classics in Applied Mathematics," 2001. This book is
available on the Internet at http://www.slaney.org/pct.
[Code]
Malcolm Slaney. "Lyon's Cochlear Model." Apple Computer Technical
Report #13, Apple Computer, Inc., Cupertino, CA, November 1988. [Mathematica
notebook or PDF]
Malcolm Slaney. "SunTroff: A program for displaying device
independent TROFF on a Sun Workstation."" Published on the 1988 Sun
Users Group Tape. Basis for the Linux gxditview
tool.
1986
Malcolm Slaney, Mani Azimi, A. C. Kak, Lawrence. E. Larson.
"Microwave Imaging with First-order Diffraction Tomography." In Medical
Applications of Microwave Imaging, L. E. Larsen, J. H. Jacobi
(editors). IEEE Press, New York, pp. 184-212, 1986. [PDF]
1985
Malcolm Slaney, A. C. Kak. "Imaging with Higher Order Diffraction
Tomography." Proceedings of the IEEE 1985 Ultrasonics Symposium,
San Francisco, vol. 2, pp. 808-813, 1985. [PDF]
Malcolm Slaney. "Imaging with Diffraction Tomography." PhD
Dissertation, Purdue University, 1985. [PDF
or code]
1984
Malcolm Slaney, A. C. Kak, L.E. Larson. "Limitations of Imaging
with First-order Diffraction Tomography." IEEE Transactions on
Microwave Theory and Techniques, vol. MTT-32, pp. 860-873,
August 1984. [PDF]
1983
Malcolm Slaney, A. C. Kak. "Diffraction Tomography." Inverse
Optics: Proceedings of the SPIE, Vol. 412, Arlington, VA, pp.
2-19, April 1983. [PDF]
Carl Crawford, Mani Azimi, Malcolm Slaney.
"The CRC Plotting Package."
Department of Electrical and Computer Engineering Technical Report #527, 10-1-1984.
[PDF>]
A. C. Kak, Robert. J. Safranek, Malcolm Slaney, Marc Andersen.
"Depth Perception for Robot Vision: A Survey of Competing
Technologies." Conference on Artificial Intelligence, Rochester, MI,
1983.
1982
A. C. Kak, Mani Azimi, Malcolm Slaney. "Estimation of Porosity in
Composites." Review of Progress in Quantitative NDE Conference, San
Diego, CA, vol. 2a, pp. 851-866, August 1982. [PDF]