We recently asked the speakers of MLconf NYC 2016 to share their favorite papers with the MLconf audience. We hope you find this list interesting and educational!
Kaheer Suleman, CTO, Maluuba
Pointer Networks
Oriol Vinyals, Meire Fortunato, Navdeep Jaitly
http://arxiv.org/abs/1506.03134
Grammar as a Foreign Language
Oriol Vinyals, Lukasz Kaiser, Terry Koo, Slav Petrov, Ilya Sutskever, Geoffrey Hinton
http://arxiv.org/abs/1412.7449
Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models
Iulian V. Serban, Alessandro Sordoni, Yoshua Bengio, Aaron Courville, Joelle Pineau
http://arxiv.org/abs/1507.04808
Skip-Thought Vectors
Ryan Kiros, Yukun Zhu, Ruslan Salakhutdinov, Richard S. Zemel, Antonio Torralba, Raquel Urtasun, Sanja Fidler
http://arxiv.org/abs/1506.06726
Minimally Constrained Multilingual Word Embeddings via Artificial Code Switching
Michael Wick, Pallika Kanani, Adam Pocock
https://blogs.oracle.com/IRML/entry/minimally_constrained_word_embeddings_via

Samantha Kleinberg, Assistant Professor of Computer Science, Stevens Institute of Technology
Deming, data and observational studies
Young, S. Stanley, and Alan Karr. Significance 8.3 (2011): 116-120.
Homophily and contagion are generically confounded in observational social network studies
Shalizi, Cosma Rohilla, and Andrew C. Thomas. Sociological Methods & Research 40.2 (2011): 211-239.
How to grow a mind: Statistics, structure, and abstraction
Tenenbaum, J. B., Kemp, C., Griffiths, T. L., & Goodman, N. D. (2011). Science 331(6022): 1279–1285.
(computational models of cognition, which give some inspiration to machine learning methods)
Personalized nutrition by prediction of glycemic responses
Zeevi, David, et al. Cell 163.5 (2015): 1079-1094.

Ike Nassi, Founder, TidalScale
Computing Marginals Using MapReduce
Ullman et al.
http://arxiv.org/abs/1509.08855
Framework for an In-depth Comparison of Scale-up and Scale-out
Sevilla, Ioannidou, Nassi, et al.
https://issdm.soe.ucsc.edu/sites/default/files/sevilla-discs13.pdf

Lei Yang, Senior Engineering Manager, Quora
Hidden technical debt in machine learning systems
Mastering the game of Go with deep neural networks and tree search
Predictability of popularity

Jennifer Marsman, Principal Developer Evangelist, Microsoft
Paper behind the Emotion Detection API in Microsoft Cognitive Services
Deep Neural Decision Forests. [Winner of the David Marr Prize 2015]
Microsoft Research publications

Damien Lefortier, Senior Machine Learning Engineer and Tech Lead in the Prediction Machine Learning team, Criteo
Simple and scalable response prediction for display advertising
O. Chapelle et al.
One-Pass Ranking Models for Low-Latency Product Recommendations
A. Freno et al.
Ad Click Prediction: a View from the Trenches
H. B. McMahan et al.

Yael Elmatad, Senior Data Scientist, Tapad
Finding Connected Components in Map-Reduce in Logarithmic Rounds
http://arxiv.org/pdf/1203.5387.pdf
College Admissions and the Stability of Marriage
Gale, D.; Shapley, L. S. (1962). American Mathematical Monthly 69: 9–14. doi:10.2307/2312726. JSTOR 2312726.