We recently asked the speakers of MLconf SEA 2016 to share their favorite papers with the MLconf audience. We hope you find this list interesting and educational!
Avi Pfeffer, Principal Scientist, Charles River Analytics
Kristian Kersting, Associate Professor for Computer Science, TU Dortmund University, Germany
Open-World Probabilistic Databases
Ismail Ilkan Ceylan, Adnan Darwiche, Guy Van den Broeck
http://web.cs.ucla.edu/~guyvdb/papers/CeylanKR16.pdf
Incremental Knowledge Base Construction Using DeepDive
Jaeho Shin, Sen Wu, Feiran Wang, Christopher De Sa, Ce Zhang, Christopher Ré
http://www.vldb.org/pvldb/vol8/p1310-shin.pdf
Revisiting Frank-Wolfe: Projection-Free Sparse Convex Optimization
Martin Jaggi
http://m8j.net/math/revisited-FW.pdf
Deep Symmetry Networks
Robert Gens, Pedro M. Domingos
http://homes.cs.washington.edu/~pedrod/papers/nips14.pdf
Admixture of Poisson MRFs: A Topic Model with Word Dependencies
David I. Inouye, Pradeep Ravikumar, Inderjit S. Dhillon
http://jmlr.org/proceedings/papers/v32/inouye14.pdf
Florian Tramèr, Researcher, EPFL
Adversarial Learning
Lowd & Meek, KDD, 2005
http://research.microsoft.com/pubs/73510/kdd05lowd.pdf
Practical Evasion of a Learning-Based Classifier: A Case Study
Srndic & Laskov, IEEE S&P, 2014
http://www.utdallas.edu/~muratk/courses/dmsec_files/srndic-laskov-sp2014.pdf
Can Machine Learning Be Secure?
Barreno et al, ASIACCS, 2006
http://www.cs.berkeley.edu/~tygar/papers/Machine_Learning_Security/asiaccs06.pdf
Privacy in Pharmacogenetics: An End-to-End Case Study of Personalized Warfarin Dosing
Fredrikson et al, USENIX Security, 2014
https://www.usenix.org/system/files/conference/usenixsecurity14/sec14-paper-fredrikson-privacy.pdf
Jason Baldridge, Associate Professor of Computational Linguistics, University of Texas at Austin
A Supertag-Context Model for Weakly-Supervised CCG Parser Learning
Dan Garrette, Chris Dyer, Jason Baldridge, and Noah Smith. (2015)
https://aclweb.org/anthology/K/K15/K15-1003.pdf
Hierarchical Discriminative Classification for Text-Based Geolocation
Ben Wing and Jason Baldridge. (2014)
http://aclweb.org/anthology/D/D14/D14-1039.pdf
A recursive estimate for the predictive likelihood in a topic model
James Scott and Jason Baldridge
https://github.com/utcompling/topicmodel-eval/blob/master/scott-baldridge-aistats13.pdf?raw=true
Amanda Casari, Senior Data Scientist, Concur Technologies
Clustering of Time Series Subsequences is Meaningless: Implications for Previous and Future Research
Eamonn Keogh & Jessica Lin, Computer Science & Engineering Department University of California – Riverside {eamonn, jessica}@cs.ucr.edu
http://www.cs.ucr.edu/~eamonn/meaningless.pdf
Antisocial Behavior in Online Discussion Communities
Justin Cheng , Cristian Danescu-Niculescu-Mizil , Jure Leskovec, Stanford University, Cornell University
http://arxiv.org/pdf/1504.00680v1.pdf%20
The Parable of Google Flu: Traps in Big Data Analysis
David Lazer, 1, 2 * Ryan Kennedy, 1, 3, 4 Gary King, 3 Alessandro Vespignani 3,5,6 1 Lazer Laboratory, Northeastern University, Boston, MA 02115, USA. 2Harvard Kennedy School, Harvard University, Cambridge, MA 02138, USA. 3 Institute for Quantitative Social Science, Harvard University, Cambridge, MA 02138, USA. 4University of Houston, Houston, TX 77204, USA. 5 Laboratory for the Modeling of Biological and Sociotechnical Systems, Northeastern University, Boston, MA 02115, USA. 6 Institute for Scientifi c Interchange Foundation, Turin, Italy.
http://gking.harvard.edu/files/gking/files/0314policyforumff.pdf
Erin LeDell, h2o.ai
Stacked Regressions
Leo Breiman. (1996)
http://dx.doi.org/10.1007/BF00117832
http://statistics.berkeley.edu/sites/default/files/tech-reports/367.pdf
Scalable Ensemble Learning and Computationally Efficient Variance Estimation (Doctoral Dissertation)
Erin LeDell (2015)
http://www.stat.berkeley.edu/~ledell/papers/ledell-phd-thesis.pdf
Distilling the Knowledge in a Neural Network
Geoffrey Hinton, Oriol Vinyals, Jeff Dean (2015)
http://arxiv.org/abs/1503.02531
Understanding Random Forests: From Theory to Practice (Doctoral Dissertation)
Gilles Louppe (2014)
http://www.montefiore.ulg.ac.be/~glouppe/pdf/phd-thesis.pdf
Generalized Low Rank Models
Madeleine Udell, Corinne Horn, Reza Zadeh, and Stephen Boyd (2014)
http://arxiv.org/abs/1410.0342