The Workshops on Algorithms for Modern Massive Data Sets (MMDS 2010) addressed algorithmic and statistical challenges in modern large-scale data analysis. The goals of this series of workshops are to explore novel techniques for modeling and analyzing massive, high-dimensional, and nonlinearly-structured scientific and internet data sets; and to bring together computer scientists, statisticians, mathematicians, and data analysis practitioners to promote the cross-fertilization of ideas.
Time | Talk |
---|---|
8:00 - 10:00 | Breakfast and Registration -- outside Cubberley Auditorium (at the Stanford School of Education, just off the Main Quad) |
9:45 - 10:00 | Welcome and Opening Remarks -- in Cubberley Auditorium |
10:00 - 11:00 | Tutorial: Peter Norvig Internet-Scale Data Analysis |
11:00 - 11:30 | Ashok Srivastava Virtual Sensors and Large-Scale Gaussian Processes |
11:30 - 12:00 | John Langford A Method for Parallel Online Learning |
2:00 - 3:00 | Tutorial: John Gilbert Combinatorial Scientific Computing: Experience and Challenges |
3:00 - 3:30 | Deepak Agarwal Recommender Probems for Content Optimization |
3:30 - 4:00 | James Demmel Minimizing Communication in Linear Algebra |
4:30 - 5:00 | Dmitri Krioukov Hyperbolic Mapping of Complex Networks |
5:00 - 5:30 | Mehryar Mohri Matrix Approximation for Large-Scale Learning |
5:30 - 6:00 | David Bader Massive-Scale Analytics of Streaming Social Networks |
6:00 - 6:30 | Ely Porat Fast Pseudo-Random Fingerprints |
Time | Talk |
---|---|
9:00 - 10:00 | Tutorial: Peter Bickel Statistical Inference for Networks |
10:00 - 10:30 | Jure Leskovec Inferring Networks of Diffusion and Influence |
11:00 - 11:30 | Michael W. Mahoney Geometric Network Analysis Tools |
11:30 - 12:00 | Edward Chang AdHEat - A New Influence-based Social Ads Model and its Tera-Scale Algorithms |
12:00 - 12:30 | Mauro Maggioni Intrinsic Dimensionality Estimation and Multiscale Geometry of Data Sets |
2:30 - 3:00 | Guillermo Sapiro Collaborative Hierarchical Sparse Models |
3:00 - 3:30 | Alekh Agarwal and Peter Bartlett Information-theoretic Lower Bounds on the Oracle Complexity of Convex Optimization |
3:30 - 4:00 | John Duchi and Yoram Singer Composite Objective Optimization and Learning for Massive Datasets |
4:30 - 5:00 | Steven Hillion MAD Analytics in Practice |
5:00 - 5:30 | Matthew Harding Outlier Detection in Financial Trading Networks |
5:30 - 6:00 | Neel Sundrahan Large Dataset Problems at the Long Tail |
Time | Talk |
---|---|
9:00 - 10:00 | Tutorial: Sebastiano Vigna Spectral Ranking |
10:00 - 10:30 | Robert Stine Streaming Feature Selection |
11:00 - 11:30 | Konstantin Mischaikow A Combinatorial Framework for Nonlinear Dynamics |
11:30 - 12:00 | Alfred Hero Sparse Correlation Screening in High Dimension |
12:00 - 12:30 | Susan Holmes Heterogeneous Data Challenge Combining Complex Data |
2:30 - 3:30 | Tutorial: Piotr Indyk Sparse Recovery Using Sparse Matrices |
3:30 - 4:00 | Sayan Mukherjee Efficient Dimension Reduction on Massive Data |
4:30 - 5:00 | Padhraic Smyth Statistical Modeling of Large-Scale Sensor Count Data |
5:00 - 5:30 | Ping Li Compressed Counting and Application in Estimating Entropy of Data Steams |
5:30 - 6:00 | Edo Liberty Scaleable Correlation Clustering Algorithms |
Time | Talk |
---|---|
9:00 - 10:00 | Tutorial: Petros Drineas Randomized Algorithms in Linear Algebra and Large Data Applications |
10:00 - 10:30 | Gunnar Martinsson Randomized methods for Computing the SVD/PCA of Very Large Matrices |
11:00 - 11:30 | Ilse Ipsen Numerical Reliability of Randomized Algorithms |
11:30 - 12:00 | Philippe Rigollet Optimal Rates of Sparse Esimation and Universal Aggregation |
12:00 - 12:30 | Alexandre d'Aspremont Subsampling, Spectral Methods & Semidefinite Programming |
2:30 - 3:00 | Gary Miller Specialized System Solvers for very large Systems: Theory and Practice |
3:00 - 3:30 | John Wright and Emmanuel Candes Robust Principal Component Analysis? |
3:30 - 4:00 | Alon Orlitsky Estimation, Prediction, and Classification over Large Alphabets |
4:30 - 5:00 | Ken Clarkson Numerical Linear Algebra in the Streaming Model |
5:00 - 5:30 | David Woodruff Fast Lp Regression in Data Streams |
Alekh Agarwal | University of California, Berkeley |
Deepak Agarwal | Yahoo! Research |
Alexandre d'Aspremont | Princeton University |
David Bader | Georgia Tech College of Computing |
Peter Bickel | University of California, Berkeley |
Emmanuel Candes | Stanford University |
Edward Chang | Google Research |
Ken Clarkson | IBM Almaden Research Center |
Jim Demmel | University of California, Berkeley |
John Duchi | University of California, Berkeley |
John Gilbert | University of California, Santa Barbara |
Matthew Harding | Stanford University |
Alfred Hero | University of Michigan, Ann Arbor |
Steven Hillion | Greenplum |
Susan Holmes | Stanford University |
Peter Indyk | Massachusetts Institute of Technology |
Ilse Ipsen | North Carolina State University |
Dmitri Krioukov | Cooperative Association for Internet Data Analysis |
John Langford | Yahoo! Research |
Jure Leskovec | Stanford University |
Ping Li | Cornell University |
Edo Liberty | Yahoo! Research |
Mauro Maggioni | Duke University |
Gunnar Martinsson | University of Colorado, Boulder |
Gary Miller | Carnegie Mellon University |
Konstantin Mischaikow | Rutgers University |
Mehryar Mohri | New York University |
Sayan Mukherjee | Duke University |
Peter Norvig | Google Research |
Alon Orlitsky | University of California, San Diego |
Ely Porat | Bar-Ilan University |
Guillermo Sapiro | University of Minnesota |
Padhraic Smyth | University of California, Irvine |
Ashok Srivastava | National Aeronautics and Space Administration |
Neel Sundaresan | eBay Research |
Robert Stine | University of Pennsylvania |
Sebastiano Vigna | Università Degli Studi Di Milano |
David Woodruff | IBM Almaden Research Center |
John Wright | Microsoft Research Asia |
Peter Bartlett | University of California, Berkeley |
Robert Calderbank | Princeton University |
Fan Chung | University of California, San Diego |
Yoram Singer | Google Research |
Patrick Wolfe | Harvard University |
MMDS 2008. Workshop on Algorithms for Modern Massive Data Sets, Stanford, CA, June 25–28, 2008.
MMDS 2006. Workshop on Algorithms for Modern Massive Data Sets, Stanford, CA, June 21–24, 2006.