Research Seminar (Fall 2002):

Seminar Schedule (tentative)

 

October 13Yossi Matias – Introduction

 

October 20Yossi Matias – Introduction (cont.)

 

October 27Boris Litvin – Online Profiling:

 

·        Efficient and flexible Value Sampling, M. Burrows, U. Erlingson, S.-T. Leung, M.T. Vandevoorde, C.A. Waldspurger, K. Walker, W.E. Weihl, Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2000

·        Rapid Profiling via Stratified Sampling, S.S. Sastry, R. Bodk, and J. Smith, International Symposium on Computer Architecture (ISCA), 2001

·        Online subpath profiling, D. Oren, Y. Matias, M. Sagiv, International Conference on Compiler Construction (CC), 2002

 

November 3 Iftach Ragoler – Sensor Networks

 

·        TAG: a Tiny Aggregation Service for Ad-Hoc Sensor Networks, Samuel Madden, Michael Franklin, Joseph Hellerstein, and Wei Hong, OSDI 2002

·        Supporting Aggregate Queries Over Ad-Hoc Wireless Sensor Networks, Sam Madden, Robert Szewczyk, Michael Franklin, and David Culler, 4th IEEE Workshop on Mobile Computing Systems & Applications, June 2002.

·        Directed diffusion: A scalable and robust communication paradigm for sensor networks, Chalermek Intanagonwiwat, Ramesh Govindan and Deborah Estrin, Proceedings of the Sixth Annual International Conference on Mobile Computing and Networking (MobiCOM '00), August 2000

 

November 10Michael Furman – List Traversal Synopses

 

·        List traversal synopsis – with applications, Y. Matias and E. Porat.

 

November 17Michael Berezansky – Clustering

 

·        S-Tree: self-organizing three for data clustering and online vector quantization, Marcos M. Campos, Gail A. Carpenter, Neural Networks 14(2001) 505-525

·        Fast Hierarchical Clustering and Other Applications of Dynamic Closest Pairs , David Epstein, SODA 1998 

·        Rock: A robust clustering algorithm for categorical attributes, Guha Sudipto, Rastogi Rajeev, and Shim Kyuseok, Proceedings of the IEEE International Conference on Data Engineering, Sydney, March 1999, March 1999

·        Cluster Validity Methods: Part I, M. Halkidi, Y. Batistakis, M. Vazirgiannis, SIGMOD Record 31(2), 40-45

 

November 24Saar Cohen – Fast filtering and lookup on streaming data

 

·        Computing Iceberg Queries Efficiently, Fang, Min; Shivakumar, Narayanan; Garcia-Molina, Hector; Motwani, Rajeev; Ullman, Jeffrey D., International Conference on Very Large Databases (VLDB'98), New York, August 1998

·        New Directions in Traffic Measurement and Accounting, Christian Estan and George Varghese, SIGCOMM 2002.

 

December 1 – Hanuka – no seminar

 

December 8 - Leon Portman – Wavelet synopses

 

·        Wavelet Synopses with Error Guarantees, Minos Garofalakis and Phillip B. Gibbons. Proceedings of ACM SIGMOD'2002, Madison, Wisconsin, June 2002, pp. 476-487.

·        Workload-based Wavelet Synopses, Yossi Matias and Leon Portman.

 

December 15 – Natasha Kreimer – XML synopses

 

·        Structure and Value Synopses for XML Data Graphs, N. Polyzotis and M. Garofalakis,
Proceedings of the 28th VLDB Conference, Hong Kong, China, 2002

·        XPathLearner: An On-Line Self-Tuning Markov Histogram for XML Path Selectivity Estimation,
L.Lim , M.Wang , S.Padmanabhan, J.Scott Vitter, R. Parr, Proceedings of the 28th VLDB Conference, Hong Kong, China, 2002

·        StatiX: Making XML Count , Juliana Freire ,Jayant R. Haritsa, Maya Ramanath, Prasan Roy,
Jerome Simeon , ACM SIGMOD 2002 June 4-6, Madison, Wisconsin, USA
          

 

December 22 – Roi Barkan – Frequent Items in data streams

 

·        A Simple Algorithm for Finding Frequent Elements in Streams and Bags, R. M. Karp, C. H. Papadimitriou, S. Shenker 

·        Finding Frequent Items in Data Streams, M. Charikar, K. Chen, M. Farach-Colton, In Proceedings of the 29th International Colloquium on Automata Languages and Programming (ICALP), 2002.

·        Approximate Frequency Counts over Data Streams, Gurmeet Singh Manku, Rajeev Motwani, In VLDB 2002.

 

January 5 – Anat Eyal - Object Replication in Data Grids

 

·        An introduction to data acquisition in High Energy Physics (HEP) experiments, specifically at DESY

·        Object replication architecture in the CERN grid project

·        Data Management in an International Data Grid Project,Wolfgang Hoschek, Javier Jaen-Martinez, Asad Samar, Heinz Stockinger, Kurt Stockinger, , IEEE/ACM International Workshop on Grid Computing Grid'2000 - 17-20 December 2000 Bangalore, India "Distinguished Paper" Award

·        File and Object Replication in Data Grids, Heinz Stockinger, Asad Samar, Bill Allcock, Ian Foster, Koen Holtman, Brian Tierney, , 10th IEEE International Symposium on High Performance Distributed Computing (HPDC 2001), San Francisco, California, August 7-9, 2001