PUBLICATIONS

·  2009

  • S. Datta. C. Giannella, H. Kargupta, "Approximate Distributed K-Means Clustering Over a Peer-to-Peer Network", IEEE Transactions on Knowledge and Data Engineering, to appear.
  • K. Das, K. Bhaduri, S. Arora, W. Griffin, K. Borne, C. Giannella, H. Kargupta, "Scalable Distributed Change Detection From Astronomy Streams Using Local, Asynchronous Eigen Monitoring Algorithms", SIAM Conference on Data Mining (SDM), 2009.

·  2008

  • K. Bhaduri, R. Wolff, C. Giannella, H. Kargupta, "Distributed Decision Tree Induction in Peer-to-Peer Systems", Statistical Analysis and Data Mining , 1(2), 2008.
  • K. Liu, C. Giannella, H. Kargupta, "A Survey of Attack Techniques on Privacy-Preserving Data Perturbation Methods", Chapter in "Privacy-Preserving Data Mining: Models and Algorithms.", Series: Advances in Database Systems vol. 34, Springer, 2008.

·  2007

  • H. Dutta, C. Giannella, K. Borne, H. Kargupta, "Distributed Top-k Outlier Detection from Astronomy Catalogs using the DEMAC System", Proceedings of the SIAM Conference on Data Mining (SDM'07), 2007.

·  2006

  • S. Datta, K. Bhaduri, C. Giannella, R. Wolff, H. Kargupta, "Distributed Data Mining in Peer-to-Peer Networks", IEEE Internet Computing , pages 18-26, July/August 2006.
  • K. Liu, C. Giannella, H. Kargupta, "An Attacker's View of Distance Preserving Maps For Privacy Preserving Data Mining", Proceedings of the 10th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD), 2006. Lecture Notes in Computer Science, volume 4213, pages 297-308.
  • J. Branch, C. Giannella, B. Szymanski, R. Wolff, H. Kargupta, "In-Network Outlier Detection in Wireless Sensor Networks", Proceedings of the 26th International Conference on Distributed Computing Systems (ICDCS), 2006.
  • C. Giannella, H. Dutta, S. Mukherjee, and H. Kargupta, "Efficient Kernel Density Estimation Over Distributed Data", Proceedings of 9th International Workshop on High Performance and Distributed Mining, as part of the SIAM International Conference on Data Mining (SDM), 2006.
  • C. Giannella, H. Dutta, K. Borne, R. Wolff, and H. Kargupta, "Distributed Data Mining for Astronomy Catalogs", Proceedings of 9th Workshop on Mining Scientific and Engineering Datasets, as part of the SIAM International Conference on Data Mining (SDM), 2006.
  • S. Datta, C. Giannella, and H. Kargupta, "K-Means Clustering over a Large, Dynamic Network", Proceedings of the SIAM Conference on Data Mining (SDM'06), 2006.
  • S. Bandyopadhyay, C. Giannella, U. Maulik, H. Kargupta, K. Liu, and S. Datta. "Clustering Distributed Data Streams in Peer-to-Peer Environments", Information Sciences , 176(14), 1952-1985,2006.

·  2005

  • J. da Silva, C. Giannella, R. Bhargava, H. Kargupta, M. Klusch, "Distributed Data Mining and Agents", Engineering Applications of Artificial Intelligence , 18, 791-807, 2005.
  • C. Giannella, B. Sayrafi, “An Information Theoretic Histogram for One-Dimensional Selectivity Estimation”, Proceedings of the ACM Symposium on Applied Computing (ACM SAC 2005) DTTA track (short paper).  Extended version: Technical Report 584, Computer Science Department, Indiana University, download: www.cs.indiana.edu/ftp/techreports/index.html.

·  2004

·  2003

·  2002 and earlier

SOFTWARE AND OTHER LINKS

·  Market-basket synthetic data generator (from IBM Quest), here.

·  Research presentation on data mining over a large, dynamic network (power-point slides).

·  Research presentation on privacy preservng data mining -- Euclidean Distance Preserving Data Perturbation (power-point slides).

·  High Support Itemset Presentation (Short) .