My Data Mining Page
Papers
- Sagi Shporer, AIM2: Improved Implementation of AIM,
Workshop on Frequent Itemset Mining Implementations, (with ICDM'04), Brighton, UK, November 2004. (PDF)
- Amos Fiat and Sagi Shporer, AIM-F: Another Itemset Miner,
Workshop on Frequent Itemset Mining Implementations, (with ICDM'03), Melbourne, FL, November 2003. (PDF)
- Sagi Shporer, M.sc thesis, Extending the Order Preserving Submatrix: New patterns in datasets, Tel-Aviv University, 2004 (PDF)
My Software
- AIM - Frequent
itemset miner.
- OPSM-G : OPSM Miner ( Source & Executable ). Writen in C# compatable for Windows with .NET Framework 1.0 and above.
Datasets
All non-biological datasets can be found at FIMI workshop page . Biological dataset can be found in many
places but not in a standard format. List below are the dataset in the format suggested in the FIMI workshop.
- Breast cancer - Hedenfalk et. al (2001), Gene-expression profiles in hereditary breast cancer, N Engl J Med 2001 Feb 22;344(8):539-48
- (Project homepage)
- Colon cancer - Alon et al. (1999),
Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays
, Proc. Natl. Acad. Sci. USA. 96 (1999) 6745-6750. (Lab homepage)
- MLL Leukemia - Armstrong et al (2002), MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia, Nature Genetics 30, pp 41 - 47 (2002) (Homepage)
- AML-ALL Leukemia - Golub et al. (1999), Molecular classification of cancer: class discovery and class prediction by gene expression monitoring , Science 1999 286:531-537. (Homepage)
- Central Nervous System Embryonal Tumor - Pomeroy et al. (2002), Prediction of Central Nervous System Embryonal Tumour Outcome Based on Gene Expression, Nature (Letters to Nature), Vol. 415, 436-442, 2002 (Center homepage)
Related Links
General purpose datasets:
Gene Expression datasets repositories:
Algorithms & Source Code
- FIMI Homepage - The FIMI workshop homepage. Contains source code for many
state-of-the-art frequent itemset mining algorithms.