default search action
5th SDM 2005: Newport Beach, California, USA
- Hillol Kargupta, Jaideep Srivastava, Chandrika Kamath, Arnold Goodman:
Proceedings of the 2005 SIAM International Conference on Data Mining, SDM 2005, Newport Beach, CA, USA, April 21-23, 2005. SIAM 2005, ISBN 978-0-89871-593-4
Statistics in Data Mining
- Sijin Liu, Xiaotong Shen, Wing Hung Wong:
Computational Developments of ψ-learning. 1-11 - Matthew Brand:
A Random Walks Perspective on Maximizing Satisfaction and Profit. 12-19 - Ronald K. Pearson:
Surveying Data for Patchy Structure. 20-31 - Chris H. Q. Ding, Jieping Ye:
2-Dimensional Singular Value Decomposition for 2D Maps and Images. 32-43
Stream Data Mining
- Graham Cormode, S. Muthukrishnan:
Summarizing and Mining Skewed Data Streams. 44-55 - Charu C. Aggarwal, Philip S. Yu:
Online Analysis of Community Evolution in Data Streams. 56-67 - Chih-Hsiang Lin, Ding-Ying Chiu, Yi-Hung Wu, Arbee L. P. Chen:
Mining Frequent Itemsets from Data Streams with a Time-Sensitive Sliding Window. 68-79 - Charu C. Aggarwal:
On Abnormality Detection in Spuriously Populated Data Streams. 80-91
Privacy Preserving Data Mining
- Zhiqiang Yang, Sheng Zhong, Rebecca N. Wright:
Privacy-Preserving Classification of Customer Data without Loss of Accuracy. 92-102 - Xintao Wu, Ying Wu, Yongge Wang, Yingjiu Li:
Privacy Aware Market Basket Data Set Generation: A Feasible Approach for Inverse Frequent Set Mining. 103-114 - Charu C. Aggarwal, Philip S. Yu:
On Variable Constraints in Privacy Preserving Data Mining. 115-125
Clustering
- David Gondek, Shivakumar Vaithyanathan, Ashutosh Garg:
Clustering with Model-level Constraints. 126-137 - Ian Davidson, S. S. Ravi:
Clustering with Constraints: Feasibility Issues and the k-Means Algorithm. 138-149 - Yu Xia, Jiming Peng:
A Cutting Algorithm for the Minimum Sum-of-Squared Error Clustering. 150-160
Scientific Data Mining
- Sameep Mehta, Steve Barr, Tat-Sang Choy, Hui Yang, Srinivasan Parthasarathy, Raghu Machiraju, John Wilkins:
Dynamic Classification of Defect Structures in Molecular Dynamics Simulation Data. 161-172 - Bavani Arunasalam, Sanjay Chawla, Pei Sun:
Striking Two Birds With One Stone: Simultaneous Mining of Positive and Negative Spatial Patterns. 173-182 - Ata Kabán, Louisa Nolan, Somak Raychaudhury:
Finding Young Stellar Populations in Elliptical Galaxies from Independent Components of Optical Spectra. 183-194
Classifiers and Ensembles
- Qinghua Hu, Daren Yu, Zongxia Xie:
Hybrid Attribute Reduction for Classification Based on A Fuzzy Rough Set Technique. 195-204 - Jianyong Wang, George Karypis:
HARMONY: Efficiently Mining the Best Rules for Classification. 205-216 - Carlotta Domeniconi, Bojun Yan:
On Error Correlation and Accuracy of Nearest Neighbor Ensemble Classifiers. 217-226
Association Rules and Database Issues
- Yiqiu Han, Wai Lam:
Lazy Learning for Classification Based on Query Projections. 227-238 - Bart Goethals, Juho Muhonen, Hannu Toivonen:
Mining Non-Derivable Association Rules. 239-249 - Toon Calders, Bart Goethals:
Depth-First Non-Derivable Itemset Mining. 250-261 - Dmitri V. Kalashnikov, Sharad Mehrotra, Zhaoqi Chen:
Exploiting Relationships for Domain-Independent Data Cleaning. 262-273
Graphs and Graphical Models
- Scott White, Padhraic Smyth:
A Spectral Clustering Approach To Finding Communities in Graph. 274-285 - Chao Liu, Xifeng Yan, Hwanjo Yu, Jiawei Han, Philip S. Yu:
Mining Behavior Graphs for "Backtrace" of Noncrashing Bugs. 286-297 - Tak-Lam Wong, Wai Lam:
Learning to Refine Ontology for a New Web Site Using a Bayesian Approach. 298-309 - Radu Stefan Niculescu, Tom M. Mitchell, R. Bharat Rao:
Exploiting Parameter Related Domain Knowledge for Learning in Graphical Models. 310-321
SVM and Classification
- Navneet Panda, Edward Y. Chang:
Exploiting Geometry for Support Vector Machine Indexing. 322-333 - Shibin Qiu, Terran Lane:
Parallel Computation of RBF Kernels for Support Vector Classifiers. 334-345 - Yun Chi, Philip S. Yu, Haixun Wang, Richard R. Muntz:
Loadstar: A Load Shedding Scheme for Classifying Data Streams. 346-357
Complex Data Types: Text, Images, and Sequences
- Ying Zhao, George Karypis:
Topic-driven Clustering for Document Datasets. 358-369 - Tomás Singliar, Milos Hauskrecht:
Variational Learning for Noisy-OR Component Analysis. 370-379 - Gemma Casas-Garriga:
Summarizing Sequential Data with Closed Partial Orders. 380-391
Statistics in Data Mining
- Kwok Pan Pang:
SUMSRM: A New Statistic for the Structural Break Detection in Time Series. 392-403 - Robert Gwadera, Mikhail J. Atallah, Wojciech Szpankowski:
Markov Models for Identification of Significant Episodes. 404-414 - Congnan Luo, Soon Myoung Chung:
Efficient Mining of Maximal Sequential Patterns Using Multiple Samples. 415-426
Scientific Data Mining
- Naren Ramakrishnan, Chris Bailey-Kellogg, Satish Tadepalli, Varun Pandey:
Gaussian Processes for Active Data Mining of Spatial Aggregates. 427-438 - Xiaoli Zhang Fern, Carla E. Brodley, Mark A. Friedl:
Correlation Clustering for Learning Mixtures of Canonical Correlation Models. 439-448 - Michail Vlachos, Philip S. Yu, Vittorio Castelli:
On Periodicity Detection and Structural Periodic Similarity. 449-460
Poster Papers
- Jian Pei, Moonjung Cho, David Wai-Lok Cheung:
Cross Table Cubing: Mining Iceberg Cubes from Data Warehouses. 461-465 - Amir Bar-Or, Ran Wolff, Assaf Schuster, Daniel Keren:
Decision Tree Induction in High Dimensional, Hierarchically Distributed Databases. 466-470 - Daniel Lemire, Anna Maclachlan:
Slope One Predictors for Online Rating-Based Collaborative Filtering. 471-475 - Murat Dundar, Glenn Fung, Jinbo Bi, Sathyakama Sandilya, R. Bharat Rao:
Sparse Fisher Discriminant Analysis for Computer Aided Detection. 476-480 - Hamad Alhammady, Kotagiri Ramamohanarao:
Expanding the Training Data Space Using Emerging Patterns and Genetic Methods. 481-485 - Wei Fan, Janek Mathuria, Chang-Tien Lu:
Making Data Mining Models Useful to Model Non-paying Customers of Exchange Carriers. 486-490 - Shuting Xu, Jun Zhang:
Matrix Condition Number Prediction with SVM Regression and Feature Selection. 491-495 - Reda Alhajj:
Cluster Validity Analysis of Alternative Results from Multi-Objective Optimization. 496-500 - Kuo-Yu Huang, Chia-Hui Chang, Kuo-Zui Lin:
ClosedPROWL: Efficient Mining of Closed Frequent Continuities by Projected Window List Technology. 501-505 - Chotirat (Ann) Ratanamahatana, Eamonn J. Keogh:
Three Myths about Dynamic Time Warping Data Mining. 506-510 - Effrosini Kokiopoulou, Yousef Saad:
PCA without eigenvalue calculations: a case study on face recognition. 511-515 - Raymond Chi-Wing Wong, Ada Wai-Chee Fu:
Mining Top-K Itemsets over a Sliding Window Based on Zipfian Distribution. 516-520 - Tao Li:
Hierarchical Document Classification Using Automatically Generated Hierarchy. 521-525 - Tao Li:
On Clustering Binary Data. 526-530 - Nitin Kumar, Venkata Nishanth Lolla, Eamonn J. Keogh, Stefano Lonardi, Chotirat (Ann) Ratanamahatana:
Time-series Bitmaps: a Practical Visualization Tool for Working with Large Time Series Databases. 531-535 - Rong She, Ke Wang, Yabo Xu, Philip S. Yu:
Pushing Feature Selection Ahead Of Join. 536-540 - Shiying Huang, Geoffrey I. Webb:
Discarding Insignificant Rules during Impact Rule Discovery in Large, Dense Databases. 541-545 - Himika Biswas, Somnath Pal:
SPID4.7: Discretization Using Successive Pseudo Deletion at Maximum Information Gain Boundary Points. 546-550 - Zheng Sun, Philip S. Yu, Xiang-Yang Li:
Iterative Mining for Rules with Constrained Antecedents. 551-555 - Al Mamunur Rashid, George Karypis, John Riedl:
Influence in Ratings-Based Recommender Systems: An Algorithm-Independent Approach. 556-560 - Gianluigi Greco, Antonella Guzzo, Giuseppe Manco, Domenico Saccà:
Mining Unconnected Patterns in Workflows. 561-565 - Bharath Kumar Mohan:
The Best Nurturers in Computer Science Research. 566-570 - Tsuyoshi Idé, Keisuke Inoue:
Knowledge Discovery from Heterogeneous Dynamic Systems using Change-Point Correlations. 571-575 - Ke Wang, Yabo Xu, Philip S. Yu, Rong She:
Building Decision Trees on Records Linked through Key References. 576-580 - Giuliano Tirenni, Abderrahim Labbi, André Elisseeff, Cesar Berrospi:
Efficient Allocation of Marketing Resources using Dynamic Programming. 581-585 - Haixun Wang, Chang-Shing Perng, Philip S. Yu:
Near-Neighbor Search in Pattern Distance Spaces. 586-590 - Haiyun Bian, Raj Bhatnagar:
An Algorithm for Well Structured Subspace Clusters. 591-595 - Vincent Shin-Mu Tseng, Chao-Hui Lee:
CBS: A New Classification Method by Using Sequential Patterns. 596-600 - Hong Cheng, Xifeng Yan, Jiawei Han:
SeqIndex: Indexing Sequences by Sequential Pattern Analysis. 601-605 - Chris H. Q. Ding, Xiaofeng He:
On the Equivalence of Nonnegative Matrix Factorization and Spectral Clustering. 606-610 - Gang Wu, Zhihua Zhang, Edward Y. Chang:
Kronecker Factorization for Speeding up Kernel Machines. 611-615 - Feng Kang, Rong Jin:
Symmetric Statistical Translation Models for Automatic Image Annotation. 616-620 - Kang Peng, Slobodan Vucetic, Zoran Obradovic:
Correcting Sampling Bias in Structural Genomics through Iterative Selection of Underrepresented Targets. 621-625 - Alina Beygelzimer, Emre Erdogan, Sheng Ma, Irina Rish:
Statictical Models for Unequally Spaced Time Series. 626-630 - Efstratios Gallopoulos, Dimitrios Zeimpekis:
CLSI: A Flexible Approximation Scheme from Clustered Term-Document Matrices. 631-635 - Unil Yun, John J. Leggett:
WFIM: Weighted Frequent Itemset Mining with a weight range and a minimum weight. 636-640 - Martin H. C. Law, Alexander P. Topchy, Anil K. Jain:
Model-based Clustering With Probabilistic Constraints. 641-645
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.