Abstract is missing.
- On-line science: the world-wide telescope as a prototype for the new computational scienceJim Gray. 3 [doi]
- Statistical learning from relational dataDaphne Koller. 4 [doi]
- Analyzing customer behavior at Amazon.comAndreas S. Weigend. 5 [doi]
- Towards systematic design of distance functions for data mining applicationsCharu C. Aggarwal. 9-18 [doi]
- Generative model-based clustering of directional dataArindam Banerjee, Inderjit S. Dhillon, Joydeep Ghosh, Suvrit Sra. 19-28 [doi]
- Mining distance-based outliers in near linear time with randomization and a simple pruning ruleStephen D. Bay, Mark Schwabacher. 29-38 [doi]
- Adaptive duplicate detection using learnable string similarity measuresMikhail Bilenko, Raymond J. Mooney. 39-48 [doi]
- An iterative hypothesis-testing strategy for pattern discoveryRichard J. Bolton, Niall M. Adams. 49-58 [doi]
- Efficient data reduction with EASEHervé Brönnimann, Bin Chen, Manoranjan Dash, Peter J. Haas, Peter Scheuermann. 59-68 [doi]
- Extracting semantics from data cubes using cube transversals and closuresAlain Casali, Rosine Cicchetti, Lotfi Lakhal. 69-78 [doi]
- Translation-invariant mixture models for curve clusteringDarya Chudova, Scott Gaffney, Eric Mjolsness, Padhraic Smyth. 79-88 [doi]
- Information-theoretic co-clusteringInderjit S. Dhillon, Subramanyam Mallela, Dharmendra S. Modha. 89-98 [doi]
- SEWeP: using site semantics and a taxonomy to enhance the Web personalization processMagdalini Eirinaki, Michalis Vazirgiannis, Iraklis Varlamis. 99-108 [doi]
- Inverted matrix: efficient discovery of frequent items in large datasets in the context of interactive miningMohammad El-Hajj, Osmar R. Zaïane. 109-118 [doi]
- To buy or not to buy: mining airfare data to minimize ticket purchase priceOren Etzioni, Rattapoom Tuchinda, Craig A. Knoblock, Alexander Yates. 119-128 [doi]
- Fragments of orderAristides Gionis, Teija Kujala, Heikki Mannila. 129-136 [doi]
- Maximizing the spread of influence through a social networkDavid Kempe, Jon M. Kleinberg, Éva Tardos. 137-146 [doi]
- PROXIMUS: a framework for analyzing very high dimensional discrete-attributed datasetsMehmet Koyutürk, Ananth Grama. 147-156 [doi]
- Visualizing changes in the structure of data for exploratory feature selectionElias Pampalk, Werner Goebl, Gerhard Widmer. 157-166 [doi]
- Aggregation-based feature invention and relational concept classesClaudia Perlich, Foster J. Provost. 167-176 [doi]
- Cross-training: learning probabilistic mappings between topicsSunita Sarawagi, Soumen Chakrabarti, Shantanu Godbole. 177-186 [doi]
- Generating English summaries of time series data using the Gricean maximsSomayajulu Sripada, Ehud Reiter, Jim Hunter, Jin Yu. 187-196 [doi]
- Assessment and pruning of hierarchical model based clusteringJeremy Tantrum, Alejandro Murua, Werner Stuetzle. 197-205 [doi]
- Privacy-preserving ::::k::::-means clustering over vertically partitioned dataJaideep Vaidya, Chris Clifton. 206-215 [doi]
- Indexing multi-dimensional time-series with support for multiple distance measuresMichail Vlachos, Marios Hadjieleftheriou, Dimitrios Gunopulos, Eamonn J. Keogh. 216-225 [doi]
- Mining concept-drifting data streams using ensemble classifiersHaixun Wang, Wei Fan, Philip S. Yu, Jiawei Han. 226-235 [doi]
- CLOSET+: searching for the best strategies for mining frequent closed itemsetsJianyong Wang, Jiawei Han, Jian Pei. 236-245 [doi]
- Mining unexpected rules by pushing user dynamicsKe Wang, Yuelong Jiang, Laks V. S. Lakshmanan. 246-255 [doi]
- On detecting differences between groupsGeoffrey I. Webb, Shane M. Butler, Douglas A. Newlands. 256-265 [doi]
- Algorithms for estimating relative importance in networksScott White, Padhraic Smyth. 266-275 [doi]
- Screening and interpreting multi-item associations based on log-linear modelingXintao Wu, Daniel Barbará, Yong Ye. 276-285 [doi]
- CloseGraph: mining closed frequent graph patternsXifeng Yan, Jiawei Han. 286-295 [doi]
- Eliminating noisy information in Web pages for data miningLan Yi, Bing Liu, Xiaoli Li. 296-305 [doi]
- Classifying large data sets using SVMs with hierarchical clustersHwanjo Yu, Jiong Yang, Jiawei Han. 306-315 [doi]
- XRules: an effective structural classifier for XML dataMohammed Javeed Zaki, Charu C. Aggarwal. 316-325 [doi]
- Fast vertical mining using diffsetsMohammed Javeed Zaki, Karam Gouda. 326-335 [doi]
- Efficient elastic burst detection in data streamsYunyue Zhu, Dennis Shasha. 336-345 [doi]
- Golden Path Analyzer: using divide-and-conquer to cluster Web clickstreamsKamal Ali, Steven P. Ketchpel. 349-358 [doi]
- Empirical Bayesian data mining for discovering patterns in post-marketing drug safetyDavid M. Fram, June S. Almenoff, William DuMouchel. 359-368 [doi]
- Mining hepatitis data with temporal abstractionTu Bao Ho, Trong Dung Nguyen, Saori Kawasaki, Si Quang Le, DucDung Nguyen, Hideto Yokoi, Katsuhiko Takabayashi. 369-377 [doi]
- Information awareness: a prospective technical assessmentDavid Jensen, Matthew J. Rattigan, Hannah Blau. 378-387 [doi]
- The data mining approach to automated software testingMark Last, Menahem Friedman, Abraham Kandel. 388-396 [doi]
- Passenger-based predictive modeling of airline no-show ratesRichard D. Lawrence, Se June Hong, Jacques Cherrier. 397-406 [doi]
- Capturing best practice for microarray gene expression data analysisGregory Piatetsky-Shapiro, Tom Khabaza, Sridhar Ramaswamy. 407-415 [doi]
- Clinical and financial outcomes analysis with existing hospital patient recordsR. Bharat Rao, Sathyakama Sandilya, Radu Stefan Niculescu, Colin Germond, Harsha Rao. 416-425 [doi]
- Critical event prediction for proactive management in large-scale computer clustersRamendra K. Sahoo, Adam J. Oliner, Irina Rish, Manish Gupta, José E. Moreira, Sheng Ma, Ricardo Vilalta, Anand Sivasubramaniam. 426-435 [doi]
- Frequent-subsequence-based prediction of outer membrane proteinsRong She, Fei Chen 0002, Ke Wang, Martin Ester, Jennifer L. Gardy, Fiona S. L. Brinkman. 436-445 [doi]
- Discovery of climate indices using clusteringMichael Steinbach, Pang-Ning Tan, Vipin Kumar, Steven A. Klooster, Christopher Potter. 446-455 [doi]
- Knowledge-based data miningSholom M. Weiss, Stephen J. Buckley, Shubir Kapoor, Søren Damgaard. 456-461 [doi]
- The anatomy of a multimodal information filterYi-Leh Wu, Kingshy Goh, Beitao Li, Huaxin You, Edward Y. Chang. 462-471 [doi]
- Style mining of electronic messages for multiple authorship discrimination: first resultsShlomo Argamon, Marin Saric, Sterling Stuart Stein. 475-480 [doi]
- Mining high dimensional data for classifier knowledgeRaj Bhatnagar, Goutham Kurra, Wen Niu. 481-486 [doi]
- Finding recent frequent itemsets adaptively over online data streamsJoong Hyuk Chang, Won Suk Lee. 487-492 [doi]
- Probabilistic discovery of time series motifsBill Yuan-chi Chiu, Eamonn J. Keogh, Stefano Lonardi. 493-498 [doi]
- Understanding captions in biomedical publicationsWilliam W. Cohen, Richard C. Wang, Robert F. Murphy. 499-504 [doi]
- Using randomized response techniques for privacy-preserving data miningWenliang Du, Zhijun Zhan. 505-510 [doi]
- Applications of sampling and fractional factorial designs to model-free data squashingWilliam DuMouchel, Deepak K. Agarwal. 511-516 [doi]
- Experiments with random projections for machine learningDmitriy Fradkin, David Madigan. 517-522 [doi]
- Accurate decision trees for mining high-speed data streamsJoão Gama, Ricardo Rocha, Pedro Medas. 523-528 [doi]
- Correlating synchronous and asynchronous data streamsSudipto Guha, Dimitrios Gunopulos, Nick Koudas. 529-534 [doi]
- A Web page prediction model based on click-stream tree representation of user behaviorSule Gündüz, M. Tamer Özsu. 535-540 [doi]
- Natural communities in large linked networksJohn E. Hopcroft, Omar Khan, Brian Kulis, Bart Selman. 541-546 [doi]
- Navigating massive data sets via local clusteringMichael E. Houle. 547-552 [doi]
- Mining viewpoint patterns in image databasesWynne Hsu, Jing Dai, Mong-Li Lee. 553-558 [doi]
- Playing hide-and-seek with correlationsChris Jermaine. 559-564 [doi]
- Interactive exploration of coherent patterns in time-series gene expression dataDaxin Jiang, Jian Pei, Aidong Zhang. 565-570 [doi]
- Efficient decision tree construction on streaming dataRuoming Jin, Gagan Agrawal. 571-576 [doi]
- A bag of paths model for measuring structural similarity in Web documentsSachindra Joshi, Neeraj Agrawal, Raghu Krishnapuram, Sumit Negi. 577-582 [doi]
- Nantonac collaborative filtering: recommendation based on order responsesToshihiro Kamishima. 583-588 [doi]
- A two-way visualization method for clustered dataYehuda Koren, David Harel. 589-594 [doi]
- Empirical comparisons of various voting methods in baggingKelvin T. Leung, Douglas Stott Parker Jr.. 595-600 [doi]
- Mining data records in Web pagesBing Liu, Robert L. Grossman, Yanhong Zhai. 601-606 [doi]
- On computing, storing and querying frequent patternsGuimei Liu, Hongjun Lu, Wenwu Lou, Jeffrey Xu Yu. 607-612 [doi]
- Online novelty detection on temporal sequencesJunshui Ma, Simon Perkins. 613-618 [doi]
- Distributed cooperative mining for information consortiaSatoshi Morinaga, Kenji Yamanishi, Jun-ichi Takeuchi. 619-624 [doi]
- Learning relational probability treesJennifer Neville, David Jensen, Lisa Friedland, Michael Hay. 625-630 [doi]
- Graph-based anomaly detectionCaleb C. Noble, Diane J. Cook. 631-636 [doi]
- Carpenter: finding closed patterns in long biological datasetsFeng Pan, Gao Cong, Anthony K. H. Tung, Jiong Yang, Mohammed Javeed Zaki. 637-642 [doi]
- New unsupervised clustering algorithm for large datasetsWilliam Peter, John Chiochetti, Clare Giardina. 643-648 [doi]
- Improving spatial locality of programs via data miningKarlton Sequeira, Mohammed Javeed Zaki, Boleslaw K. Szymanski, Christopher D. Carothers. 649-654 [doi]
- Mining phenotypes and informative genes from gene expression dataChun Tang, Aidong Zhang, Jian Pei. 655-660 [doi]
- Weighted Association Rule Mining using weighted support and significance frameworkFeng Tao, Fionn Murtagh, Mohsen Farid. 661-666 [doi]
- PaintingClass: interactive construction, visualization and exploration of decision treesSoon Tee Teoh, Kwan-Liu Ma. 667-672 [doi]
- Time and sample efficient discovery of Markov blankets and direct causal relationsIoannis Tsamardinos, Constantin F. Aliferis, Alexander R. Statnikov. 673-678 [doi]
- Distributed multivariate regression based on influential observationsHang Yu, Ee-Chien Chang. 679-684 [doi]
- Efficiently handling feature redundancy in high-dimensional dataLei Yu, Huan Liu. 685-690 [doi]
- An adaptive nearest neighbor search for a parts acquisition ePortalRafael Alonso, Jeffrey A. Bloom, Hua Li, Chumki Basu. 693-698 [doi]
- Architecting a knowledge discovery engine for military commanders utilizing massive runs of simulationsPhilip S. Barry, Jianping Zhang, Mary McDonald. 699-704 [doi]
- Data quality through knowledge engineeringTamraparni Dasu, Gregg T. Vesonder, Jon R. Wright. 705-710 [doi]
- Similarity analysis on government regulationsGloria T. Lau, Kincho H. Law, Gio Wiederhold. 711-716 [doi]
- Experimental design for solicitation campaignsUwe F. Mayer, Armand Sarkissian. 717-722 [doi]
- Towards NIC-based intrusion detectionMatthew Eric Otey, Srinivasan Parthasarathy, Amol Ghoting, G. Li, Sundeep Narravula, Dhabaleswar K. Panda. 723-728 [doi]
- Data-driven validation, completion and construction of event relationship networksChang-Shing Perng, David Thoenen, Genady Grabarnik, Sheng Ma, Joseph L. Hellerstein. 729-734 [doi]
- Visualizing concept driftKevin B. Pratt, Gleb Tschapek. 735-740 [doi]
- Experimental study of discovering essential information from customer inquiryKeiko Shimazu, Atsuhito Momma, Koichi Furukawa. 741-746 [doi]
- Applying data mining in investigating money laundering crimesZhongfei (Mark) Zhang, John J. Salerno, Philip S. Yu. 747-752 [doi]