Sciweavers

1863 search results - page 338 / 373
» A supervised learning approach for imbalanced data sets
WWW
2006
ACM
16 years 7 months ago
Interactive wrapper generation with minimal user effort
While much of the data on the web is unstructured in nature, there is also a significant amount of embedded structured data, such as product information on e-commerce sites or sto...
Utku Irmak, Torsten Suel
CCR
2006
15 years 6 months ago
Secure distributed data-mining and its application to large-scale network measurements
The rapid growth of the Internet over the last decade has been startling. However, efforts to track its growth have often fallen afoul of bad data -- for instance, how much traffi...
Matthew Roughan, Yin Zhang
KDD
2007
ACM
Data Mining
16 years 7 months ago
Automatic labeling of multinomial topic models
Multinomial distributions over words are frequently used to model topics in text collections. A common, major challenge in applying all such topic models to any text mining proble...
Qiaozhu Mei, Xuehua Shen, ChengXiang Zhai
ICDM
2005
IEEE
Data Mining
16 years 6 days ago
Hierarchy-Regularized Latent Semantic Indexing
Organizing textual documents into a hierarchical taxonomy is a common practice in knowledge management. Besides textual features, the hierarchical structure of directories reflect...
Yi Huang, Kai Yu, Matthias Schubert, Shipeng Yu, V...
SAC
2010
ACM
15 years 1 month ago
A study on interestingness measures for associative classifiers
Associative classification is a rule-based approach to classify data relying on association rule mining by discovering associations between a set of features and a class label. Su...
Mojdeh Jalali Heravi, Osmar R. Zaïane