Sciweavers

88 search results - page 4 / 18
» Extensive Evaluation of Efficient NLP-Driven Text Classifica...
Sort
View
148
Voted
IMCSIT
2010
15 years 4 months ago
Semi-Automatic Extension of Morphological Lexica
Abstract--We present a tool that facilitates the efficient extension of morphological lexica. The tool exploits information from a morphological lexicon, a morphological grammar an...
Tobias Kaufmann, Beat Pfister
195
Voted
KDD
2006
ACM
179views Data Mining» more  KDD 2006»
16 years 6 months ago
Extracting key-substring-group features for text classification
In many text classification applications, it is appealing to take every document as a string of characters rather than a bag of words. Previous research studies in this area mostl...
Dell Zhang, Wee Sun Lee
173
Voted
BTW
2007
Springer
153views Database» more  BTW 2007»
15 years 10 months ago
Efficient Time-Travel on Versioned Text Collections
: The availability of versioned text collections such as the Internet Archive opens up opportunities for time-aware exploration of their contents. In this paper, we propose time-tr...
Klaus Berberich, Srikanta J. Bedathur, Gerhard Wei...
163
Voted
CIKM
2009
Springer
15 years 10 months ago
Improving binary classification on text problems using differential word features
We describe an efficient technique to weigh word-based features in binary classification tasks and show that it significantly improves classification accuracy on a range of proble...
Justin Martineau, Tim Finin, Anupam Joshi, Shamit ...
148
Voted
CIKM
2008
Springer
15 years 8 months ago
Error-driven generalist+experts (edge): a multi-stage ensemble framework for text categorization
We introduce a multi-stage ensemble framework, ErrorDriven Generalist+Expert or Edge, for improved classification on large-scale text categorization problems. Edge first trains a ...
Jian Huang 0002, Omid Madani, C. Lee Giles