Sciweavers

» Data challenges at Yahoo!
EDBT
2009
ACM
15 years 4 months ago
Fair, effective, efficient and differentiated scheduling in an enterprise data warehouse
A typical online Business Intelligence (BI) workload consists of a combination of short, less intensive queries, along with long, resource intensive queries. As such, the longest ...
Chetan Gupta, Abhay Mehta, Song Wang, Umeshwar Day...
ECEASST
2010
15 years 4 months ago
Integrating Data from Multiple Repositories to Analyze Patterns of Contribution in FOSS Projects
The majority of Free and Open Source Software (FOSS) developers are mobile and often use different identities in the projects or communities they participate in. These characteri...
Sulayman K. Sowe, Antonio Cerone
COLING
2010
15 years 1 month ago
An Empirical Study on Web Mining of Parallel Data
This paper presents an empirical approach to mining parallel corpora. Conventional approaches use a readily available collection of comparable, nonparallel corpora to extract par...
Gum-Won Hong, Chi-Ho Li, Ming Zhou, Hae-Chang Rim
EMNLP
2011
14 years 6 months ago
Data-Driven Response Generation in Social Media
We present a data-driven approach to generating responses to Twitter status posts, based on phrase-based Statistical Machine Translation. We find that mapping conversational stim...
Alan Ritter, Colin Cherry, William B. Dolan
ACL
2012
13 years 9 months ago
Personalized Normalization for a Multilingual Chat System
This paper describes the personalized normalization of a multilingual chat system that supports chatting in user defined short-forms or abbreviations. One of the major challenges ...
Ai Ti Aw, Lian Hau Lee