Sciweavers

695 search results - page 22 / 139
» XML Data Representation in Document Image Analysis
Sort
View
WEBI
2005
Springer
15 years 12 months ago
Integrating Element and Term Semantics for Similarity-Based XML Document Clustering
Structured link vector model (SLVM) is a recently proposed document representation that takes into account both structural and semantic information for measuring XML document simi...
Jianwu Yang, William K. Cheung, Xiaoou Chen
DRR
2004
15 years 9 months ago
SmartNails: display- and image-dependent thumbnails
In order to overcome poor readability of text and recognizability of image features in low resolution thumbnails, a novel image representation of compound document images - a Smar...
Kathrin Berkner, Edward L. Schwartz, Christophe Ma...
NLDB
2004
Springer
15 years 12 months ago
On Embedding Machine-Processable Semantics into Documents
—Most Web and legacy paper-based documents are available in human comprehensible text form, not readily accessible to or understood by computer programs. Here, we investigate an ...
Krishnaprasad Thirunarayan
ICDAR
2009
IEEE
16 years 1 months ago
A Realistic Dataset for Performance Evaluation of Document Layout Analysis
† There is a significant need for a realistic dataset on which to evaluate layout analysis methods and examine their performance in detail. This paper presents a new dataset (and...
Apostolos Antonacopoulos, David Bridson, Christos ...
ISEMANTICS
2010
15 years 8 months ago
STEX+: a system for flexible formalization of linked data
We present the STEX system, a semantic extension of LATEX, that allows for producing high-quality PDF documents for (proof)reading and printing, as well as semantic XML/OMDoc docu...
Andrea Kohlhase, Michael Kohlhase, Christoph Lange...