Sciweavers

DAS
2006
Springer
15 years 10 months ago
A System for Converting PDF Documents into Structured XML Format
We present in this paper a system for converting PDF legacy documents into structured XML format. This conversion system first extracts the different streams contained in PDF files...
Hervé Déjean, Jean-Luc Meunier
DAS
2006
Springer
15 years 10 months ago
Performance Comparison of Six Algorithms for Page Segmentation
This paper presents a quantitative comparison of six algorithms for page segmentation: X-Y cut, smearing, whitespace analysis, constrained text-line finding, Docstrum, an...
Faisal Shafait, Daniel Keysers, Thomas M. Breuel
DAS
2006
Springer
15 years 10 months ago
Retrieval from Document Image Collections
This paper presents a system for retrieval of relevant documents from large document image collections. We achieve effective search and retrieval from a large collection ...
A. Balasubramanian, Million Meshesha, C. V. Jawaha...
DAS
2006
Springer
15 years 8 months ago
XCDF: A Canonical and Structured Document Format
Accessing the structured content of PDF documents is a difficult task, requiring pre-processing and reverse engineering techniques. In this paper, we first present different methods...
Jean-Luc Bloechle, Maurizio Rigamonti, Karim Hadja...
DAS
2006
Springer
15 years 10 months ago
On Benchmarking of Invoice Analysis Systems
An approach is presented to guide the benchmarking of invoice analysis systems, a specific, applied subclass of document analysis systems. The state of the art of benchma...
Bertin Klein, Stefan Agne, Andreas Dengel