The rapidly growing volume of available digital documents of various formats and the possibility to access these through Internet-based technologies, have led to the necessity to develop solid methods to properly organize and structure documents in large digital libraries and repositories. Due to the extremely large volumes of documents and to their unstructured form, most of the research efforts in this direction are dedicated to automatically infer structure and schemas that can help to better organize huge collections of documents and data. This book covers the latest advances in structure inference in heterogeneous collections of documents and data. The book brings a comprehensive view of the state-of-the-art in the area, presents some lessons learned and identifies new research issues, challenges and opportunities for further research agenda and developments.  The selected chapters cover a broad range of research issues, from theoretical approaches to case studies and best practices in the field. Researcher, software developers, practitioners and students interested in the field of learning structure and schemas from documents will find the comprehensive coverage of this book useful for their research, academic, development and practice activity.
Les mer
The rapidly growing volume of available digital documents of various formats and the possibility to access these through Internet-based technologies, have led to the necessity to develop solid methods to properly organize and structure documents in large digital libraries and repositories.
Les mer
From the content: Learning Structure and Schemas from Heterogeneous Domains in Networked Systems Surveyed.- Handling Hierarchically Structured Resources Addressing Interoperability Issues in Digital Libraries.- Administrative Document Analysis and Structure.- Automatic Document Layout Analysis through Relational Machine Learning.- Dataspaces: where structure and schema meet.
Les mer
The rapidly growing volume of available digital documents of various formats and the possibility to access these through Internet-based technologies, have led to the necessity to develop solid methods to properly organize and structure documents in large digital libraries and repositories. Due to the extremely large volumes of documents and to their unstructured form, most of the research efforts in this direction are dedicated to automatically infer structure and schemas that can help to better organize huge collections of documents and data. This book covers the latest advances in structure inference in heterogeneous collections of documents and data. The book brings a comprehensive view of the state-of-the-art in the area, presents some lessons learned and identifies new research issues, challenges and opportunities for further research agenda and developments.  The selected chapters cover a broad range of research issues, from theoretical approaches to case studies and best practices in the field. Researcher, software developers, practitioners and students interested in the field of learning structure and schemas from documents will find the comprehensive coverage of this book useful for their research, academic, development and practice activity.
Les mer
Presents State-Of-The-Art Methods for Structure Learning and Schema Inference Case Studies and Best Practices from Real Large Scale Digital Libraries, Repositories and Corpora Written by Leading Experts in the Field
Les mer

Produktdetaljer

ISBN
9783642229121
Publisert
2011-09-03
Utgiver
Vendor
Springer-Verlag Berlin and Heidelberg GmbH & Co. K
Aldersnivå
Research, P, 06
Språk
Product language
Engelsk
Format
Product format
Innbundet