Document Analysis and Recognition – ICDAR 2021

16th International Conference, Lausanne, Switzerland, September 5–10, 2021, Proceedings, Part II

Josep Lladós, Daniel Lopresti, Seiichi Uchida (Herausgeber)

Buch | Softcover

XX, 873 Seiten

2021 | 1st ed. 2021
Springer International Publishing (Verlag)
978-3-030-86330-2 (ISBN)

Artikel merken

This four-volume set of LNCS 12821, LNCS 12822, LNCS 12823 and LNCS 12824, constitutes the refereed proceedings of the 16^th International Conference on Document Analysis and Recognition, ICDAR 2021, held in Lausanne, Switzerland in September 2021. The 182 full papers were carefully reviewed and selected from 340 submissions, and are presented with 13 competition reports.

The papers are organized into the following topical sections: document analysis for literature search, document summarization and translation, multimedia document analysis, mobile text recognition, document analysis for social good, indexing and retrieval of documents, physical and logical layout analysis, recognition of tables and formulas, and natural language processing (NLP) for document understanding.

Document Analysis for Literature Search.- Towards Document Panoptic Segmentation with Pinpoint Accuracy: Method and Evaluation.- A Math Formula Extraction and Evaluation Framework for PDF Documents.- Toward Automatic Interpretation of 3D Plots.- Document Summarization and Translation.- Can Text Summarization Enhance the Headline Stance Detection Task? Benefits and Drawbacks.- The Biased Coin Flip Process for Nonparametric Topic Modeling.-CoMSum and SIBERT: A Dataset and Neural Model for Query-Based Multi-Document Summarization.- RTNet: An End-to-End Method for Handwritten Text Image Translation.- NTable: A Dataset for Camera-based Table Detection.- Multimedia document analysis.- Label Selection Algorithm Based on Boolean Interpolative Decomposition with Sequential Backward Selection for Multi-label Classification.- GSSF: A Generative Sequence Similarity Function based on a Seq2Seq model for clustering online handwritten mathematical answers.- C2VNet: A Deep Learning Framework Towards Comic Strip to Audio-Visual Scene Synthesis.- LSTMVAEF: Vivid Layout via LSTM-based Variational Autoencoder Framework.- Mobile Text Recognition.- HCRNN: A Novel Architecture for Fast Online Handwritten Stroke Classification.- RFDoc: memory efficient local descriptors for ID documents localization and classification.- Dynamic Receptive Field Adaptation for Attention-Based Text Recognition.- Context-Free TextSpotter for Real-Time and Mobile End-to-End Text Detection and Recognition.- MIDV-LAIT: a challenging dataset for recognition of IDs with Perso-Arabic, Thai, and Indian scripts,. Determining optimal frame processing strategies for real-time document recognition systems.- Document Analysis for Social Good.- Embedded Attributes for Cuneiform Sign Spotting.- Date Estimation in the Wild of Scanned Historical Photos: An Image Retrieval Approach.- Two-Step Fine-Tuned Convolutional Neural Networks for Multi-Label Classification of Children's Drawings.- DCINN: Deformable Convolution and Inception Based Neural Network for Tattoo Text Detection through Skin Region.- Sparse Document Analysis using Beta-Liouville Naive Bayes with Vocabulary Knowledge.- Automatic Signature-based Writer Identification in Mixed-script Scenarios.- Indexing and Retrieval of Documents.- Indexing and Retrieval of Documents.- Learning to Rank Words: Optimizing Ranking Metrics for Word Spotting.- A-VLAD: An End-to-End Attention-based Neural Network for Writer Identification in Historical Documents.- Manga-MMTL: multimodal multitask transfer learning for manga character analysis.- Probabilistic Indexing and Search for Hyphenated Words.- Physical and Logical Layout Analysis.- SandSlide: Automatic Slideshow Normalization.- Digital Editions as Distant Supervision for Layout Analysis of Printed Books.- Palmira: A Deep Deformable Network for Instance Segmentation of Dense and Uneven Layouts in Handwritten Manuscripts.- Page Layout Analysis System for Unconstrained Historic Documents.- Improved Graph Methods for Table Layout Understanding.- Unsupervised learning of text line segmentation by differentiating coarse patterns.- Recognition of Tables and Formulas.- Rethinking Table Structure Recognition Using Sequence Labeling Methods.- TabLeX: A Benchmark Dataset for Structure and Content Information Extraction from Scientific Tables.- Handwritten Mathematical Expression Recognition with Bidirectionally Trained Transformer.- TabAug: Data Driven Augmentation for Enhanced Table Structure Recognition.- An Encoder-Decoder Approach to Handwritten Mathematical Expression Recognition with Multi-Head Attention and Stacked Decoder.- Global Context for improving recognition of Online Handwritten Mathematical Expressions.- Image-based Relation Classification Approach for Table Structure Recognition.- Image to LaTeX with Graph Neural Network for Mathematical Formula Recognition.- NLP for Document Understanding.- A Novel Method for AutomatedSuggestion of Similar Software Incidents using 2-Stage Filtering: Findings on Primary Data.- Research on pseudo-label technology for multi-label news classification.- Information Extraction from Invoices.- Are You Really Complaining? A Multi-task Framework for Complaint Identification, Emotion and Sentiment Classification.- Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.- Data Centric Domain Adaptation for Historical Text with OCR Errors.- Temporal Ordering of Events via Deep Neural Networks.- Document Collection Visual Question Answering.- Dialogue Act Recognition using Visual Information.- Are End-to-End Systems Really Necessary for NER on Handwritten Document Images?.- Training Bi-Encoders for Word Sense Disambiguation.- DeepCPCFG: Deep Learning and Context Free Grammars for End-to-End Information Extraction.- Consideration of the word's neighborhood in GATs for information extraction in semi-structured documents.

Erscheinungsdatum	07.09.2021
Reihe/Serie	Image Processing, Computer Vision, Pattern Recognition, and Graphics
Reihe/Serie	Lecture Notes in Computer Science
Zusatzinfo	XX, 873 p. 329 illus., 277 illus. in color.
Verlagsort	Cham
Sprache	englisch
Maße	155 x 235 mm
Gewicht	1347 g
Themenwelt	Informatik ► Grafik / Design ► Digitale Bildverarbeitung
Themenwelt	Informatik ► Theorie / Studium ► Künstliche Intelligenz / Robotik
Schlagworte	Applications • Artificial Intelligence • Character Recognition • Computational Linguistics • Computer Science • Computer systems • computer vision • conference proceedings • Databases • Data Mining • Education • Image Analysis • Image Processing • Image Segmentation • Informatics • Information Retrieval • Linguistics • machine learning • Natural Language Processing (NLP) • Natural Languages • pattern recognition • Research • Semantics • text processing
ISBN-10	3-030-86330-1 / 3030863301
ISBN-13	978-3-030-86330-2 / 9783030863302
Zustand	Neuware