


default search action
DocEng 2025: Nottingham, UK
- Steven R. Bagley, Steven J. Simske, Charlotte Curtis, Cerstin Mahlow:

Proceedings of the 2025 ACM Symposium on Document Engineering, DocEng 2025, Nottingham, UK, September 2-5, 2025. ACM 2025, ISBN 979-8-4007-1351-4
Keynote Talks
- Debora Weber-Wulff

:
Detecting and Documenting Plagiarism and GenAI Use. 1:1 - Charles Nicholas

:
Issues in Document Security. 2:1
Tutorials
- Sirisha Velampalli

:
LLM-assisted Automatic Feature Extraction for Document Understanding and Analytics. 3:1-3:2 - Frank Mittelbach

, Ulrike Fischer
, David Carlisle
, Joseph Wright
:
Well-Tagged PDF and Universal Accessibility with LATEX. 4:1
Editorials
- Ethan V. Munson

:
Celebrating 25 Years of Document Engineering. 5:1-5:3 - Gustavo P. Chaves

, Thaylor Vieira
, Gabriel de F. P. e Silva
, Rafael Dueire Lins
, Steven J. Simske
:
Binarizing Photographed Document Images 2025 Quality, Time and Space Assessment. 6:1-6:10
Document Information Retrieval
- Patrick Healy:

Session details: Document Information Retrieval. - Besat Kassaie

, Andrew Kane
, Frank Wm. Tompa
:
Exploiting Query Reformulation and Reciprocal Rank Fusion in Math-Aware Search Engines. 7:1-7:10 - Daniel Travaglia

, Jesper Findahl
, Marco D'Ambros
, Andrea Mocci
, Raphael Parchet
:
Mining a Century of Swiss Trademark Data. 8:1-8:10 - Antoine Boiteau

, Yann Mathet
, Antoine Widlöcher
:
OPERA: An Environment Extending Coreference Annotation to Relations Between Entities. 9:1-9:10 - Ryan C. Barron

, Maksim Ekin Eren
, Valentin G. Stanev
, Cynthia Matuszek
, Boian S. Alexandrov
:
Topic Modeling and Link-Prediction for Material Property Discovery. 10:1-10:4
Optical Character Recognition
- Steve Simske:

Session details: Optical Character Recognition. - David Villanova-Aparisi

, Carlos D. Martínez-Hinarejos
, Verónica Romero
, Moisés Pastor-i-Gadea
:
Improving Lightweight Named Entity Recognition in Handwritten Documents by Predicting Pyramidal Histograms of Characters. 11:1-11:9 - Philipp Hildebrandt

, Maximilian Schulze
, Sarel Cohen
, Vanja Doskoc, Raid Saabni
, Tobias Friedrich
:
Text Image Super-Resolution for Improved OCR in Real-Life Scenarios using Swin Transformers. 12:1-12:9 - Alexander Most

, Joseph Winjum
, Manish Bhattarai
, Shawn Jones
, Nishath Rajiv Ranasinghe
, Ayan Biswas
, Dan O'Malley
:
Lost in OCR Translation? Vision-Based Approaches to Robust Document Retrieval. 13:1-13:10 - Sávio Santos de Araújo

, Byron Leite Dantas Bezerra
, Arthur Flor de Sousa Neto
:
A Proposal of Post-OCR Spelling Correction Using Monolingual Byte-level Language Models. 14:1-14:4 - Andreas Evaggelatos

, Konstantinos Palaiologos
, Basilis Gatos
, Panagiotis Kaddas
, Aikaterini Christopoulou
, Vassilis Katsouros
:
Old Greek OCR Result Correction Using LLMs. 15:1-15:4
Document Organization and Generation
- Ethan Munson:

Session details: Document Organization and Generation. - Hassan Hussein

, Allard Oelen
, Sören Auer
:
A Hybrid, Neuro-symbolic Approach for Scholarly Knowledge Organization. 16:1-16:10 - Uwe M. Borghoff

, Peter Rödig
:
Preserving Measurement Data Records Long-term: A Field Study on Information Management in the Wake of the 1986 Chernobyl Disaster. 17:1-17:4 - Didier Verna

:
Towards More Homogeneous Paragraphs. 18:1-18:10 - Frank Mittelbach

, Ulrike Fischer
, David Carlisle
, Joseph Wright
:
MathML and other XML Technologies for Accessible PDF from LATEX. 19:1-19:4 - Shad Mohammad

, Elöd Egyed-Zsigmond
, Franck Lebourgeois
, Michiel Streijger
, Michela Bussotti
, Luis Tovar Pimentel
, Vincent Paillusson
:
Measuring temporal gains in assisted document transcription. 20:1-20:4
DocEng Demonstrations
- Cerstin Mahlow:

Session details: DocEng Demonstrations. - Jie Wang

:
A Comprehensive AI-Powered Editing and Typesetting Platform for Enhancing Academic Writing. 21:1-21:2 - Dominik Opitz

, Andreas Hamm
:
Use Case Demonstration @ DocEng2025: Conversation-Driven Multi-LLM Framework for Web Document Sentiment Analysis. 22:1-22:2 - Afonso Ferreira

, Cleber Zanchettin
, Romulo Andrade
, Byron Leite Dantas Bezerra
:
The Di2Win Document Intelligence Platform. 23:1-23:2
Document Classification
- Besat Kassaie:

Session details: Document Classification. - Shahriar Shayesteh

, Mukund Srinath
, Lee Matheson
, Lu Xian
, Sinjoy Saha
, C. Lee Giles
, Shomir Wilson
:
SoAC and SoACer: A Sector-Based Corpus and LLM-Based Framework for Sectoral Website Classification. 24:1-24:10 - Fatemeh Amerehi

, Patrick Healy
:
Robust Image Classifiers Fail Under Shifted Adversarial Perturbations. 25:1-25:8 - Zhijian Li

, Stefan Larson
, Kevin Leach
:
Document Classification using File Names. 26:1-26:10 - Stefan Larson

, Sharad Duwal
, Brian Vilnrotter
, Gayatri Chakkithara
, Vedant Padwal
, Kevin Leach
:
Spurious Cues in RVL-CDIP and Tobacco3482 Document Classification: The Case of ID Codes. 27:1-27:4
Document Analysis and Generation
- Didier Verna:

Session details: Document Analysis and Generation. - Anya Amel Nait Djoudi

, Patrice Bellot
, Adrian-Gabriel Chifu
:
BioReadNet: A Transformer-Driven Hybrid Model for Target Audience-Aware Biomedical Text Readability Assessment. 28:1-28:10 - Valeria Nardoni

, Kimiya Noor Ali
, Zahra Ziran
, Simone Marinai
:
Visual Large Language Models for Graphics Understanding: A Case Study on Floorplan Images. 29:1-29:4 - Cerstin Mahlow

:
Designing Visual Tools for Writing Process Analysis. 30:1-30:4 - Pablo Melendez Abarca

, Clemens Havas
:
Synthetic Document Generation with Full Annotation: A Framework Utilizing Open-Weight Large Language Models. 31:1-31:4 - Xavier Daull

, Elisabeth Murisasco
, Patrice Bellot
, Emmanuel Bruno
, Vincent Martin
:
An Adaptive Agentic Tool Building Architecture leveraging Expert-in-the-Loop Guidance, applied to Document Generation. 32:1-32:4
Document Trust and Security
- Charlotte Curtis:

Session details: Document Trust and Security. - Fatima-Taslima Hassan

, Richey Okoh-Michael
:
Reinforcing Document Privacy in Nigeria: A Framework for Trust in National Data Systems. 33:1-33:4 - Isaac Henry Teuscher

, Benjamin L. Schooley
:
Document Encryption in Practice: A Comparative Framework and Evaluation. 34:1-34:4 - Raguvir S

, Charles Nicholas
:
Hierarchical Clustering of the SOREL Malware Corpus. 35:1-35:4

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














