Shannon Zejiang Shen
Title
Cited by
Year
LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis
Z Shen, R Zhang, M Dell, BCG Lee, J Carlson, W Li
Document Analysis and Recognition–ICDAR 2021: 16th International Conference …, 2021
Cited by 110, 2021
A large dataset of historical Japanese documents with complex layouts
Z Shen, K Zhang, M Dell
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
Cited by 46, 2020
The Semantic Scholar open data platform
R Kinney, C Anastasiades, R Authur, I Beltagy, J Bragg, A Buraczynski, ...
arXiv preprint arXiv:2301.10140, 2023
Cited by 42, 2023
Deep learning based framework for automatic damage detection in aircraft engine borescope inspection
Z Shen, X Wan, F Ye, X Guan, S Liu
2019 International Conference on Computing, Networking and Communications …, 2019
Cited by 36, 2019
Multi-LexSum: Real-world summaries of civil rights lawsuits at multiple granularities
Z Shen, K Lo, L Yu, N Dahlberg, M Schlanger, D Downey
Advances in Neural Information Processing Systems 35, 13158-13173, 2022
Cited by 33, 2022
VILA: Improving structured content extraction from scientific PDFs using visual layout groups
Z Shen, K Lo, LL Wang, B Kuehl, DS Weld, D Downey
Transactions of the Association for Computational Linguistics 10, 376-392, 2022
Cited by 30*, 2022
Don't Say What You Don't Know: Improving the Consistency of Abstractive Summarization by Constraining Beam Search
D King*, Z Shen*, N Subramani, DS Weld, I Beltagy, D Downey
arXiv preprint arXiv:2203.08436, 2022
Cited by 24, 2022
PAWLS: PDF annotation with labels and structure
M Neumann, Z Shen, S Skjonsberg
arXiv preprint arXiv:2101.10281, 2021
Cited by 15, 2021
OLALA: Object-level active learning for efficient document layout annotation
Z Shen, J Zhao, M Dell, Y Yu, W Li
arXiv preprint arXiv:2010.01762, 2020
Cited by 13*, 2020
Dolma: An Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
L Soldaini, R Kinney, A Bhagia, D Schwenk, D Atkinson, R Authur, ...
arXiv preprint arXiv:2402.00159, 2024
Cited by 12, 2024
Beyond summarization: Designing AI support for real-world expository writing tasks
Z Shen, T August, P Siangliulue, K Lo, J Bragg, J Hammerbacher, ...
arXiv preprint arXiv:2304.02623, 2023
Cited by 11, 2023
The Semantic Reader project: Augmenting scholarly documents through AI-powered interactive reading interfaces
K Lo, JC Chang, A Head, J Bragg, AX Zhang, C Trier, C Anastasiades, ...
arXiv preprint arXiv:2303.14334, 2023
Cited by 9, 2023
American Stories: A large-scale structured text dataset of historical US newspapers
M Dell, J Carlson, T Bryan, E Silcock, A Arora, Z Shen, L D'Amico-Wong, ...
Advances in Neural Information Processing Systems 36, 2024
Cited by 7, 2024
Information Extraction from Text Regions with Complex Tabular Structure.
K Zhang, Z Shen, J Zhou, M Dell
Conference on Neural Information Processing Systems, 2019
Cited by 5, 2019
A Design Space for Intelligent and Interactive Writing Assistants
M Lee, KI Gero, JJY Chung, SB Shum, V Raheja, H Shen, S Venugopalan, ...
arXiv preprint arXiv:2403.14117, 2024
Cited by 2, 2024
Conceptualizing machine learning for dynamic information retrieval of electronic health record notes
S Jiang, S Shen, M Agrawal, B Lam, N Kurtzman, S Horng, DR Karger, ...
Machine Learning for Healthcare Conference, 343-359, 2023
Cited by 2, 2023
PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents
K Lo, Z Shen, B Newman, JZ Chang, R Authur, E Bransom, S Candra, ...
EMNLP 2023: System Demonstrations (🏆 Best Paper Demo Award 🏆), 495-507, 2023
Cited by 1, 2023
Towards Verifiable Text Generation with Symbolic References
LT Hennigen*, S Shen*, A Nrusimha, B Gapp, D Sontag, Y Kim
arXiv preprint arXiv:2311.09188, 2023
Cited by 1, 2023
Are layout-infused language models robust to layout distribution shifts? a case study with scientific documents
C Chen, Z Shen, D Klein, G Stanovsky, D Downey, K Lo
arXiv preprint arXiv:2306.01058, 2023
Cited by 1, 2023
Generating object stamps
YA Mejjati, Z Shen, M Snower, A Gokaslan, O Wang, J Tompkin, KI Kim
arXiv preprint arXiv:2001.02595, 2020
Cited by 1, 2020