Publications

2025

TANQ: An open domain dataset of table answered questions.

Mubashara Akhtar,* Chenxi Pang,* Andreea Marzoca, Yasemin Altun, Julian Martin Eisenschlos

Transactions of the Association for Computational Linguistics (TACL), 2025.

The 2nd automated verification of textual claims (AVeriTeC) shared task: Open-weights, reproducible and efficient systems.

Mubashara Akhtar, Rami Aly, Yulong Chen, Zhenyun Deng, Michael Schlichtkrull, Chenxi Whitehouse, Andreas Vlachos

Proceedings of the Eighth Fact Extraction and VERification Workshop (FEVER), 2025.

LEXam: Benchmarking Legal Reasoning on 340 Law Exams.

Yu Fan, Jingwei Ni, Jakob Merane, Etienne Salimbeni, Yang Tian, Yoan Hermstrüwer, Yinya Huang, Mubashara Akhtar, et al.

arXiv, 2025.

Reasoning with Confidence: Efficient Verification of LLM Reasoning Steps via Uncertainty Heads.

Jingwei Ni, Ekaterina Fadeeva, Tianyi Wu, Mubashara Akhtar, Jiaheng Zhang, Elliott Ash, Markus Leippold, Timothy Baldwin, See-Kiong Ng, Artem Shelmanov, Mrinmaya Sachan

arXiv, 2025.

Compose and Fuse: Revisiting the Foundational Bottlenecks in Multimodal Reasoning.

Yucheng Wang, Yifan Hou, Aydin Javadov, Mubashara Akhtar, Mrinmaya Sachan

arXiv, 2025.

Chimera: Diagnosing Shortcut Learning in Visual-Language Understanding.

Ziheng Chi, Yifan Hou, Chenxi Pang, Shaobo Cui, Mubashara Akhtar, Mrinmaya Sachan

arXiv, 2025.

2024

The Automated Verification of Textual Claims (AVeriTeC) Shared Task.

Michael Schlichtkrull, Yulong Chen, Chenxi Whitehouse, Zhenyun Deng, Mubashara Akhtar, et al.

Proceedings of the Seventh Fact Extraction and VERification Workshop (FEVER), 2024.

Proceedings of the Seventh Fact Extraction and VERification Workshop (FEVER).

Michael Schlichtkrull, Yulong Chen, Chenxi Whitehouse, Zhenyun Deng, Mubashara Akhtar, et al.

Proceedings of the Seventh Fact Extraction and VERification Workshop (FEVER), 2024.

Croissant: A Metadata Format for ML-Ready Datasets.

Mubashara Akhtar,* Omar Benjelloun,* Costanza Conforti,* et al.

Advances in Neural Information Processing Systems 37 (NeurIPS 2024), spotlight. (top ~3% of submissions)

Croissant: A Metadata Format for ML-Ready Datasets.

Mubashara Akhtar,* Omar Benjelloun,* Costanza Conforti,* et al.

DEEM workshop @SIGMOD, 2024. (Best Short Paper award 🥇)

A Standardized Machine-readable Dataset Documentation Format for Responsible AI.

Nitisha Jain,* Mubashara Akhtar,* Joan Giner-Miguelez,* Rajat Shinde,* et al.

arXiv, 2024.

ChartCheck: Explainable Fact-Checking over Real-World Chart Images.

Mubashara Akhtar, Nikesh Subedi, Vivek Gupta, Sahar Tahmasebi, Oana Cocarascu, Elena Simperl

Findings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL), 2024.

2023

Exploring the Numerical Reasoning Capabilities of Language Models: A Comprehensive Analysis on Tabular Data.

Mubashara Akhtar, Abhilash Shankarampeta, Vivek Gupta, Arpit Patil, Oana Cocarascu, Elena Simperl

Findings of the Annual Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.

Multimodal Automated Fact-Checking: A Survey.

Mubashara Akhtar, Michael Schlichtkrull, Zhijiang Guo, Oana Cocarascu, Elena Simperl, Andreas Vlachos

Findings of the Annual Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.

Reading and Reasoning over Chart Images for Evidence-based Automated Fact-Checking.

Mubashara Akhtar, Oana Cocarascu, Elena Simperl

Findings of the Annual Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023.

Proceedings of the Sixth Fact Extraction and VERification Workshop (FEVER).

Mubashara Akhtar, Rami Aly, Christos Christodoulopoulos, Oana Cocarascu, Zhijiang Guo, Arpit Mittal, Michael Schlichtkrull, James Thorne, Andreas Vlachos

Proceedings of the Sixth Fact Extraction and VERification Workshop (FEVER), 2023.

2022

PubHealthTab: A public health table-based dataset for evidence-based fact checking.

Mubashara Akhtar, Oana Cocarascu, Elena Simperl

Findings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2022.
