Our paper on Idea Generation got accepted to the EMNLP 2025 main conference!
Our dataset proposal on Complex Engineering Diagram Parsing (Enginuity) is accepted for the AI for Science workshop at NeurIPS 2025!
Our survey paper on AI for Spatial Transcriptomics has been accepted for the Imageomics workshop at NeurIPS 2025!
We released our Findings of the 3rd Automatic Minuting (AutoMin) Challenge at SIGDial 2025!
Please explore internship and educational opportunities at ORNL. Email me if you are eligible and interested in our work.
Our survey paper on HPC needs for modern-day Computational Biology research is accepted at the AI4Science workshop in ACM Supercomputing 2025!
Our paper on AI Agents for Autonomous Experiments is accepted at the XLOOP workshop in ACM Supercomputing 2025!
Presented our latest work on AI for Operations: Building Trustworthy AI Solutions in DOE Laboratory Operations at the ORNL Software Expo 2025 on September 9th
Will speak on Enhancing Safety at DOE Sites through Predictive AI: A Comprehensive Framework for Event Forecasting and Automated Work Control at the AIRES 6 workshop at ORNL on September 17th
Looking forward to hiring Ph.D.-enrolled graduate students (U.S. Nationals/LPR) via the DOE SCGSR program. Please reach out if interested in our line of work.
Our DD proposal on Agentic Exploration of Dark Matter and Dark Energy Through Cosmological Simulations got accepted for OLCF allocation! Project in collaboration with Prof. Brant Robertson (UCSC) and Prof. Yuan-Sen Ting (OSU)
Organizing the Telescope Reference and Astronomy Categorization Shared Task (TRACS) at IJCNLP-AACL 2025
Our paper on Intelligent Manufacturing Support using LLMs is accepted at MSEC 2025!
Serving as an Area Chair (meta-reviewer) for NAACL 2025, EMNLP 2025 and ACL 2025
Program Committee member in AAAI 2026, Sci-K @ WWW 2025, AI4Science at NeurIPS 2025, SCI 2025, COLM 2025, LREC 2026
A poster on AI for Predicting Vulnerabilities and Risks at DOE sites is accepted to the Smoky Mountain Data Conference 2025!
Gave a talk on how the AI4Ops ORNL/DOE projects could be transformed with a Federated Learning (FL) and Collaborative Learning (CL) framework with other DOE laboratories at the FL/CL workshop at ORNL
AstroSage-70B got accepted to the Machine Learning for Astrophysics Workshop co-located with ICML 2025!
3rd Workshop on Artificial Intelligence for Scientific Publications (WASP) will be held with IJCNLP-AACL 2025 (Hybrid). Deadline 👉 September 29, 2025
Paper on Agentic Scientific Workflows accepted in ReWorDS 25 workshop at eScience 2025!
Our DD proposal on Multi-agent LLMs for Scientific Hypothesis Generation got accepted for OLCF allocation! Project in collaboration with Dr. Jian Wu (ODU) and Dr. Sarah Rajtmajer (PSU)
The 5th Scholarly Document Processing Workshop (SDP) at ACL 2025 was a great success!
Our work on AI-generated Peer Review Detection accepted in NAACL 2025 and EMNLP 2024!
Gave a talk on Scientific Hypothesis Generation at the AI4Science workshop at ORNL
Organizing the 3rd Automatic Minuting Challenge at SIGDial 2025
Participated in the 1000 Scientist Jam with OpenAI at ORNL
Serving as the D&I chair at IJCNLP-AACL 2025
Our paper on 'Hypothesis Generation' has been retweeted over 100 times and has received 70K views!
Our DD proposal on Iterative Construction of Synthesis Knowledge Graphs got accepted for OLCF allocation! Project in collaboration with Prof. Elsa Olivetti (MIT) and Dr. Vineeth Venugopal (MIT)
Looking for a motivated PhD student to work on impactful AI4Science and AI4Operations problems!
I completed my Ph.D. in 2020 at the Indian Institute of Technology Patna, where I specialized in Computer Science and Engineering with a focus on Natural Language Processing and Machine Learning, under the guidance of Prof. Asif Ekbal and Prof. Pushpak Bhattacharyya. My thesis applied Natural Language Processing and Machine Learning to Scholarly Documents for downstream problems such as Novelty Detection, Scope Detection, and Peer Review processing in scholarly communications. Thereafter, I was a postdoctoral researcher (2020-2022) at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University, Czech Republic, working on Speech and Language Processing in the Horizon 2020 EU projects European Live Translator (ELITR) and Neural Representations in Multi-modal and Multi-lingual Modelling (NEUREM3) with Prof. Ondřej Bojar. I was the Work Package lead of the Automatic Minuting module, where we developed multilingual methods and datasets for automatically generating minutes from multiparty meeting proceedings. I led the formation of the bi-annual Automatic Minuting shared task challenge and built the community around this problem. I am also the founding organizer of the Scholarly Document Processing workshop series, starting in 2019. I also co-founded the Workshop on Artificial Intelligence for Scientific Publications (WASP) series with my colleagues at NASA/ADS at the CfA, Harvard Smithsonian, in 2021.
I joined SciSpace (2022-2023) in Bengaluru, India (headquartered in Palo Alto, California) as the Head of Research, leading the Natural Language Processing and Machine Learning team, which I built from scratch. We developed a semantic search engine and an AI tool to support academic research, addressing several scholarly use-cases. I was also the principal science advisor of the Automotive Repair Intelligence company Predii and an NLP Consultant for the litigation support services company Lexitas, both based in Palo Alto, California. I joined ORNL in May 2023 and relocated to the US. I have served as Principal Investigator and Advisor on several industry-funded projects from Cactus Communications, Acta.ai, Lexitas, RAx, ORKG, and Elsevier.
Before starting my Ph.D. at IIT Patna, I was an Assistant Professor of Computer Science and Engineering (2012-2016) at the Sikkim Manipal Institute of Technology, Sikkim, India. I hold two Master’s degrees: one in Computer Applications (MCA) and the other in Computer Science and Engineering (M.Tech.). I am a gold medalist (University First Rank Holder) in both my Bachelor’s and Master’s degrees from the University of North Bengal, India.
I currently live in Knoxville, TN, with my wife and daughter. I am originally from Siliguri, a city at the foothills of the Himalayas in West Bengal, India, where I grew up among the lush green tea gardens of the Terai-Dooars area. If you are a Darjeeling Tea lover, you have my attention! I enjoy travelling (especially road trips), exploring new places, cooking spicy Indian food, reading Bengali literature, watching good movies, and, above all, playing with my little one. My daughter’s name is Sharanya, which means one who gives refuge to others. I am also actively engaged in organizing cultural events and shaping the East Tennessee Bengali Association (ETBA) in Knoxville.
I envision a future where Artificial Intelligence (or AGI or Super Intelligence) plays a pivotal role in solving the majority of our global challenges, enabling us to become an AI-augmented human civilization while keeping empathy at the core of development and progress. I do believe love can transcend time, space, and gravity (Interstellar)! I deeply connect with Dave Patterson's reflections after a glorious career in Computer Science.