About Me
Hi! I’m Mehrab, a Research Scientist at Adobe Research since April 2023. Currently I am focusing on the automated optimization of LLM-based agents for reasoning, planning, and decision-making with the help of synthetic data generation to mitigate data scarcity challenges in the optimization and evaluation phases. Beyond this, my research interests span query understanding and disambiguation, trust and bias in generative AI systems, recommender systems, and designing scalable machine learning algorithms.
I earned my Ph.D. in Computer Science from the University of California, San Diego (UCSD), where I was advised by Prof. Garrison W. Cottrell. My doctoral research focused on enhancing fairness and robustness in generative models across diverse modalities and tasks, including text-to-image generation, image translation, and attribute-based image synthesis. During my Ph.D., I interned at Adobe Research (Summer 2020–2022), where I worked on debiasing generative models; at Etsy as a Data Science Intern (Summer 2019); and at The Home Depot as a Data Science Intern (Fall 2022). I am grateful to Adobe and The Home Depot for the generous gift funds to my advisor for supporting my research.
Before UCSD, I received my bachelor’s degree in Computer Science from the Bangladesh University of Engineering and Technology (BUET), where I worked with Dr. Muhammad Abdullah Adnan on scalable machine learning algorithms in distributed environments like Spark and Hadoop.
To learn more about my work, please visit publications.
Life Updates
[2025] Promoted to Research Scientist 2 in January!
[2024] Became a father in late October!
[2023] Joined Adobe as a Research Scientist in April!
Patents
Scalable Video Fingerprinting for Content Authenticity
Ritwik Sinha, Viswanathan Swaminathan, Simon Jenni, Md Mehrab Tanjim, John Collomosse
United States Patent Application 18/473045, 27 March 2025
link

ENHANCING NEXT ITEM RECOMMENDATION THROUGH CROSS-ATTENTION
Walid Shalaby, Xiquan Cui, Janani Balaji, Md Mehrab Tanjim
United States Patent Application 18/736,371, 12 December 2024
link

System and methods for diversity auditing
Md Mehrab Tanjim, Ritwik Sinha, Moumita Sinha, David Thomas Arbour, Sridhar Mahadevan
United States Patent Application 17/652,026, 3 Decemebr 2024
link

Debiasing image to image translation models
Md Mehrab Tanjim, Krishna Kumar Singh, Kushal Kafle, Ritwik Sinha
United States Patent Application 17/880,120, 8 February 2024
link

GENERATING SIMULATED IMAGES THAT ENHANCE SOCIO-DEMOGRAPHIC DIVERSITY
Md Mehrab Tanjim, Ritwik Sinha, Krishna Kumar Singh, Sridhar Mahadevan, David Arbour, Moumita Sinha
United States Patent Application 17/485780, 2023
link

Publications
Calibrating MLLM-as-a-judge via Multimodal Bayesian Prompt Ensembles.
Eric Slyman, Md Mehrab Tanjim, Kushal Kafle, Stefan Lee
ICCV, 2025

SKALD: Learning-Based Shot Assembly for Coherent Multi-Shot Video Creation.
Chen Yi Lu, Md Mehrab Tanjim, Ishita Dasgupta, Somdeb Sarkhel, Gang Wu, Saayan Mitra, Somali Chaterji
ICCV, 2025
pdf

VISIAR: Empower MLLM for Visual Story Ideation.
Zhaoyang Xia, Somdeb Sarkhel, Md Mehrab Tanjim, Stefano Petrangeli, Ishita Dasgupta, Yuxiao Chen, JINXUAN XU, Di Liu, Saayan Mitra, Dimitris N. Metaxas
ACL, 2025

Diversify-verify-adapt: Efficient and Robust Retrieval-Augmented Ambiguous Question Answering.
Yeonjun In, Sungchul Kim, Ryan A Rossi, Md Mehrab Tanjim, Tong Yu, Ritwik Sinha, Chanyoung Park
NAACL, 2025
pdf

Self-debiasing large language models: Zero-shot recognition and reduction of stereotypes.
Isabel O Gallegos, Ryan A Rossi, Joe Barrow, Md Mehrab Tanjim, Tong Yu, Hanieh Deilamsalehy, Ruiyi Zhang, Sungchul Kim, Franck Dernoncourt
NAACL, 2025
pdf

Bias and fairness in large language models: A survey
Isabel O Gallegos, Ryan A Rossi, Joe Barrow, Md Mehrab Tanjim, Sungchul Kim, Franck Dernoncourt, Tong Yu, Ruiyi Zhang, Nesreen K Ahmed
Computational Linguistics, 2024
pdf

ECLAIR: Enhanced Clarification for Interactive Responses.
John Murzaku, Zifan Liu, Md Mehrab Tanjim, Vaishnavi Muppala, Xiang Chen, Yunyao Li
IAAI 2025
pdf

RECON: Training-Free Acceleration for Text-to-Image Synthesis with Retrieval of Concept Prompt Trajectories.
Chen-Yi Lu, Shubham Agarwal, Md Mehrab Tanjim, Kanak Mahadik, Anup Rao, Subrata Mitra, Shiv Kumar Saini, Saurabh Bagchi, Somali Chaterji
ECCV, 2024
pdf

Discovering and Mitigating Biases in CLIP-based Text-to-Image Generation.
Md Mehrab Tanjim, Krishna Kumar Singh, Kushal Kafle, Ritwik Sinha, Garrison W. Cottrell
WACV, 2024
pdf

Debiasing Image-to-Image Translation Models.
Md Mehrab Tanjim, Krishna Kumar Singh, Kushal Kafle, Ritwik Sinha, Garrison W. Cottrell
British Machine Vision Conference (BMVC), 2022
pdf

Generating and Controlling Diversity in Image Search.
Md Mehrab Tanjim, Ritwik Sinha, Krishna Kumar Singh, Sridhar Mahadevan, David Arbour, Moumita Sinha, Garrison W. Cottrell
Winter Conference on Applications of Computer Vision (WACV), 2022
pdf

Fast, scalable and geo-distributed PCA for big data analytics
TM Tariq Adnan, Md Mehrab Tanjim, Mummad Abdullah Adnan
Information Systems, Elsevier, 2021
pdf | code
DynamicRec: A Dynamic Convolutional Network for Next Item Recommendation
Md Mehrab Tanjim, Hammad A. Ayyubi, Garrison W. Cottrell
Conference on Information and Knowledge Management (CIKM), 2020
pdf | code
Attentive sequential models of latent intent for next item recommendation
Md Mehrab Tanjim, Congzhe Su, Ethan Benjamin, Dian Hu, Liangjie Hong, Julian McAuley
World Wide Web (WWW), 2020
pdf | code

sSketch: A scalable sketching technique for PCA in the cloud.
Md Mehrab Tanjim, Muhammad Abdullah Adnan
Web Search & Data Mining (WSDM), 2018
pdf | code
Projects
Bias and Fairness
Discovering and Mitigating Biases in CLIP-based Text-to-Image Generation
Discovered the queries for which the popular CLIP model biases the generated images in the text-to-image synthesis task and proposed several ways to mitigate the biases without retraining CLIP or the underlying generative model.
Tech Stack: Pytorch

Debiasing Image-to-Image Translation Models
Pretrained StyleGAN2 based networks show various biases in different image-to-image translation tasks (such as super-resolution, sketch-to-image, etc.). Mitigated this bias issue using contrastive learning and uniform sampling of minority attributes.
Tech Stack: Pytorch

Bias Detection in Image Search and Mitigation
Identified the bias issue in the image results for search queries, proposed a way to audit. In addition, proposed an attribute-controlled style-based generator to create new content to mitigate such biases and enrich user experience.
Tech Stack: Pytorch, Tensorflow

Recommender Systems
Dynamic Convolution
Built an adaptive convolution network which changes its kernel dynamically depending on the current input (~10% better recommendations).
Tech Stack: Pytorch

Intent Detection for Recommendation
Captured users’ hidden intents (i.e. explore, purchase) from their interactions by designing a hierarchical Transformer model. It first discovers these intents and then pays attention to them for next item prediction (improved personalized recommendations by 5%).
Tech Stack: Tensorflow

Lightweight Convolutional Network for Recommendation
Improved the scalability of sequential recommender methods by modelling a scalable depth-wise separable 1D convolution neural network (requires ~30% less memory).
Tech Stack: Tensorflow

Multi-modal Learning
Visual Commonsense Reasoning.
Enforced reasoning for ans. prediction on VCR by building a differentiable module which jointly trains ans. and rationale prediction (performed better in leaderboard).
Tech Stack: Pytorch

Rationale Generation
Tasked state-of-the-art Visual Question Answering model (ViLBERT) with rationale generation (using GPT-2) to interpret/justify answer prediction. It improves accuracy by 1.5% as well.
Tech Stack: Pytorch

Scalable Machine Learning
Scalable Video Fingerprinting
Built a scalable, end-to-end pipeline using FAISS library that can trace a manipulated video in less than a second from a trusted database with millions of corpuses.
Tech Stack: Tensorflow

Distributed Algorithm Design
Extended both Spark and Hadoop for creating geo-distributed clusters in AWS and designed geo-distributed algorithms for higher dimension data.
Tech Stack: Java, Spark, Hadoop

Scalable Principal Component Analysis.
Improved the scalability of PCA for large datasets (up to 83× better performance) using sketching technique.
Tech Stack: Java, Scala, Spark

Miscellaneous
Fair Image Search Engine using CLIP
Built a search engine by calculating the cosine similarity scores between CLIP embedding of the text query and images in the database and sorting from highest to lowest to show the most relevant images on the top. Additionally, employed a diversity constraint using the embedding for different demographics.
Tech Stack: Pytorch

Image Colorization using Cycle Consistency Loss
Explored the potential of using Cycle Consistency Loss between grey and colored images in Generative Adversarial Networks for generating true and vivid colors for black & white images.
Tech Stack: Pytorch

Pragmatic Probabilistic Principal Component Analysis
Using sketching techniques, derived a warm initialization for Expectation Maximization (EM) algorithm of Probabilistic PCA (PPCA). This speeds up convergence up to 2.25×.
Tech Stack: Java, Spark

Digitor: A Digital Circuit Simulator
Developed a digital circuit simulator app where one can draw digital circuits and simulate its behavior. It can automatically derive the boolean expression from the circuit and minimize it (using Quine–McCluskey algorithm) to suggest a simplified implementation.
Tech Stack: Java
