About Me
Hi! I’m Mehrab, a Research Scientist at Adobe Research since April 2023. Currently I am focusing on the automated optimization of LLM-based agents for reasoning, planning, and decision-making with the help of synthetic data generation to mitigate data scarcity challenges in the optimization and evaluation phases. Beyond this, my research interests span query understanding and disambiguation, trust and bias in generative AI systems, recommender systems, and designing scalable machine learning algorithms.
I earned my Ph.D. in Computer Science from the University of California, San Diego (UCSD), where I was advised by Prof. Garrison W. Cottrell. My doctoral research focused on enhancing fairness and robustness in generative models across diverse modalities and tasks, including text-to-image generation, image translation, and attribute-based image synthesis. During my Ph.D., I interned at Adobe Research (Summer 2020–2022), where I worked on debiasing generative models; at Etsy as a Data Science Intern (Summer 2019); and at The Home Depot as a Data Science Intern (Fall 2022). I am grateful to Adobe and The Home Depot for the generous gift funds to my advisor for supporting my research.
Before UCSD, I received my bachelor’s degree in Computer Science from the Bangladesh University of Engineering and Technology (BUET), where I worked with Dr. Muhammad Abdullah Adnan on scalable machine learning algorithms in distributed environments like Spark and Hadoop.
To learn more about my work, please visit publications.
Life Updates
[2025] Promoted to Research Scientist 2 in January! 
[2024] Became a father in late October!
[2023] Joined Adobe as a Research Scientist in April! 
Patents
 Scalable Video Fingerprinting for Content Authenticity 
Ritwik Sinha, Viswanathan Swaminathan, Simon Jenni, Md Mehrab Tanjim, John Collomosse 
United States Patent Application 18/473045, 27 March 2025 
link

 ENHANCING NEXT ITEM RECOMMENDATION THROUGH CROSS-ATTENTION 
Walid Shalaby, Xiquan Cui, Janani Balaji, Md Mehrab Tanjim 
United States Patent Application 18/736,371, 12 December 2024 
link

 System and methods for diversity auditing 
Md Mehrab Tanjim, Ritwik Sinha, Moumita Sinha, David Thomas Arbour, Sridhar Mahadevan 
United States Patent Application 17/652,026, 3 Decemebr 2024 
link

 Debiasing image to image translation models 
Md Mehrab Tanjim, Krishna Kumar Singh, Kushal Kafle, Ritwik Sinha 
United States Patent Application 17/880,120, 8 February 2024 
link

 GENERATING SIMULATED IMAGES THAT ENHANCE SOCIO-DEMOGRAPHIC DIVERSITY 
Md Mehrab Tanjim, Ritwik Sinha, Krishna Kumar Singh, Sridhar Mahadevan, David Arbour, Moumita Sinha 
United States Patent Application 17/485780, 2023 
link

Publications
 Calibrating MLLM-as-a-judge via Multimodal Bayesian Prompt Ensembles. 
Eric Slyman, Md Mehrab Tanjim, Kushal Kafle, Stefan Lee 
ICCV, 2025 

 SKALD: Learning-Based Shot Assembly for Coherent Multi-Shot Video Creation. 
Chen Yi Lu, Md Mehrab Tanjim, Ishita Dasgupta, Somdeb Sarkhel, Gang Wu, Saayan Mitra, Somali Chaterji 
ICCV, 2025 
pdf

 VISIAR: Empower MLLM for Visual Story Ideation. 
Zhaoyang Xia, Somdeb Sarkhel, Md Mehrab Tanjim, Stefano Petrangeli, Ishita Dasgupta, Yuxiao Chen, JINXUAN XU, Di Liu, Saayan Mitra, Dimitris N. Metaxas 
ACL, 2025 

 Diversify-verify-adapt: Efficient and Robust Retrieval-Augmented Ambiguous Question Answering. 
Yeonjun In, Sungchul Kim, Ryan A Rossi, Md Mehrab Tanjim, Tong Yu, Ritwik Sinha, Chanyoung Park 
NAACL, 2025 
pdf

 Self-debiasing large language models: Zero-shot recognition and reduction of stereotypes. 
Isabel O Gallegos, Ryan A Rossi, Joe Barrow, Md Mehrab Tanjim, Tong Yu, Hanieh Deilamsalehy, Ruiyi Zhang, Sungchul Kim, Franck Dernoncourt 
NAACL, 2025 
pdf

 Bias and fairness in large language models: A survey 
Isabel O Gallegos, Ryan A Rossi, Joe Barrow, Md Mehrab Tanjim, Sungchul Kim, Franck Dernoncourt, Tong Yu, Ruiyi Zhang, Nesreen K Ahmed 
Computational Linguistics, 2024 
pdf

 ECLAIR: Enhanced Clarification for Interactive Responses. 
John Murzaku, Zifan Liu, Md Mehrab Tanjim, Vaishnavi Muppala, Xiang Chen, Yunyao Li 
IAAI 2025 
pdf

 RECON: Training-Free Acceleration for Text-to-Image Synthesis with Retrieval of Concept Prompt Trajectories. 
Chen-Yi Lu, Shubham Agarwal, Md Mehrab Tanjim, Kanak Mahadik, Anup Rao, Subrata Mitra, Shiv Kumar Saini, Saurabh Bagchi, Somali Chaterji 
ECCV, 2024 
pdf

 Discovering and Mitigating Biases in CLIP-based Text-to-Image Generation. 
Md Mehrab Tanjim, Krishna Kumar Singh, Kushal Kafle, Ritwik Sinha, Garrison W. Cottrell 
WACV, 2024 
pdf

 Debiasing Image-to-Image Translation Models. 
Md Mehrab Tanjim, Krishna Kumar Singh, Kushal Kafle, Ritwik Sinha, Garrison W. Cottrell 
British Machine Vision Conference (BMVC), 2022 
pdf

 Generating and Controlling Diversity in Image Search. 
Md Mehrab Tanjim, Ritwik Sinha, Krishna Kumar Singh, Sridhar Mahadevan, David Arbour, Moumita Sinha, Garrison W. Cottrell 
Winter Conference on Applications of Computer Vision (WACV), 2022 
pdf

 Fast, scalable and geo-distributed PCA for big data analytics 
TM Tariq Adnan, Md Mehrab Tanjim, Mummad Abdullah Adnan 
Information Systems, Elsevier, 2021 
pdf | code
 DynamicRec: A Dynamic Convolutional Network for Next Item Recommendation 
Md Mehrab Tanjim, Hammad A. Ayyubi, Garrison W. Cottrell 
Conference on Information and Knowledge Management (CIKM), 2020 
pdf | code
 Attentive sequential models of latent intent for next item recommendation 
Md Mehrab Tanjim, Congzhe Su, Ethan Benjamin, Dian Hu, Liangjie Hong, Julian McAuley 
World Wide Web (WWW), 2020 
pdf | code

 sSketch: A scalable sketching technique for PCA in the cloud. 
Md Mehrab Tanjim, Muhammad Abdullah Adnan 
Web Search & Data Mining (WSDM), 2018 
pdf | code
Projects
Bias and Fairness
 Discovering and Mitigating Biases in CLIP-based Text-to-Image Generation 
Discovered the queries for which the popular CLIP model biases the generated images in the text-to-image synthesis task and proposed several ways to mitigate the biases without retraining CLIP or the underlying generative model. 
 Tech Stack: Pytorch

 Debiasing Image-to-Image Translation Models 
Pretrained StyleGAN2 based networks show various biases in different image-to-image translation tasks (such as super-resolution, sketch-to-image, etc.). Mitigated this bias issue using contrastive learning and uniform sampling of minority attributes. 
 Tech Stack: Pytorch

 Bias Detection in Image Search and Mitigation 
Identified the bias issue in the image results for search queries, proposed a way to audit. In addition, proposed an attribute-controlled style-based generator to create new content to mitigate such biases and enrich user experience. 
 Tech Stack: Pytorch, Tensorflow

Recommender Systems
 Dynamic Convolution 
Built an adaptive convolution network which changes its kernel dynamically depending on the current input (~10% better recommendations). 
 Tech Stack: Pytorch

 Intent Detection for Recommendation 
Captured users’ hidden intents (i.e. explore, purchase) from their interactions by designing a hierarchical Transformer model. It first discovers these intents and then pays attention to them for next item prediction (improved personalized recommendations by 5%). 
 Tech Stack: Tensorflow

 Lightweight Convolutional Network for Recommendation 
Improved the scalability of sequential recommender methods by modelling a scalable depth-wise separable 1D convolution neural network (requires ~30% less memory). 
 Tech Stack: Tensorflow

Multi-modal Learning
 Visual Commonsense Reasoning. 
Enforced reasoning for ans. prediction on VCR by building a differentiable module which jointly trains ans. and rationale prediction (performed better in leaderboard). 
 Tech Stack: Pytorch

 Rationale Generation 
Tasked state-of-the-art Visual Question Answering model (ViLBERT) with rationale generation (using GPT-2) to interpret/justify answer prediction. It improves accuracy by 1.5% as well. 
 Tech Stack: Pytorch

Scalable Machine Learning
 Scalable Video Fingerprinting 
Built a scalable, end-to-end pipeline using FAISS library that can trace a manipulated video in less than a second from a trusted database with millions of corpuses. 
 Tech Stack: Tensorflow

 Distributed Algorithm Design 
Extended both Spark and Hadoop for creating geo-distributed clusters in AWS and designed geo-distributed algorithms for higher dimension data. 
 Tech Stack: Java, Spark, Hadoop

 Scalable Principal Component Analysis. 
Improved the scalability of PCA for large datasets (up to 83× better performance) using sketching technique. 
 Tech Stack: Java, Scala, Spark

Miscellaneous
 Fair Image Search Engine using CLIP 
Built a search engine by calculating the cosine similarity scores between CLIP embedding of the text query and images in the database and sorting from highest to lowest to show the most relevant images on the top. Additionally, employed a diversity constraint using the embedding for different demographics. 
 Tech Stack: Pytorch

 Image Colorization using Cycle Consistency Loss 
Explored the potential of using Cycle Consistency Loss between grey and colored images in Generative Adversarial Networks for generating true and vivid colors for black & white images. 
 Tech Stack: Pytorch

 Pragmatic Probabilistic Principal Component Analysis 
 Using sketching techniques, derived a warm initialization for Expectation Maximization (EM) algorithm of Probabilistic PCA (PPCA). This speeds up convergence up to 2.25×. 
 Tech Stack: Java, Spark

 Digitor: A Digital Circuit Simulator 
Developed a digital circuit simulator app where one can draw digital circuits and simulate its behavior. It can automatically derive the boolean expression from the circuit and minimize it (using Quine–McCluskey algorithm) to suggest a simplified implementation.
 Tech Stack: Java

