Adithya S Kolavi

AI Researcher, Building Generative AI solutions at Scale

SK

About

Building AI that actually works! Currently deep into Vision-Language Models and Agentic Systems, with hands-on experience taking AI projects from wild ideas to real products. Love tinkering with model fine-tuning and cloud deployments. Big open-source enthusiast - you'll find me contributing to projects that make AI more accessible to everyone.

Work Experience

Featured Open Source Work

Academic Publications

Research papers and academic contributions

ICCV 2025

Nayana: A Foundation for Document-Centric Vision-Language Models via Multi-Task, Multimodal, and Multilingual Data Syn-thesis

Workshop on Computer Vision for Developing Countries (CV4DC)2025
Accepted

A comprehensive approach to generating synthetic datasets for training vision-language models on document understanding tasks across multiple languages.

Dataset Generation
Multimodal AI
Document Understanding
CVPR 2025

ViViD - Vision Language model for Unified Visual Understanding of Documents

Emergent Visual Abilities and Limits of Foundation Models (EVAL-FoMo 2025)2025
Accepted

A vision-language model specifically optimized for document understanding tasks, capable of processing diverse document formats with high accuracy.

Vision-Language Models
Document Understanding
Multimodal AI
Coming Soon
NAACL 2025

Nayana OCR: A Scalable Framework for Document OCR in Low-Resource Languages

Language Models for Underserved Communities2025
Accepted

Development of a specialized OCR system designed for low-resource Indic languages, addressing unique challenges in character recognition and document processing.

OCR
Low-Resource Languages
Document Processing

Achievements & News

Latest updates, recognitions, and highlights

Omniparse Hits 6500 Stars on GitHub

April 2025

Omniparse, our open-source document parsing library, has reached 6500 stars on GitHub, making it one of the most popular libraries for document processing.

Open Source
GitHub
Milestone
View Repository

Awarded LLaMA Impact Grant by Meta AI

April 2025

Cognitivelab was seleted as one of the recipients of Meta's LLaMA Impact Grant for our work on extending large language models to under-resourced Indic languages.

Award
Grant
Meta AI
Announcement

Latest Blog Posts

Recent articles and insights

View all posts

Skills

PyTorch
Transformers
PEFT
Bitsandbytes
Diffusers
Hugging Face Ecosystem
NLTK
Scapy
FastAPI
Flask
Django
OpenCV
BeautifulSoup
Selenium
Pandas
Poetry
Langchain
React.js
Next.js
Express
Node.js
Vue.js
Bootstrap
Tailwind
Azure
Azure Machine Learning
AWS
AWS SageMaker
Docker
Kubernetes
Cloudflare
E2E Cloud
Databricks
Azure Data Factory
Apache Spark
Hadoop
Kafka
MongoDB
PostgreSQL
Firebase
Redis
MySQL
Supabase
Pinecone
FAISS
Qdrant
ChromaDb
HTML
CSS
JavaScript
TypeScript
Python
C/C++
SQL
Showing 54 total skills

Education

PES University

2021 - 2025
Bachelor's Degree in Computer Science
ADITHYA
S
KOLAVI
AI RESEARCHER, BUILDING GENERATIVE AI SOLUTIONS AT SCALE