AI & ML interests
Odia LLM, Odia Generative AI, Odia InstructLLM
Recent Activity
About
OdiaGenAI is an open research initiative advancing Generative AI, LLMs, and multimodal technologies for Odia and low-resource Indic languages through community-driven, open-source collaboration.
Vision
Empowering Odia and low-resource Indic languages through open, multimodal, and community-owned AI.
Related Hugging Face Organizations
OdiaGenAI collaborates with and maintains close ties to other HF organizations that focus on Odia and Indic LLMs:
🔗 OdiaGenAI – Main organization for Odia datasets, models, and AI tools (text, speech, OCR, multimodal).
https://huggingface.co/OdiaGenAI🔗 OdiaGenAI‑LLM – Focused LLM organization with additional Odia and Indic‑centric model releases (e.g., Mistral, LLaMA variants).
https://huggingface.co/OdiaGenAI‑LLM🔗 odiagenmllm – Organization hosting multilingual and Odia‑focused LLM projects, benchmarks, and community models.
https://huggingface.co/odiagenmllm🔗 OdiaGenAIdata – Dataset‑centric organization hosting large corpora for Odia pretraining and evaluation (if separate).
https://huggingface.co/OdiaGenAIdata🔗 OdiaGenAIOCR – Organization dedicated to Odia OCR datasets, models, and tools for printed and handwritten text recognition.
https://huggingface.co/OdiaGenAIOCR🔗 Hindi‑data‑hub – A community‑driven hub for Hindi language datasets and models, supporting Indic language research.
https://huggingface.co/Hindi-data-hub🔗 HydraIndicLM – An Indic LLM initiative focused on building and hosting language models and benchmarks for multiple Indic languages.
https://huggingface.co/HydraIndicLM🔗 ShopIntel – Organization oriented toward multilingual models and industry‑focused AI research, including support for Indic languages.
https://huggingface.co/ShopIntel🔗 Indic‑Benchmark – Initiative providing benchmarks and evaluation suites for multiple Indic languages across NLP tasks.
https://huggingface.co/Indic-Benchmark
Objectives
OdiaGenAI focuses on:
- Foundation Models for Odia and Indic Languages
- Instruction-tuned and Task-specific LLMs for Indic Use Cases
- Speech and OCR Technologies for Odia and Indic Languages
- Multimodal AI (Text + Vision + Speech) for Low-resource Languages
- Open Data Creation, Benchmarks, and Evaluation Frameworks
All outputs are released for research and non-commercial use.
Why OdiaGenAI?
- Low-resource challenge — Odia support in existing LLMs is limited due to scarce training data.
- Openness — Proprietary models restrict access; we provide free, open models and datasets.
- Ethics & privacy — Transparent data practices and community ownership of language tech.
Focus Research Areas
1. Literature & Benchmarking
Survey and evaluate generative AI and multimodal models for Odia.
2. Development
Curate datasets; build tokenizers, models, and training pipelines.
3. Deployment & Access
Host models and tools via Hugging Face, along with APIs and demos.
Who Can Use OdiaGenAI?
- Researchers, students, developers, and NGOs.
Models and datasets are available via Hugging Face for research and non-commercial purposes. Contact us for special use cases.
Key Application Areas
- Education
- Healthcare
- Governance
Contributors
-
About our logo: The critically endangered Olive Ridley sea turtle is the world's smallest and most prevalent marine turtle. Travel thousands of kilometers in the ocean for nesting. The Gahirmatha Marine Sanctuary in Odisha is the largest known mass nesting rookery for olive ridley sea turtles worldwide.
Citation
If you find this repository useful, please consider giving 👏 and citing:
@misc{OdiaGenAI,
author = {Shantipriya Parida and Sambit Sekhar and Swateek Jena and Abhijeet Parida and Satya Ranjan Dash},
title = {OdiaGenAI: Generative AI and LLM Initiative for the Odia Language},
year = {2023},
publisher = {Hugging Face},
journal = {Hugging Face repository},
howpublished = {\url{https://huggingface.co/OdiaGenAI}},
}
License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
spaces
8
Odia Qa Reasoning Generator
Odia QnA with reasoning generation app.
Odia Ocr Annotation Platform
Olive Chat
Chat with an Odia language AI assistant
Olive Farm
Generate LLM instructions from URLs, documents, or text
Olive Whisper ASR
Convert spoken Odia to text
