

- OdiaGenAI is a collaborative initiative that conducts research on Generative AI and LLM for the Odia Language.- Odia Generative AI 
- Our Mission - OdiaGenAI initiative aims to utilize the power of AI in building Generative AI and LLM-based technologies and solutions for the overall development of Odisha and Odia language through collaboration among Odia technologists. - Our Initiatives - Build Pre-trained, Fine-Tuned, and Instructed LLM for Odia. The models, codes, and dataset will be freely available for non-commercial and research purposes. 
- Releases - Available releases - OdiaGenAI-BengaliGPT Model - odiagenAI-bengali-base-model-v1 is based on Llama-7b and finetuned with 252k Bengali instruction set. The instruction set is translated data from open-source resources, resulting in good Bengali instruction understanding and response generation capabilities. - OdiaGenAI-Llama2-7B Model - odia_llama2_7B_v1 is based on Llama2-7b and finetuned with 180k Odia instruction set. The instruction set is translated data from open-source resources and a prepared domain knowledge instruction set, resulting in good Odia instruction understanding and response generation capabilities. - OdiaGenAI-Olive Farm - OliveFarm is a cutting-edge web application crafted by the innovative minds at OdiaGenAI. It's designed to effortlessly generate LLM (Language Model) instruction sets in Indic languages. Presently, it offers support for Hindi and Odia, with seamless scalability to incorporate additional languages on the horizon. - OdiaGenAI-Olive Scrapper - Olive Scraper is a web scraping tool developed by OdiaGenAI for web scraping Odia contents from different sources (e.g., websites, PDF, DOC, etc.) - OdiaGenAI-Olive Whisper - Olive Whisper facilitates automatic speech recognition and translation tasks, enabling the conversion of spoken language into text across multiple languages, followed by translation into English. The primary objective of this initiative is to optimize the Whisper model using Lora. This optimization can be applied to both timestamped and non-timestamped data, as well as data without speech information. 
 - About Our Logo - The critically endangered Olive Ridley sea turtle is the world's smallest and most prevalent marine turtle. Travel thousands of kilometers in the ocean for nesting. The Gahirmatha Marine Sanctuary in Odisha is the largest known mass nesting rookery for olive ridley sea turtles worldwide. 
- Blog- Our Blog Posts Overview The widespread adoption of AI technology in recent years has led to transformative...Overview OdiaGenAI recently launched a fine-tuned Llama3-8B model (Llama3_8B_Odia_Unsloth)...March 26, 2024Overview In the new series of Small Language Models (SLM) for Indic languages, OdiaGenAI...Overview OdiaGenAI has launched its first pre-trained Odia Large Language Model (LLM), Qwen_1...December 31, 2023(Note:: The blog post is based on the paper titled "Building a Llama2-finetuned LLM for Odia...October 30, 2023(Note:: The blog post is based on the paper titled "Generative Chatbot Adaptation for Odia...Overview OdiaGenAI released a new BengaliGPT model as part of its initiative to build...Overview Generative artificial intelligence is indeed changing many aspects of our lives and...Overview Large Language Models (LLMs) are significantly impacting the AI community, and the...
- Internship (Open) - We are looking for a few interns for short/long term (3 months to 1 Year) to contribute to OdiaGenAI - Apply through this link: https://forms.gle/gTfnjjCgA1zFeEfZA 
- Team 
- Registration- You can fill up your details to communicate about our upcoming events (meetup, webinar, workshop).- Registration-Form
- Contact Us- Feel free to contact us for any feedback/suggestions/contributions. 



