Skip to content
View aidyai's full-sized avatar
🧩
PROBLEM SOLVING
🧩
PROBLEM SOLVING

Organizations

@iko-ai

Block or report aidyai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
TypeScript 7 2 Updated Jun 19, 2024
Jupyter Notebook 1 Updated Jan 8, 2025

A Python package that combines shadow removal preprocessing with state-of-the-art OCR for accurate handwriting transcription. The package offers both local inference using MiniCPM-V and cloud-based…

Python 3 1 Updated Nov 3, 2024

Tenacious tool calling built on LangGraph

Python 470 41 Updated Mar 6, 2025
Python 258 96 Updated Feb 28, 2025

A fast multimodal LLM for real-time voice

Python 3,713 272 Updated Feb 14, 2025

first base model for full-duplex conversational audio

Python 1,716 114 Updated Jan 5, 2025
HTML 5 Updated Oct 24, 2024

Blazing fast whisper turbo for ASR (speech-to-text) tasks

Python 197 9 Updated Oct 20, 2024

📝 Generate personalized writing content

TypeScript 16 4 Updated Oct 2, 2024

An open-source implementaion for fine-tuning Molmo-7B-D and Molmo-7B-O by allenai.

Python 53 5 Updated Jan 24, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 7,791 631 Updated Mar 13, 2025

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 13,344 1,374 Updated Mar 5, 2025

Chat with Hacker News using natural language. Built with OpenAI Functions and Vercel AI SDK.

TypeScript 1,166 162 Updated Oct 30, 2023

DocLing: Multilingual Document Understanding

Jupyter Notebook 3 2 Updated Oct 19, 2024

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,874 417 Updated Mar 5, 2025

Build your own generative UI chatbot using the Vercel AI SDK and Google Gemini

TypeScript 960 361 Updated Oct 15, 2024

This is the Placeholder for Whisper Model Finetune

Jupyter Notebook 2 2 Updated Jul 24, 2024

FashionCLIP is a CLIP-like model fine-tuned for the fashion domain.

Python 371 44 Updated Jan 30, 2025

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 34,655 2,617 Updated Mar 14, 2025

Python scraper based on AI

Python 18,609 1,574 Updated Mar 13, 2025

Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript

Go 572 19 Updated Jul 2, 2024
Python 1 Updated Aug 9, 2024

Interview Prep

1 Updated Feb 8, 2024

An app for finger vein scanning device management and authentication

JavaScript 2 Updated Jul 17, 2022

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 10,150 1,533 Updated Jan 13, 2025

Implementation of Prompt-to-Prompt Image Editing with Cross Attention Control

Python 12 1 Updated Apr 5, 2023

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 141,204 28,283 Updated Mar 14, 2025
Next