speech-to-speech

Here are 44 public repositories matching this topic...

ictnlp / LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

speech-to-text speech-to-speech large-language-models multimodal-large-language-models speech-language-model speech-interaction

Updated May 19, 2025
Python

IAHispano / Applio

Star

A simple, high-quality voice conversion tool focused on ease of use and performance.

text-to-speech ai voice speech pytorch tts rvc voice-conversion vc voice-cloning speech-to-speech vits voice-clone applio

Updated Jul 12, 2025
Python

opendilab / CleanS2S

Star

High-quality and streaming Speech-to-Speech interactive agent in a single file. 只用一个文件实现的流式全双工语音交互原型智能体！

python machine-learning streaming ai speech-synthesis speech-recognition speech-to-speech gpt-4o

Updated Jun 16, 2025
Python

VITA-MLLM / Freeze-Omni

Star

✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

speech speech-synthesis speech-recognition speech-to-speech large-language-models multimodal-large-language-models

Updated May 27, 2025
Python

amanvirparhar / weebo

Star

A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.

llama whisper kokoro speech-to-speech

Updated Jan 20, 2025
Python

MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not limited to end-to-end speech interaction, end-to-end speech translation and speech recognition.

speech-recognition speech-to-text speech-translation speech-to-speech large-language-models chatgpt gpt-4o speech-interaction

Updated Jan 8, 2025
Python

jesuscopado / samantha-os1-openai-realtime

Star

Samantha OS1 is a conversational AI assistant powered by the Realtime API from OpenAI

agent openai realtime-api speech-to-speech ai-agent

Updated Dec 27, 2024
Python

asiff00 / On-Device-Speech-to-Speech-Conversational-AI

Star

This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming architecture for fluid conversations with immediate responses and natural interruption handling.

tts vad audio-processing asr voice-assistant conversational-ai speech-to-speech ollama kokoro-tts

Updated Apr 17, 2025
Python

OpenBMB / UltraEval-Audio

Star

An easy-to-use, fast, and easily integrable tool for evaluating audio LLM

evaluation speech-recognition speech-to-text speech-to-speech

Updated Jul 15, 2025
Python

tarun7r / Vocal-Agent

Star

Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.

text-to-speech llama agents whisper kokoro speech-to-speech

Updated Jul 12, 2025
Python

taresh18 / conversify

Sponsor

Star

🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨

real-time webrtc tts speech-recognition speech-to-text stt conversational-ai speech-to-speech livekit llm

Updated Jun 25, 2025
Python

ictnlp / DASpeech

Star

Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".

machine-translation speech-translation speech-to-speech speech-to-speech-translation

Updated Jul 22, 2024
Python

codename0og / codename-rvc-fork-3

Star

Codename's rvc fork version 3, based on Applio.

text-to-speech ai voice speech pytorch tts rvc voice-conversion vc voice-cloning speech-to-speech vits applio retrieval-based-voice-conversion

Updated Jul 3, 2025
Python

liamdugan / speech-to-speech

Star

Code for the INTERSPEECH 2023 paper "Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models"

speech speech-processing speech-translation speech-to-speech simultaneous-translation

Updated Jan 14, 2025
Python

lugia19 / Echo-XI

Star

Speech to text to speech using Elevenlabs

python voice speech tts speech-synthesis speech-recognition speech-to-text speech-to-speech elevenlabs

Updated Jul 2, 2023
Python

winedarkmoon / ElevenGUI

Star

A user-friendly interface for ElevenLabs' API with added audio transcription capability.

python gui tts openai speech-to-text transcription speech-to-speech whisper-ai elevenlabs

Updated Jun 20, 2023
Python

PhamHuynhAnh16 / Vietnamese-RVC

Star

Dự án công cụ chuyển đổi giọng nói dành cho người Việt

Updated Jul 13, 2025
Python

mt-upc / iwslt-2022

Star

Systems submitted to IWSLT 2022 by the MT-UPC group.

translation adapters pretrained-models fine-tuning speech-translation speech-to-speech

Updated May 18, 2022
Python

Spac5y / Vocal-Agent

Star

A cutting-edge Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.

text-to-speech calendar email knowledgebase llama speech-to-text whisper vocal kokoro groq deepgram speech-to-speech

Updated Jul 15, 2025
Python

BlueBash / openai-realtime-api-demo

Star

An advanced speech-to-speech (S2S) voice assistant utilizing OpenAI’s Realtime API for ultra-low-latency, two-way audio streaming, real-time natural language understanding, and responsive, interactive dialogue through direct WebSocket communication.

python pyaudio websockets poetry openai wave realtime-api speech-to-speech

Updated Nov 4, 2024
Python

Improve this page

Add a description, image, and links to the speech-to-speech topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-to-speech topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-to-speech

Here are 44 public repositories matching this topic...

ictnlp / LLaMA-Omni

IAHispano / Applio

opendilab / CleanS2S

VITA-MLLM / Freeze-Omni

amanvirparhar / weebo

MooreThreads / MooER

jesuscopado / samantha-os1-openai-realtime

asiff00 / On-Device-Speech-to-Speech-Conversational-AI

OpenBMB / UltraEval-Audio

tarun7r / Vocal-Agent

taresh18 / conversify

ictnlp / DASpeech

codename0og / codename-rvc-fork-3

liamdugan / speech-to-speech

lugia19 / Echo-XI

winedarkmoon / ElevenGUI

PhamHuynhAnh16 / Vietnamese-RVC

mt-upc / iwslt-2022

Spac5y / Vocal-Agent

BlueBash / openai-realtime-api-demo

Improve this page

Add this topic to your repo