MyGit

TEN-framework/TEN-Agent

Fork: 319 Star: 3355 (更新于 2024-12-18 12:29:30)

license: Apache-2.0

Language: Python .

TEN Agent is a conversational AI powered by TEN, integrating Gemini 2.0 Multimodal Live API, OpenAI Realtime API, RTC, and more. It offers real-time capabilities to see, hear, and speak, along with advanced tools like weather checks, web search, and RAG.

最后发布版本: 0.6.2 ( 2024-12-09 14:00:55)

官方网址 GitHub网址

TEN Agent banner

Follow on X Discussion posts Commits Issues closed PRs Welcome GitHub license

Discord TEN Community

TEN-framework%2FTEN-Agent | Trendshift

GitHub watchers GitHub forks GitHub stars

README in English 简体中文操作指南 日本語のREADME README in 한국어 README en Español README en Français README in Italiano

Getting Started   •   Create Extensions   •   TEN Framework Repository


🌟 Gemini Multimodal Live API Extension with RTC

Usecases

agent.theten.ai

Try Google Gemini Multimodal Live API with realtime vision and realtime screenshare detection capabilities, it is a ready-to-use extension, along with powerful tools like Weather Check and Web Search integrated perfectly into TEN Agent.


TEN Agent Usecases

Usecases


Ready-to-use Extensions

Ready-to-use Extensions


TEN Agent Playground in Local Environment

Prerequisites

Category Requirements
Keys • Agora App ID and App Certificate (free minutes every month)
OpenAI API key
Deepgram ASR (free credits available with signup)
FishAudio TTS (free credits available with signup)
Installation Docker / Docker Compose
Node.js(LTS) v18
Minimum System Requirements • CPU >= 2 Core
• RAM >= 4 GB

macOS: Docker setting on Apple Silicon

For Apple Silicon Macs, uncheck "Use Rosetta for x86/amd64 emulation" in Docker settings. Note: This may result in slower build times on ARM, but performance will be normal when deployed to x64 servers.

Docker Setting


Next step

1. Create .env file

cp ./.env.example ./.env

2. Setup Agora App ID and App Certificate in .env

AGORA_APP_ID=
AGORA_APP_CERTIFICATE=

3. Start agent development containers

docker compose up -d

4. Enter container

docker exec -it ten_agent_dev bash

5. Build agent

task use

6. Start the web server

task run

7. Edit playground settings

Open the playground at localhost:3000 to configure your agent.

  1. Select a graph type (e.g. Voice Agent, Realtime Agent)
  2. Choose a corresponding module
  3. Select an extension and configure its API key settings

Module Example

Running Gemini Realtime Extension

Open the playground at localhost:3000.

  1. Select voice_assistant_realtime graph
  2. Choose Gemini Realtime module
  3. Select v2v extension and enter Gemini API key

Gemini Realtime Playground


TEN Agent Components

Components Diagram


Stay Tuned

Before we get started, be sure to star our repository and get instant notifications for all new releases!

TEN star us gif


Join Community


Star History

Star History Chart


Code Contributors

TEN


Contribution Guidelines

Contributions are welcome! Please read the contribution guidelines first.


License

This project is licensed under the Apache 2.0 License - see the LICENSE file for details.

最近版本更新:(数据更新于 2024-12-18 15:59:10)

2024-12-09 14:00:55 0.6.2

2024-11-18 10:37:55 0.6.1

2024-10-10 22:52:37 0.5.0

2024-09-05 08:58:16 v0.4.1

2024-08-19 10:46:36 v0.4.0

2024-08-15 10:18:20 v0.3.0

2024-08-14 23:19:45 v0.3.0-rc1

2024-07-26 14:02:11 v0.2.0

主题(topics):

agent, ai, asr, cpp, gemini, golang, gpt-4, gpt-4o, llm, low-latency, multimodal, nextjs14, openai, python, rag, real-time, realtime, tts, vision, voice-assistant

TEN-framework/TEN-Agent同语言 Python最近更新仓库

2025-01-18 21:26:31 sunnypilot/sunnypilot

2025-01-17 23:34:10 Skyvern-AI/skyvern

2025-01-17 19:49:33 ultralytics/ultralytics

2025-01-17 19:12:03 XiaoMi/ha_xiaomi_home

2025-01-17 08:27:45 comfyanonymous/ComfyUI

2025-01-17 04:56:19 QuivrHQ/MegaParse