1 Star 0 Fork 0

Hugging Face 模型镜像/fish-agent-v0.1-3b

加入 Gitee
与超过 1200万 开发者一起发现、参与优秀开源项目,私有仓库也完全免费 :)
免费加入
该仓库未声明开源许可证文件(LICENSE),使用请关注具体项目描述及其代码上游依赖。
克隆/下载
贡献代码
同步代码
取消
提示: 由于 Git 不支持空文件夾,创建文件夹后会生成空的 .keep 文件
Loading...
README
CC-BY-4.0
--- tags: - audio-to-audio - text-to-speech - speech-to-text license: cc-by-nc-sa-4.0 language: - zh - en - de - ja - fr - es - ko - ar pipeline_tag: audio-to-audio inference: false extra_gated_prompt: >- You agree to not use the model to generate contents that violate DMCA or local laws. extra_gated_fields: Country: country Specific date: date_picker I agree to use this model for non-commercial use ONLY: checkbox --- # Fish Agent V0.1 3B **Fish Agent V0.1 3B** is a groundbreaking Voice-to-Voice model capable of capturing and generating environmental audio information with unprecedented accuracy. What sets it apart is its semantic-token-free architecture, eliminating the need for traditional semantic encoders/decoders like Whisper and CosyVoice. Additionally, it stands as a state-of-the-art text-to-speech (TTS) model, trained on an extensive dataset of 700,000 hours of multilingual audio content. This model is a continue-pretrained version of Qwen-2.5-3B-Instruct for 200B voice & text tokens. ## Supported Languages The model supports the following languages with their respective training data sizes: - English (en): ~300,000 hours - Chinese (zh): ~300,000 hours - German (de): ~20,000 hours - Japanese (ja): ~20,000 hours - French (fr): ~20,000 hours - Spanish (es): ~20,000 hours - Korean (ko): ~20,000 hours - Arabic (ar): ~20,000 hours For detailed information and implementation guidelines, please visit our [Fish Speech GitHub repository](https://github.com/fishaudio/fish-speech). ## Citation If you find this repository helpful in your work, please consider citing: ```bibtex @misc{fish-agent-0.1, author = {Shijia Liao and Tianyu Li and Rcell and others}, title = {Fish Agent V0.1 3B}, year = {2024}, publisher = {GitHub}, journal = {GitHub repository}, howpublished = {\url{https://github.com/fishaudio/fish-speech}} } ``` ## License This model and its associated code are released under the BY-CC-NC-SA-4.0 license, allowing for non-commercial use with appropriate attribution.

简介

Mirror of https://huggingface.co/fishaudio/fish-agent-v0.1-3b 展开 收起
CC-BY-4.0
取消

发行版

暂无发行版

贡献者

全部

近期动态

加载更多
不能加载更多了
马建仓 AI 助手
尝试更多
代码解读
代码找茬
代码优化
1
https://gitee.com/hf-models/fish-agent-v0.1-3b.git
[email protected]:hf-models/fish-agent-v0.1-3b.git
hf-models
fish-agent-v0.1-3b
fish-agent-v0.1-3b
main

搜索帮助