UI-TARS-desktop

An GUI Agent application based on UI-TARS(Vision-Lanuage Model) that allows you to control your computer using natural language.

最近更新: 9小时前

g3

G3 Project - Enterprise-oriented Generic Proxy Solutions

最近更新: 1天前

rdbStore

字节跳动鸿蒙生态数据库组件,支撑字节系鸿蒙应用数据库相关能力。

最近更新: 2天前

gpu4pyscf

A plugin to use Nvidia GPU in PySCF package

最近更新: 2天前

X-Dyna

[ArXiv 2024] X-Dyna: Expressive Dynamic Human Image Animation

最近更新: 3天前

Shot2Story

A new multi-shot video understanding benchmark Shot2Story20K with detailed shot-level captions and comprehensive video summaries.

最近更新: 3天前

DeepHall

Simulating the fractional quantum Hall effect with neural network variational Monte Carlo

最近更新: 3天前

pasa

PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search ...

最近更新: 6天前

vArmor

vArmor is a cloud native container sandbox based on LSM. It includes multiple built-in protection rules that are ready to use out of the box.

最近更新: 7天前

UI-TARS

最近更新: 7天前

sonic

A blazingly fast JSON serializing & deserializing library

最近更新: 7天前

Elkeid

Elkeid is a Cloud-Native Host-Based Intrusion Detection solution project to provide next-generation Threat Detection and Behavior Audition with mod...

最近更新: 7天前

1d-tokenizer

This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation

最近更新: 8天前

Agent-R

最近更新: 8天前

tarsier

最近更新: 9天前

VideoWorld

最近更新: 11天前

mona

mona is for developing merchant's app

最近更新: 11天前

arishem

A high performance and lightweight rule engine written by Golang.

最近更新: 11天前

Bytedance-UnionAD

Pod for Bytedance-UnionAD only support x86_64, armv7, arm64, i386.

最近更新: 11天前

Valley

Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.

最近更新: 12天前
成就
304
Star
73
Fork
成员(3)
551147 normalcoder 1578927376
诺墨
1305863 starryc 1594099416
嘻酱
7825243 dengyiyun 1599025613
长花天门冬

搜索帮助