- Shanghai Jiao Tong University
- Shanghai
- whlzy.github.io
- @zeyu_whlzy
Stars
A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)
This tool uses AI to evaluate your pronunciation.
Sample code for the Microsoft Cognitive Services Speech SDK
Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.
Fixes the following prompts that Cursor shows during the free trial: You've reached your trial request limit. / Too many free trial accounts used on this machine. Please upgrade to pro. We have this limit in place to prevent abuse. Please l…
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…
Multilingual Voice Understanding Model
This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Azure OpenAI code resources for using gpt-4o-realtime capabilities.
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Translate your application with Languine CLI powered by AI.
End-to-end stack for WebRTC. SFU media server and SDKs.
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web without worrying about infrastructure.
Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
There can be more than Notion and Miro. AFFiNE (pronounced [ə'fain]) is a next-gen knowledge base that brings planning, sorting and creating all together. Privacy first, open-source, customizable an…
📨 The ultimate social media scheduling tool, with a bunch of AI 🤖
A privacy-first, self-hosted, fully open-source personal knowledge management tool, written in TypeScript and Go.
Repository for the UI part of a chat app built with Flutter (YouTube series)
WhatsApp Clone is an in-depth walkthrough of building a full-stack, mobile, hybrid web application from scratch using React Native, ReactJs, Typescript, NodeJs, ExpressJs, Mon…
https://www.tortilla.academy/Urigo/WhatsApp-Clone-Tutorial
Perplexica is an AI-powered search engine and an open-source alternative to Perplexity AI.