Skip to content
View congzhang365's full-sized avatar

Highlights

  • Pro

Block or report congzhang365

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

2,153 261 Updated Jun 6, 2024

🤖 An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python 3.6 required.

Python 155 34 Updated Apr 2, 2025

A tool to detect potentially hallucinated or fabricated references in academic PDF papers.

Rust 171 18 Updated Apr 15, 2026

A Wes Anderson color palette for R

R 2,100 148 Updated Jun 13, 2024

A simple module to collect video, text, and metadata from Tiktok.

Python 450 55 Updated Oct 4, 2025

Friends don't let friends make certain types of data visualization - What are they and why are they bad.

R 7,042 285 Updated Sep 3, 2025

A list of papers for child ASR

52 6 Updated Oct 8, 2024

Convert questions from excel to QTI (Question Test Interoperability) format by pasting content in excel.

PHP 3 1 Updated Oct 4, 2019

a cheat-sheet for mathematical notation in code form

15,474 1,102 Updated Mar 8, 2022

The code for my bachelor thesis about pitch tracking with a LSTM neural network.

Python 2 Updated Oct 4, 2021

Make Praat Picture style plots of acoustic data

R 37 5 Updated Feb 4, 2026

Animation engine for explanatory math videos

Python 86,075 7,219 Updated Mar 26, 2026

Curve matching using Fréchet distance and Procrustes analysis in JS

TypeScript 150 17 Updated Jan 3, 2023

Chinese voice corpus. 中文语音语料,语音更加清晰自然,包含8个开源数据集,3200个说话人,900小时语音,1300万字。

727 125 Updated Jun 12, 2020

Multilingual G2P in 100 languages

Jupyter Notebook 382 33 Updated May 26, 2023

This repository contains scripts to build Youtube Gesture Dataset.

Python 132 17 Updated Nov 9, 2023

Automated Reproducible Acoustical Analysis

Python 164 20 Updated Aug 12, 2024

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

C 6,357 1,215 Updated Apr 6, 2026

A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)

Python 808 159 Updated Mar 25, 2026

COGS 532 - Theoretical Linguistics - Informatics Institute, METU

20 Updated Jan 9, 2025

COGS 543 - Computational Semantics

TeX 15 2 Updated Jan 28, 2024

The Munich Open-Source Large-Scale Multimedia Feature Extractor

C++ 799 100 Updated Jan 26, 2026

LeetCode Solutions: A Record of My Problem Solving Journey.( leetcode题解,记录自己的leetcode解题之路。)

JavaScript 55,791 9,427 Updated Jul 16, 2025

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,201 6,674 Updated Sep 30, 2025

The repo provides information about KeSpeech dataset.

174 12 Updated Oct 13, 2022

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 22,095 2,698 Updated Jan 23, 2026

Phonological CorpusTools

Python 121 17 Updated May 24, 2025

This repository contains data and code as well as instructions to reproduce our experiments in (Dominguez et al. 2018, accepted at LREC2018)

2 Updated Feb 19, 2018

Materials related to our Sinn und Bedeutung 23 paper

R 40 11 Updated May 28, 2020
Next