#bioinformatics

  1. needletail

    FASTX parsing and k-mer methods

    v0.6.3 9.8K #bioinformatics #fasta #k-mer #fastq
  2. rust-lapper

    A fast and easy interval overlap library

    v1.2.0 48K #interval-tree #bioinformatics #tree
  3. rust-htslib

    HTSlib bindings and a high level Rust API for reading and writing BAM files

    v1.0.0 8.8K #bam #sequencing #bioinformatics #pileup
  4. bio

    A bioinformatics library for Rust. This library provides implementations of many algorithms and data structures that are useful for bioinformatics, but also in other fields.

    v3.0.0 14K #bioinformatics #continuous-integration #data-structures #bioinformatics-data-structures #algorithm #integration-tests
  5. noodles

    Bioinformatics I/O libraries

    v0.104.0 16K #bioinformatics #bam #sam #bio
  6. blutils-cli

    The CLI port of the blutils library

    v8.3.1 1.8K #blast #bioinformatics #ncbi #similarity-analysis #dna
  7. minimap2

    Bindings to libminimap2

    v0.1.30+minimap2.2.30 600 #genomics #bioinformatics #fasta #fastq
  8. bio-seq

    Bit packed and well-typed biological sequences

    v0.14.8 1.1K #genomics #bioinformatics #dna-sequence #k-mer #dna #bio
  9. rasusa

    Randomly subsample reads or alignments

    v2.2.2 750 #bioinformatics #fastq #coverage #subsampling #alignment
  10. bitnuc

    efficient nucleotide sequence manipulation using 2-bit and 4-bit encodings

    v0.4.1 1.1K #nucleotide-sequence #dna-sequence #bioinformatics #nucleotide #2bit #dna
  11. htsget-config

    Used to configure htsget-rs by using a config file or reading environment variables

    v0.21.0 #htsget #amazon-s3 #htsget-rs #config-file #location #server-authorization #bioinformatics #ticket #env-vars #storage-service
  12. rust-sasa

    RustSASA is a Rust library for computing the absolute solvent accessible surface area (ASA/SASA) of each atom in a given protein structure using the Shrake-Rupley algorithm

    v0.9.2 600 #bioinformatics #protein-structure #rust
  13. lrge

    Genome size estimation from long read overlaps

    v0.2.1 330 #genome-size-estimation #long-read-overlap #bioinformatics #genome-size #long-read
  14. perbase

    Fast and correct perbase BAM/CRAM analysis

    v1.4.0 #genomics #bioinformatics #depth #coverage
  15. htsget-search

    The primary mechanism by which htsget-rs interacts with, and processes bioinformatics files. It does this by using noodles to query files and their indices.

    v0.12.6 #htsget #bioinformatics #noodles #byte-range #htsget-rs #amazon-s3 #file-processing #bgzf #format-file
  16. finch

    min-wise independent permutation locality sensitive hashing ('MinHashing') for genomic data and command-line utility for manipulation

    v0.6.2 #bioinformatics #sketch #min-hash
  17. thirdkind

    Read phylogenetic tree(s) in newick, phyloXML or recPhyloXML file and build a svg representation of the tree(s) allowing 1, 2 or 3 reconciliation levels

    v3.13.5 #bioinformatics #newick #phylogeny #recphyloxml
  18. lowess

    (Locally Weighted Scatterplot Smoothing)

    v1.1.1 400 #smoothing #statistics #bioinformatics #regression
  19. atg

    Convert transcripts between different file formats

    v0.8.7 #genomics #bioinformatics #transcript #genetics #transcriptomics
  20. crispr_screen

    A fast and configurable differential expression analysis tool for CRISPR screens

    v0.3.21 1.3K #bioinformatics #sg-rna #crispr #degs #cli
  21. kira-qc

    FastQC-compatible QC tool written in Rust

    v0.1.0 #genomics #bioinformatics #fastq #quality-control #dna
  22. bed-reader

    Read and write the PLINK BED format, simply and efficiently

    v1.0.6 2.1K #genomics #bioinformatics #genotype #plink #snps
  23. ragc-cli

    Command-line interface for the ragc AGC genome compression tool

    v0.1.1 #genomics #bioinformatics #compression #agc
  24. grepq

    quickly filter fastq files

    v1.6.5 3.4K #bioinformatics #fastq #json
  25. sprocket

    A command line tool for working with Workflow Description Language (WDL) documents

    v0.21.1 120 #workflow-description-language #language-server #input-file #validation #command-line #lint #static-analysis #bioinformatics #lsp #web-api
  26. taxonomy

    Routines for loading, saving, and manipulating taxonomic trees

    v0.10.1 750 #bioinformatics
  27. hashfasta

    Very quickly compute hashes for FASTA/FASTQ files considering only the sequence content

    v1.0.0 #bioinformatics #fasta #hash #fastq #sequencing
  28. peacoqc-cli

    Command-line tool for PeacoQC flow cytometry quality control

    v0.2.0 #quality-control #flow-cytometry #bioinformatics #flow-control #command-line
  29. bqtools

    A command-line tool for interacting with BINSEQ file formats

    v0.5.3 #bioinformatics #sequencing #binseq #fastx #bio
  30. packed-seq

    Constructing and iterating packed DNA sequences using SIMD

    v4.4.1 #bioinformatics #dna #simd #packed-simd #bio
  31. primerpincer

    A CLI primer trimming tool for long-read sequencing data

    v0.8.0 #bioinformatics #sequencing #fastq #primers #microbiome
  32. jam-rs

    Just another (genomic) minhash (Jam) implementation in Rust

    v0.9.10 #bioinformatics #containment #metagenomics #jaccard
  33. simd-minimizers

    A SIMD-accelerated library to compute random minimizers

    v2.3.0 #bioinformatics #minimizers #bio #dna
  34. ibu

    high throughput binary encoding genomic sequences

    v0.2.1 300 #genomics #binary-encoding #bioinformatics #compression
  35. kira-nuclearqc

    Deterministic CLI for nuclear state and transcriptional plasticity analysis from 10x scRNA-seq MTX inputs

    v0.1.1 #bioinformatics #cell #quality-control #oncology #nucleus
  36. lightmotif

    A lightweight platform-accelerated library for biological motif scanning using position weight matrices

    v0.10.0 1.0K #genomics #bioinformatics #matrix #motif #pssm
  37. fastLowess

    High-performance LOWESS (Locally Weighted Scatterplot Smoothing)

    v1.1.2 #fastlowess #statistics #smoothing #lowess #bioinformatics #regression
  38. nwr

    nwr is a command line tool for newick and taxonomy

    v0.8.5 2.0K #newick #taxonomy #command-line-tool #testing #assembly #ncbi #bioinformatics #phylogenetic-tree
  39. loess-rs

    LOESS (Locally Estimated Scatterplot Smoothing) implementation in Rust

    v0.2.2 #smoothing #statistics #bioinformatics #regression
  40. kractor

    Extract reads from a FASTQ file based on taxonomic classification via Kraken2

    v4.0.0 #bioinformatics #fastq #metagenomics #taxonomy #kraken
  41. sequintools

    A suite of tools for manipulating and reporting on NGS data that has sequins added to the sample

    v0.11.1 #sample #sequencing #added #ngs #reporting #input-file #docker #bioinformatics #file-data
  42. sassy

    Approximate string matching using SIMD

    v0.1.10 100 #bioinformatics #fuzzy-search #string-search #simd #search
  43. htsget-storage

    Storage interfaces and abstractions for htsget-rs

    v0.5.6 #htsget #amazon-s3 #bioinformatics #local-storage #htsget-rs #cloud-storage #storage-abstraction #local-filesystem #data-url #storage-interface
  44. fastx

    reads Fasta and FastQ files with little overhead

    v0.6.1 #bioinformatics #fasta #fastq #sequencing #genome
  45. crankshaft

    A headless task execution engine that supports local, cloud, and HPC

    v0.8.0 #task-execution #task-execution-engine #headless #hpc #cloud #bioinformatics
  46. quadrank

    Fast rank over binary and size-4 DNA alphabets

    v0.2.0 #bioinformatics #data-structures #prefetch #rank #succinct
  47. flow-fcs

    High-level Flow Cytometry Standard (FCS) file struct and operations

    v0.2.1 #flow-cytometry #fcs #bioinformatics #parser
  48. rumina

    High-throughput UMI-aware deduplication of next-generation sequencing data

    v0.99.5 600 #bam #deduplicate #barcode #sequencing #bioinformatics
  49. simdna

    High-performance SIMD-accelerated DNA sequence encoding supporting all IUPAC nucleotide codes

    v1.0.2 #bioinformatics #iupac #dna
  50. light_phylogeny

    Methods and functions for phylogeny

    v2.8.6 #bioinformatics #phylogeny #phylogenetic-evolution #recphyloxml #evolution #phylogenetic
  51. seqsizzle

    A pager for viewing FASTQ and FASTA files with fuzzy matching, allowing different adaptors to be colored differently

    v0.4.1 #bioinformatics #fastq #pager #visualization #genomic-sequencing
  52. modtector

    A high-performance modification detection tool in Rust

    v0.15.4 #bioinformatics #rna #detect #htslib #modification
  53. ontolius

    A fast and safe crate for working with biomedical ontologies

    v0.7.2 160 #bioinformatics #ontology #hpo #m-ax-o #go
  54. minimap2-temp

    Bindings to libminimap2

    v0.1.33+minimap2.2.28 190 #bioinformatics #fasta #fastq
  55. kun_peng

    Kun-peng: an ultra-fast, low-memory footprint and accurate taxonomy classifier for all

    v0.7.5 480 #bioinformatics #metagenomics #microbiome #exposome
  56. wdl-ast

    An abstract syntax tree for Workflow Description Language (WDL) documents

    v0.21.1 210 #ast #workflow-description-language #sprocket #document #immutability #bioinformatics
  57. fasten

    A set of scripts to run basic analysis on fastq files

    v0.9.0 #bioinformatics #fastq #streaming
  58. biometal

    ARM-native bioinformatics library with streaming architecture and evidence-based optimization

    v1.10.0 #genomics #bioinformatics #neon #arm
  59. seqtkrs

    reimplementation of seqtk, a fast and lightweight tool for processing biological sequences in FASTA/FASTQ format

    v0.1.1 #genomics #fasta-sequence #bioinformatics #bioinformatics-sequence #fasta #fastq
  60. atria-rs

    running the Ablatio Triadum (ATria) centrality algorithm (Cickovski et al, 2015, 2017)

    v1.3.0 #graph #centrality #algorithm #bioinformatics #pluma
  61. intspan

    Command line tools for IntSpan related bioinformatics operations

    v0.8.7 #testing #spanr #bioinformatics #int-spans #range
  62. wdl-grammar

    A parse tree for Workflow Description Language (WDL) documents

    v0.21.0 460 #workflow-description-language #diagnostics #syntax-tree #document-parser #sprocket #parse-tree #cst #bioinformatics #rowan #command-line-task
  63. barbell

    Extremely fast and accurate Nanopore demultiplexing

    v0.3.1 #bioinformatics #nanopore #demultiplexing #ont
  64. rust-bio-tools

    A set of fast and robust command line utilities for bioinformatics tasks based on Rust-Bio

    v0.42.2 4.1K #bioinformatics #genomics #bam #fastq #rust-bio #fuzzy-matching #csv #html-reports #bcf #plot
  65. flow-plots

    Package for drawing and interacting with plots in flow cytometry data

    v0.2.1 #plot #flow-cytometry #bioinformatics
  66. sigalign

    A Similarity-Guided Alignment Algorithm

    v0.4.2 #bioinformatics #nucleotide #bio
  67. orphos-cli

    Command-line interface for Orphos, a tool for finding protein-coding genes in microbial genomes

    v0.1.0 #genomics #bioinformatics #prodigal #bio
  68. kira-ls-aligner

    Unified short- and long-read sequence aligner written in Rust 2024. It combines minimap2-style minimizers and chaining with BWA-MEM2-style exact-match anchoring and output semantics…

    v0.1.1 #genomics #sam #bioinformatics #sequence-alignment #bio #dna
  69. fastq-fix-i5

    Rewrite FASTQ headers by reverse-complementing only the i5 (P5/index2) part in ...:i7+i5

    v1.0.0 #fastq #bioinformatics #sequencing
  70. hgvs-weaver

    High-performance HGVS variant mapping and validation engine

    v0.2.0 #genomics-variant #genomics #hgvs #variant #bioinformatics
  71. kmerust

    A fast, parallel k-mer counter for DNA sequences in FASTA and FASTQ files

    v0.3.2 #genomics #bioinformatics #k-mer #fasta #dna
  72. voronota-ltr

    Voronota-LT is an alternative version of Voronota for constructing tessellation-derived atomic contact areas and volumes

    v0.1.0 #bioinformatics #tessellation #molecule #sasa #voronoi
  73. seqkmer

    High-performance FASTA/FASTQ IO and minimizer-based k-mer analysis utilities for Rust bioinformatics pipelines

    v0.1.5 #genomics #bioinformatics #dna-sequence #k-mer #dna
  74. sgcount

    A fast and flexible sgRNA counter

    v0.1.35 1.0K #bioinformatics #crispr-sequencing #sequencing #crispr #sg-rna
  75. gdock

    Fast information-driven protein-protein docking using genetic algorithms and physics-based scoring

    v2.0.0-alpha.3 #bioinformatics #docking #genetics #molecular #protein
  76. kira-protein-pruner

    High-performance protein analysis tools

    v0.1.1 #protein #bioinformatics #proteomics #cli
  77. wdl-analysis

    Analysis of Workflow Description Language (WDL) documents

    v0.17.2 190 #workflow-description-language #document-analysis #sprocket #workflow-analysis #language-analysis #bioinformatics #lsp #command-line-task #language-server-protocol #input-file
  78. noodles-vcf

    Variant Call Format (VCF) reader and writer

    v0.83.0 18K #variant-call-format #bioinformatics #bio
  79. librna-sys

    Low-level bindings for the ViennaRNA library

    v0.3.1 #bioinformatics #rna #viennarna #ffi #sys
  80. oxbow

    Read conventional genomic file formats as data frames and more via Apache Arrow

    v0.5.1 #genomics #bioinformatics #dataframe #ngs #arrow
  81. salti

    A modern, fast, multiple sequence alignment browser - built for the terminal

    v0.5.1 #bioinformatics #ratatui #msa
  82. compact-genome

    Representation of genomes

    v12.5.0 260 #string-representation #genome #bioinformatics #compact-representation #compact-string
  83. fastars

    Ultra-fast QC and trimming for short and long reads

    v0.1.0 #bioinformatics #trimming #qc #fastq #ngs
  84. gffx

    An ultra-fast and memory-efficient toolkit for querying GFF files, written with Rust

    v0.4.0 500 #genomics #bioinformatics #gff-file
  85. bio-forge

    A pure Rust library and CLI for the automated repair, preparation, and topology construction of biological macromolecules

    v0.4.0 #bioinformatics #pdb #proteins #nucleic-acids #chemoinformatics #bio
  86. fxtools

    A collection of commandline Fasta/Fastq utility tools

    v0.2.39 2.3K #bioinformatics #sequencing
  87. kira-biodata-manager

    Reproducible bio-data manager with a project-local store and a shared global cache. kira-bm it's like npm/cargo/pip for bioinformatics.

    v0.1.1 #bioinformatics #genomics #pdb #proteomics #dna
  88. wdl-format

    Formatting of WDL (Workflow Description Language) documents

    v0.15.1 210 #formatting #workflow-description-language #sprocket #document #validation #bioinformatics
  89. pdbrust

    A comprehensive Rust library for parsing and analyzing Protein Data Bank (PDB) files

    v0.7.0 #protein-pdb #protein #bioinformatics #chemistry #pdb #structural-biology
  90. kira-mmcif

    Low-level, streaming mmCIF parser focused on protein coordinates

    v0.1.1 #bioinformatics #proteomics #proteins #mmcif #gemmy
  91. peacoqc-rs

    PeacoQC quality control algorithms for flow cytometry

    v0.2.1 #quality-control #flow-cytometry #bioinformatics
  92. htsget-http

    handling HTTP in htsget-rs

    v0.8.5 #htsget #htsget-rs #web-server #convert #error-handling #json-response #bioinformatics
  93. tgv

    Explore genomes in the terminal. Light, blazing fast 🚀, vim-motion.

    v0.1.0 490 #genome #vcf #bam-file #explore #light #local-cache #ratatui #bioinformatics #bed
  94. fasterp

    High-performance FASTQ preprocessing tool - often faster than fastp with the same interface

    v0.2.1 #genomics #bioinformatics #quality-control #fastq
  95. twitcher

    Find template switch mutations in genomic data

    v0.1.3 #genome #bioinformatics #htslib #mutation
  96. sequenceprofiler

    sequence similarity based on identity kmers and all sequence profiling under one rust crate

    v0.4.0 #genome #k-mer #fasta #identity #graphs #jellyfish #profiling #bioinformatics #sam
  97. fastLoess

    High-level, parallel LOESS (Locally Estimated Scatterplot Smoothing) implementation in Rust

    v0.1.0 #fastloess #smoothing #statistics #loess #bioinformatics #regression
  98. parasail-rs

    Rust bindings and wrapper for parasail, a SIMD C library for pairwise sequence alignment

    v0.9.0 #sequence-alignment #bioinformatics #rust #ffi
  99. efficient_pca

    Principal component computation using SVD and covariance matrix trick

    v0.1.8 2.7K #genomics #pca #bioinformatics #svd #machine-learning
  100. wham

    weighted histogram analysis method

    v1.1.4 120 #statistics #histogram #molecular-dynamics #math #bioinformatics #math-statistics
  101. phylo

    An extensible Phylogenetics library written in rust

    v2.0.1 1.0K #phylogenetic-tree #phylogenetic #bioinformatics
  102. flow-gates

    Package for drawing and interacting with gates in flow cytometry data

    v0.2.2 #gates #flow-cytometry #bioinformatics #cytometry #flow
  103. mbf-fastq-processor

    The fast, reliable multitool of FASTQ processing

    v0.8.1 #bioinformatics #fastq #sequencing
  104. deacon

    Fast DNA search and [host] depletion using minimizers

    v0.13.2 #genomics #bioinformatics #filtering #decontamination
  105. dreid-forge

    A pure Rust library and CLI that automates DREIDING force field parameterization by orchestrating structure repair, topology perception, and charge calculation for both biological and chemical systems

    v0.3.0 #force-field #bioinformatics #molecular-dynamics #dreiding #chemoinformatics
  106. ARGenus

    ARG detection and genus-level classification using flanking sequence analysis

    v0.2.1 #argenus #bioinformatics #genomics #arg #metagenomics #amr
  107. biotools

    bioinformatics CLI tools for sequence analysis and manipulation

    v1.1.0 600 #sequence-analysis #bioinformatics
  108. wdl-doc

    Documentation generator for Workflow Description Language (WDL) documents

    v0.12.1 140 #documentation-generator #workflow-description-language #sprocket #header #bioinformatics
  109. rust_deseq2

    A pure Rust implementation of DESeq2 for differential expression analysis of RNA-seq data

    v0.1.0 #rna-seq #genomics #bioinformatics
  110. nohuman

    Remove human reads from a sequencing run

    v0.5.0 #bioinformatics #metagenomics #contamination
  111. securiety

    parsing and validating CURIE IDs

    v0.2.8 #validation #ontology #bioinformatics #curie #parser
  112. microBioRust

    Microbiology friendly bioinformatics Rust functions

    v0.1.3 #microbiorust #genomics #bioinformatics #sequence-analysis #bio
  113. redicat

    RNA Editing Cellular Assessment Toolkit: A highly parallelized utility for analyzing RNA editing events in single-cell RNA-seq data

    v0.4.1 #rna-seq #single-cell #bioinformatics #rna-editing
  114. sview-fmindex

    FM-index library with slice view architecture for efficient text indexing and pattern matching

    v0.1.2 #fm-index #pattern-matching #bioinformatics #fmindex #burrows-wheeler
  115. dnacomb

    Count the occurances of structured sequence reads and compare to an expected library

    v0.5.0 #bioinformatics #read #sequencing #library #bio
  116. haddock-restraints

    Generate restraints to be used in HADDOCK

    v0.11.1 #bioinformatics #restraints #haddock #structural-biology #protein-docking
  117. nafcodec

    Rust coder/decoder for Nucleotide Archive Format (NAF) files

    v0.3.1 420 #archive #bioinformatics #nucleotide #biology
  118. ref-solver

    Solve reference genome identification from BAM/SAM headers

    v0.1.0 #genomics #sam #bam #bioinformatics #reference-genome
  119. lightmotif-tfmpvalue

    Rust reimplementation of TFMPvalue for the lightmotif crate

    v0.10.0 650 #bioinformatics #pssm #pvalue #motif #bio
  120. rebgzf

    Efficient gzip to BGZF transcoder using Puffin-style half-decompression

    v0.1.0 #gzip-deflate #gzip #deflate #bioinformatics #transcoding
  121. ontology-registry

    that lets you download, store and register ontologies

    v0.3.6 160 #ontology #obo #bioinformatics #obolibrary #hpo
  122. seqwish

    A variation graph inducer - build pangenome graphs from pairwise alignments

    v0.1.3 #genomics #bioinformatics #variation-graph #pangenome #gfa
  123. single_rust

    Single-cell analysis in Rust

    v0.5.8 #single-cell #bioinformatics #differential-expression-analysis #data-analysis #sparse-matrix #metrics #quality-control #gene #pca #qc
  124. sweepga

    Efficient pangenome alignment filtering and sparsification tool

    v0.1.1 #genomics #bioinformatics #pangenome #alignment #filtering
  125. seq_io_parallel

    A map-reduce style parallel extension to seq_io

    v0.2.1 700 #bioinformatics #fastq #map-reduce #fasta
  126. ensemblcov

    human genomics

    v0.4.0 #genomics #auto-generate #human #extract #bioinformatics #differential-expression #rustup-component-add #ensembl #rustup-target-add #multi-threading
  127. noodles-csi

    Coordinate-sorted index (CSI) format reader and writer

    v0.53.0 22K #bioinformatics #noodles #csi #bio
  128. ska

    Split k-mer analysis

    v0.5.1 #genomics #bioinformatics #k-mer #genomics-sequencing #sequencing
  129. odgi-ffi

    A safe, ergonomic Rust wrapper for the odgi pangenome graph tool

    v1.1.3 390 #graph #pangenome #odgi #bioinformatics #api-bindings
  130. kira-fastq

    High-performance FASTQ reader with mmap-first design. Supports plain, gzip, and BGZF inputs; optional multi-line parsing; and explicit paired-end reading.

    v0.1.0 #genomics #bioinformatics #sequence-alignment
  131. doiTAG

    doiTAG for sequence DOIs

    v0.3.0 #doitag #doi #sequence #bioinformatics #command #fasta #debugging #gene
  132. vcf-reformatter

    Fast VCF file parser and reformatter with VEP and SnpEff annotation support which can output to MAF

    v0.3.0 #vcf #bioinformatics #vep #maf #snpeff
  133. noodles-bcf

    Binary Call Format (BCF) reader and writer

    v0.81.0 2.2K #bioinformatics #binary-call-format #noodles #bio
  134. wdl-lsp

    Language Server Protocol implementation for WDL

    v0.17.1 140 #language-server-protocol #workflow-description-language #sprocket #document #workflow-analysis #lsp #bioinformatics #linting #analyzing
  135. atglib

    handle transcripts for genomics and transcriptomics

    v0.2.3 500 #genomics #bioinformatics #transcript #transcriptomics #genetics
  136. wdl-engine

    Execution engine for Workflow Description Language (WDL) documents

    v0.12.1 140 #workflow-description-language #directory #sprocket #input-file #document #workflow-engine #bioinformatics #docker-container #command-line-task #shell-completion
  137. noodles-sam

    Sequence Alignment/Map (SAM) format reader and writer

    v0.81.0 5.0K #sam #sequence-alignment #bioinformatics #bio
  138. selexqc

    High-performance parallel RNA Capture-SELEX library quality control

    v0.1.0 #bioinformatics #fasta-sequence #bioinformatics-sequence #fasta #fastq #selex
  139. biodiff

    Compare binary files using sequence alignment

    v1.2.1 700 #sequence-alignment #diff #text-search #binary #ascii #binary-diff #cursive #wfa2 #biology #bioinformatics
  140. sourmash

    tools for comparing biological sequences with k-mer sketches

    v0.21.0 #bioinformatics #min-hash #minhash
  141. refman

    A command-line manager for bespoke reference datasets used in bioinformatic analyses

    v1.3.4 480 #bioinformatics #reproducibility #science
  142. libsais

    Bindings to the C library libsais for suffix array construction

    v0.2.0 #suffix-array #bioinformatics #bwt #saca #lcp
  143. noodles-tabix

    Tabix (TBI) format reader and writer

    v0.59.0 20K #bioinformatics #genomics #noodles #bio
  144. seq-hash

    A SIMD-accelerated library to compute hashes of DNA sequences

    v0.1.2 #bioinformatics #dna #bio
  145. slow5

    interacting with slow5

    v0.12.1 800 #genomics #record #serialization #zstd #zlib-ng #compression #bioinformatics #computational-biology
  146. fxread

    A barebones fastx reader for rust

    v0.2.14 1.5K #bioinformatics #dna #sequencing
  147. tsg

    Deep Learning Processing Library for Biological Data

    v0.1.9 750 #transcript-segment #biological-data #bioinformatics
  148. iitree-rs

    Implicit augmented interval tree (IAITree/cgranges) with memory-mapped disk storage

    v0.1.1 #genomics #interval-tree #bioinformatics #data-structures
  149. rvdna

    — AI-native genomic analysis and the .rvdna file format. Variant calling, protein prediction, and HNSW vector search in pure Rust with WASM support.

    v0.1.0 #genomics #bioinformatics #vector-search #dna
  150. deepbiop

    Deep Learning Processing Library for Biological Data

    v0.1.16 #deep-learning #bioinformatics #biological-data
  151. bamslice

    Extract byte ranges from BAM files and convert to interleaved FASTQ format for parallel processing

    v0.1.7 #bam #genomics #bioinformatics #fastq
  152. orphos-core

    Core library for Orphos, a tool for finding protein-coding genes in microbial genomes

    v0.1.0 #genomics #bioinformatics #prodigal
  153. thaf

    Extracts transcript sequences and gene maps from genome FASTA files using GFF3 annotations

    v0.0.5 #genome #bioinformatics #fasta #gff3 #transcriptome
  154. raxtax

    k-mer-based non-Bayesian Taxonomic Classifier

    v1.5.0 #classification #k-mer #taxonomic #database #query #bioinformatics
  155. noodles-cram

    CRAM format reader and writer

    v0.88.0 2.4K #bioinformatics #sam #bio
  156. rust-featurecounts

    A fast feature counting tool for prokaryotic RNA-seq analysis, compatible with featureCounts

    v0.1.0 #genomics #bioinformatics #rna-seq #prokaryote
  157. bgzip

    Rust implementation of bgzip

    v0.3.1 3.0K #gzip #compression #bioinformatics
  158. nucleaze

    Read filtering using k-mers

    v1.4.0 #k-mer #reference #fasta #filtering #matching #input-file #serialization #bioinformatics #brian #fastq
  159. lightmotif-io

    Parser implementations of several formats for the lightmotif crate

    v0.10.0 500 #bioinformatics #meme #motif #transfac #parser
  160. virust-splicing

    High-performance command-line program for analyzing HIV-1 splicing patterns from NGS data

    v0.2.1 #bioinformatics #splicing #ngs #hiv #virology
  161. sketchlib

    Genome and amino-acid sketching

    v0.2.4 550 #genomics #bioinformatics #genomics-sequencing #k-mer #sequencing #sketch
  162. minimap2-sys

    Bindings to libminimap2

    v0.1.30+minimap2.2.30 700 #bioinformatics #fasta #fastq
  163. infer_sex

    A high-performance, zero-dependency Rust library for inferring sex from variant data

    v0.1.2 2.2K #genomics #bioinformatics #sex #genetics
  164. fqtk

    A toolkit for working with FASTQ files

    v0.3.1 290 #genomics #bioinformatics
  165. tsg-cli

    analyze and manipulate transcript segment graph (TSG)

    v0.1.9 370 #biological-data #transcript-segment #bioinformatics
  166. nthash-rs

    Pure‑Rust port of ntHash

    v0.1.3 340 #bioinformatics #rolling-hash #k-mer #dna
  167. htsget-test

    Common test functions and utilities used by htsget-rs

    v0.7.1 1.2K #htsget #testing #htsget-rs #s3-storage #server #testing-http #mocking #aws-mocks #bioinformatics
  168. bamnado

    Tools and utilities for manipulation of BAM files for unusual use cases. e.g. single cell, MCC

    v0.4.5 #bam #bioinformatics #mcc #sc-atac-seq
  169. genepred

    port for the GenePred format in Rust

    v0.0.8 #bed #transcript #bioinformatics
  170. kira-protein-longevity-analysis

    CLI tool for physics-informed protein robustness and fragility analysis under environmental conditions (pH, oxidative stress, temperature)

    v0.1.0 #bioinformatics #proteomics #protein #bio
  171. htsget-lambda

    A cloud-based instance of htsget-rs using AWS Lambda, which serves data according to the htsget protocol

    v0.7.11 #aws-lambda #htsget #htsget-rs #cloud-based #instance #bioinformatics #amazon-s3
  172. simd-sketch

    A SIMD-accelerated library to compute a b-bit bottom-h sketch

    v0.3.0 170 #bioinformatics #min-hash #sketch #dna #bio
  173. crankshaft-monitor

    Crankshaft monitor server

    v0.2.0 #crankshaft #task-execution #cloud #headless #bioinformatics #hpc #bug-report
  174. genedex

    A small and fast FM-Index implementation

    v0.2.2 110 #bioinformatics #fm-index #fmindex #text-index #bio
  175. bsalign

    Rust bindings for the bsalign C library

    v0.7.0 950 #sequence-alignment #poa #bioinformatics #api-bindings
  176. parfait-gfa

    gfa v1/v2 parser and validator

    v0.1.3 #bioinformatics #gfa #parser
  177. noodles-util

    noodles support utilities

    v0.73.0 240 #bioinformatics #bam #noodles #bio
  178. bedrs

    Genomic interval library in rust

    v0.2.26 490 #range #bioinformatics #bed
  179. stats_on_gff3_ncbi

    Calculate statistics such as CDS GC3 ratio, intron GC ratio, flanking gene region GC ratio, first intron length, number of introns, CpG ratio, etc

    v0.1.52 1.8K #bioinformatics #gff3 #fasta
  180. coprosize

    coprolite research (paleontology and archaeology): estimate the producer's body mass based on coprolite diameter by the use of regression models

    v1.0.4 #bioinformatics #paleontology #biology #archaeology
  181. xpclrs

    A high-performance rust implementation of the XP-CLR method

    v1.0.0 #genome #bioinformatics #sequencing #selection
  182. pombase-gocam

    Parser for Gene Ontology Consortium GO-CAM JSON files

    v0.104.0 #bioinformatics #go-cam #gene-ontology
  183. fqgrep

    Search a pair of fastq files for reads that match a given ref or alt sequence

    v1.1.1 160 #genomics #bioinformatics #fastq
  184. noodles-gff

    Generic Feature Format (GFF) reader and writer

    v0.54.0 1.9K #bioinformatics #genomics #noodles
  185. kira-mitoqc

    Deterministic mitochondrial QC scoring for single-cell expression matrices

    v0.1.2 #bioinformatics #cell #bio #quality-control #mitochondria #oncology
  186. noodles-bed

    BED (Browser Extensible Data) reader and writer

    v0.30.0 3.4K #bioinformatics #noodles #bed #bio
  187. kira-proteoqc

    Deterministic, explainable proteostasis QC for single-cell expression data

    v0.1.0 #bioinformatics #cell #proteomics #oncology #sc-rna
  188. crankshaft-config

    Configuration facilities for Crankshaft

    v0.6.0 100 #crankshaft #config #task-execution #bioinformatics #headless #hpc #facilities
  189. bgzf

    working with explicitly BGZF compressed data

    v0.3.0 #genomics #bioinformatics #compression
  190. bsxplorer2

    A high-performance library for bisulfite sequencing data analysis and DNA methylation research

    v0.2.3 700 #genomics #bioinformatics #methylation #bisulfite #epigenetics
  191. rnapkin

    CLI utility for drawing RNA secondary structure

    v0.3.9 160 #secondary-structure #draw #rna #bioinformatics #fold
  192. phenopackets

    Rust bindings for Phenopacket Schema

    v0.2.2-post2 150 #ontology #phenotype #bioinformatics
  193. deepbiop-cli

    CLI tool for Processing Biological Data

    v0.1.16 #bioinformatics #deep-learning #biological-data
  194. prseq

    Rust tools (with Python bindings) for sequence analysis

    v0.0.33 #bioinformatics #sequence-analysis #fastq #fasta
  195. fastqgen

    Random paired fastq file generation. Nothing more, nothing less.

    v0.1.4 #bioinformatics #fastq #generate
  196. bam-builder

    easily building BAMs for testing

    v1.1.0 #bam #bioinformatics #htslib
  197. pluma-plugin-trait

    A trait defining the plugin interface for PluMA (Plugin-based Microbiome Analysis)

    v1.1.0 #traits #microbiome #bioinformatics #pluma #api-bindings
  198. stats_on_gff3

    Calculate statistics such as CDS GC3 ratio, intron GC ratio, flanking gene region GC ratio, first intron length, number of introns, CpG ratio, etc. Examples: stats_on_gff3 Homo_sapiens…

    v0.1.26 320 #bioinformatics #gff3 #fasta
  199. consalifold

    Consensus Secondary Structure Predictor Engaging Structural Alignment-based Error Correction

    v0.1.17 #secondary-structure #rna #bioinformatics
  200. noodles-gtf

    Gene Transfer Format (GTF) reader and writer

    v0.49.0 350 #bioinformatics #noodles #bio
  201. seqtable

    High-performance parallel FASTA/FASTQ sequence counter

    v0.1.1 #bioinformatics #fasta-sequence #fastq #fasta #parquet #bioinformatics-sequence
  202. kira_cdh_compat_fastq_reader

    Streaming FASTQ reader compatible with CD-HIT input handling (plain and .gz), safe idiomatic Rust API; sync and async

    v0.1.3 #fastq #bioinformatics #gzip #streaming #cd-hit
  203. kira_cdh_compat_clstr

    CD-HIT-compatible .clstr writer/reader and a semantic diff CLI

    v0.1.2 #semantic-diff #cluster #cd-hit #bioinformatics
  204. gia

    set theoretic operations of genomic intervals

    v0.2.23 650 #genomics #interval-set #bioinformatics #bed #set-interval
  205. genome-graph

    Representation of genome graphs

    v11.0.0 #genome #bioinformatics #graph-representation
  206. pa-types

    Core library types for global pairwise alignment

    v1.2.0 140 #pairwise-alignment #bioinformatics #library
  207. fastq_rs

    Multi purpose fastq toolkit

    v0.0.2 #bioinformatics #toolkit
  208. give_a_sheet

    Toolkit for generating input samplesheets for a variety of nf-core pipelines

    v0.1.6 190 #bioinformatics #reproducibility #data
  209. scidataflow

    A command-line tool to manage scientific research project data

    v0.8.11 290 #bioinformatics #data-science #reproducibility
  210. markov_genome

    Learn the properties of a FASTA sequence database and simulate sequences in a Markov process

    v0.1.0 #genomics #bioinformatics #bio #markov #binaries
  211. onecode

    Rust bindings for ONEcode - a data representation format for genomic data

    v0.1.0 #genomics #bioinformatics #api-bindings
  212. crankshaft-engine

    The core engine that comprises Crankshaft

    v0.9.0 170 #task-execution #execution-engine #crankshaft #cloud #headless #bioinformatics #hpc #comprises
  213. strobemers

    A toolkit for generating strobemers

    v0.1.1 #bioinformatics #biology
  214. rna-ss-params

    RNA secondary structure parameters

    v0.1.24 250 #secondary-structure #rna #bioinformatics
  215. skope

    Accelerated genome containment and abundance estimation

    v0.1.0 #genomics #bioinformatics
  216. phylotree

    deal with phylogenetic trees and distance matrices

    v0.1.3 #newick #bioinformatics #phylip #cli
  217. biosynth

    CLI: synthetic data generator for BioVault

    v0.1.19 #synthetic-data #bioinformatics #cli #data #synthetic
  218. disambiseq

    Create unambiguous one-off mismatch libraries for DNA sequences

    v0.2.2 150 #bioinformatics #mismatch #sequencing #dna
  219. flash-cli

    Command-line interface for the FLASH read merging algorithm

    v0.1.0 #bioinformatics #fastq #cli
  220. lightmotif-py

    PyO3 bindings and Python interface to the lightmotif crate

    v0.10.0 850 #bioinformatics #python-bindings #pssm
  221. genomicsqlite

    Genomics Extension for SQLite

    v0.9.4 #genomics #bioinformatics #sqlite-extension #database-extension #genomic-data
  222. kamino-cli

    Build phylogenomic datasets in seconds

    v0.7.0 #bioinformatics #k-mer #phylogenomic
  223. Try searching with DuckDuckGo.

  224. nucleob

    bioinformatics: nucleobases and amino acids statistics

    v1.1.1 #bioinformatics #dna-rna #genetics #dna #rna #protein
  225. hts-sys

    HTSlib bindings

    v2.2.0 6.3K #bam #bioinformatics #htslib #sequencing #pileup
  226. diced

    reimplementation of the MinCED algorithm for identifying CRISPRs in full or assembled genomes

    v0.1.2 150 #genomics #crispr #bioinformatics
  227. crankshaft-docker

    Docker facilities for Crankshaft

    v0.4.0 140 #task-execution #crankshaft #docker-client #headless #bioinformatics #hpc #facilities
  228. placecare

    A toolkit to quickly search for cis-acting regulatory elements using the PLACE database

    v1.1.0 390 #bioinformatics #sequence-search
  229. dabuild

    Access genome build metadata

    v0.3.1 #build #bioinformatics #reference #genome
  230. microbiorust-py

    Python bindings for microBioRust Microbiology friendly bioinformatics Rust functions

    v0.1.5 #genomics #bioinformatics #sequence-analysis #micro #bio
  231. ragc-core

    Core compression and decompression algorithms for the AGC genome compression format

    v0.1.1 #genomics #bioinformatics #compression #agc
  232. minimizer-iter

    Iterate over minimizers of a DNA sequence

    v1.2.1 #bioinformatics #iterator #minimizer #dna
  233. kira_cdh_compat_cluster

    Greedy clustering engine compatible with CD-HIT-like pipelines

    v0.1.0 #min-hash #bioinformatics #lsh #cd-hit
  234. wdl-cli

    Facilities for building command-line tools using the wdl crates

    v0.7.0 #sprocket #workflow-description-language #document #lint #rules #bioinformatics #input-file #language-server #facilities #command-line-task
  235. ebiotic

    interacting with common bioinformatics web services

    v0.0.26 750 #genomics #bioinformatics #biology #ncbi #ebi
  236. deepbiop-utils

    Deep Learning Preprocessing Library for Biological Data

    v0.1.16 #deep-learning #bioinformatics #biological-data
  237. flow-tru-ols

    TRU-OLS (Truncated ReUnmixing OLS) algorithm for flow cytometry unmixing

    v0.1.0 #flow-cytometry #bioinformatics #unmixing #cytometry #flow
  238. yacrd

    Using all-against-all read mapping, yacrd performs: computation of pile-up coverage for each read and detection of chimeras

    v0.6.2 #long-read #bioinformatics #chimera #scrubbing
  239. msafara

    View, edit, and explore multiple sequence alignments in your terminal

    v0.3.0 #bioinformatics #phylogenetic-tree #msa #bio #terminal
  240. pluma

    Plugin Interface for Rust - provides FFI bindings and utilities for writing PluMA plugins in Rust

    v1.0.0 #bioinformatics #microbiome #plugin #api-bindings
  241. kira_cdh_compat_kmer_indexer

    CD-HIT-compatible k-mer indexing (CD-HIT-NG) in Rust: fast, memory-efficient, mmap-ready

    v0.2.0 #bioinformatics #k-mer #fastq #cd-hit #index
  242. sais_drum

    SAIS algorithm for suffix array construction

    v0.1.1 #suffix-array #bioinformatics #saca #suffixarray #bio
  243. blutils-proc-adapter

    easier to run and analyze Blast results

    v8.3.1 1.8K #bioinformatics #blast #ncbi #similarity-analysis #dna
  244. vcf

    VCF Parser

    v0.6.1 #bioinformatics #parser