8 releases
| 0.0.8 | Jan 20, 2026 |
|---|---|
| 0.0.7 | Jan 20, 2026 |
| 0.0.6 | Dec 30, 2025 |
| 0.0.1 | Nov 13, 2025 |
#228 in Biology
47 downloads per month
Used in gxf2bed
190KB
3.5K
SLoC
genepred
a port for the GenePred format in Rust
docs .
usage .
features .
examples
Overview
This library provides a port to read genomic interval data in BED, GTF, and GFF (+ gz/zst/bz2) formats, representing them all as GenePred records.
Quick Start
Add this to your Cargo.toml:
[dependencies]
genepred = "0.0.8"
# Optional features
genepred = { version = "0.0.8", features = ["gzip", "zstd", "bz2", "mmap", "rayon"] }
Usage
// Enable both "rayon" and "mmap" features in Cargo.toml
use genepred::{Reader, Bed12, Gtf};
use rayon::prelude::*;
fn main() -> Result<(), Box<dyn std::error::Error>> {
// Parallel processing of BED files, includes .gz/.zst/.bz2 files
let bed_reader = Reader::<Bed12>::from_mmap("data/large.bed")?;
if let Ok(records) = bed_reader.par_records() {
let count = records
.filter_map(Result::ok)
.filter(|r| r.strand.map(|s| s.is_plus()).unwrap_or(false))
.count();
println!("Found {} records on plus strand", count);
}
// Parallel processing of GTF files, includes .gz/.zst/.bz2 files
let gtf_reader = Reader::<Gtf>::from_mmap("data/annotations.gtf")?;
if let Ok(records) = gtf_reader.par_records() {
let total_exons: usize = records
.filter_map(Result::ok)
.map(|r| r.exon_count())
.sum();
println!("Total exons: {}", total_exons);
}
Ok(())
}
Features
mmap: Enable memory-mapped file support (addsmemmap2dependency)rayon: Enable parallel processing (addsrayondependency)gzip: Enable gzip support (addsflate2dependency)zstd: Enable zstd support (addszstddependency)bz2: Enable bzip2 support (addsbzip2dependency)
Dependencies
~0.1–1.1MB
~20K SLoC