Showing 44 open source projects for "python data analysis"

  • 1
    Awesome Network Analysis

    A curated list of awesome network analysis resources

    awesome-network-analysis is a curated list of resources focused on network and graph analysis, including libraries, frameworks, visualization tools, datasets, and academic papers. It covers multiple programming languages and domains like sociology, biology, and computer science. This repository serves as a central reference for researchers, analysts, and developers working with network data.
    Downloads: 0 This Week
  • 2
    reticulate

    R Interface to Python

    reticulate is an R package from Posit that creates seamless interoperability between R and Python. It lets you call Python modules, classes, and functions from within R, automatically translating between R and Python data structures. Useful for combining Python tooling with R projects, data analysis, and RMarkdown reports.
    Downloads: 0 This Week
  • 3
    NYC Taxi Data

    Import public NYC taxi and for-hire vehicle (Uber, Lyft)

    The nyc-taxi-data repository is a rich dataset and exploratory project around New York City taxi trip records. It collects and preprocesses large-scale trip datasets (fares, pickup/dropoff, timestamps, locations, passenger counts) to enable data analysis, modeling, and visualization efforts. The project includes scripts and notebooks for cleaning and filtering the raw data, memory-efficient processing for large CSV/Parquet files, and aggregation workflows (e.g. trips per hour, heatmaps of pickups/dropoffs). ...
    Downloads: 2 This Week
  • 4
    ShinyItemAnalysis

    Test and Item Analysis via Shiny

    ...Item analysis with IRT models. Detection of differential item functioning. A number of toy datasets are available, and the interactive application also allows users to upload and analyze their own data and to automatically generate PDF or HTML reports. All methods include sample R code that is ready to copy, paste, and run locally in R.
    Downloads: 0 This Week
  • 5
    clusterProfiler

    A universal enrichment tool for interpreting omics data

    clusterProfiler is an R/Bioconductor package that provides a unified workflow for functional enrichment analysis to interpret high-throughput omics results. It supports both over-representation analysis and gene set enrichment analysis, letting you work with unranked gene lists or ranked statistics from differential pipelines. The package connects to multiple knowledge bases—such as Gene Ontology, KEGG, Reactome, Disease Ontology, MeSH and others—through a consistent interface so you can...
    Downloads: 1 This Week
  • 6
    ggstatsplot

    Enhancing {ggplot2} plots with statistical analysis

    {ggstatsplot} is an extension of the {ggplot2} package for creating graphics with details from statistical tests included in the information-rich plots themselves. In a typical exploratory data analysis workflow, data visualization and statistical modeling are two different phases: visualization informs modeling, and modeling in turn can suggest a different visualization method, and so on. The central idea of {ggstatsplot} is simple: combine these two phases into one in the form of graphics with statistical details, which makes data exploration simpler and faster. ...
    Downloads: 0 This Week
  • 7
    tidytext

    Text mining using tidy tools

    tidytext brings tidy data principles to text mining by converting text into a tidy data frame format. It provides tools for tokenization, sentiment analysis, n‑gram creation, and term‑document matrices, enabling interoperability with dplyr, ggplot2, and other tidyverse workflows.
    Downloads: 0 This Week
  • 8
    dplyr

    dplyr: A grammar of data manipulation

    dplyr is an R package that provides a consistent and intuitive grammar for data manipulation, enabling users to filter, arrange, summarize, and transform data efficiently. Part of the tidyverse ecosystem, dplyr simplifies complex data operations through a clear and readable syntax, whether working with data frames, tibbles, or databases. It is widely used in data science and statistical analysis workflows.
    Downloads: 1 This Week
  • 9
    Seurat

    R toolkit for single cell genomics

    Seurat is a comprehensive R toolkit for single-cell genomics analysis, introduced by the Satija Lab at NYGC. It supports quality control, normalization, clustering, integration of multimodal data (e.g., scRNA‑seq, spatial, CITE‑seq), and visualization. Seurat v5 introduces scalable workflows and spatial transcriptomics support. The toolkit is commonly used in academic and industry research for single-cell studies.
    Downloads: 3 This Week
  • 10
    forecast

    Forecasting Functions for Time Series and Linear Models

    The forecast package is a comprehensive R package for time series analysis and forecasting. It provides functions for building, assessing, and using univariate forecasting models (e.g. ARIMA and exponential smoothing), along with tools for automatic model selection, diagnostics, plotting, and forecasting future values. It's widely used in statistics, economics, business forecasting, environmental science, and other fields. Exponential smoothing state space models (ETS) including seasonal components. Residual...
    Downloads: 1 This Week
  • 11
    workflowr

    Organize your project into a research website

    workflowr is an R package that helps researchers organize, version, and share their data science projects in a reproducible and transparent manner. It combines R Markdown, Git, and a structured file system to create a research website that tracks analysis, results, and code changes over time. It’s ideal for academic and collaborative research workflows.
    Downloads: 0 This Week
  • 12
    LabPlot

    Data Visualization and Analysis

    LabPlot is a FREE, open source and cross-platform Data Visualization and Analysis software accessible to everyone.
    Downloads: 41 This Week
  • 13
    see

    Visualisation toolbox for beautiful and publication-ready figures

    see is an R package that serves as the visualization component of the easystats ecosystem, providing plotting utilities to produce publication-ready visualizations of statistical model parameters, diagnostics, predictions, and performance metrics. It works in conjunction with other easystats packages (such as parameters, performance, modelbased, bayestestR, etc.) to convert model outputs or summary objects into visual forms (dot-and-whisker plots, diagnostic plots, residual plots, etc.). It...
    Downloads: 0 This Week
  • 14
    broom

    Convert statistical analysis objects from R into tidy format

    broom is part of the tidymodels ecosystem that converts statistical model outputs (e.g. from lm, glm, t.test, lme4, etc.) into tidy tibbles — standardized data frames — using functions tidy(), glance(), and augment(). These are easier to manipulate, visualize, and report programmatically.
    Downloads: 0 This Week
  • 15
    psychmeta

    Psychometric meta-analysis toolkit

    The psychmeta package provides tools for computing bare-bones and psychometric meta-analyses and for generating psychometric data for use in meta-analysis simulations. Currently, the package supports bare-bones, individual-correction, and artifact-distribution methods for meta-analyzing correlations and d values. Please refer to the overview tutorial vignette for an introduction to psychmeta’s functions and workflows. psychmeta is hosted on both CRAN and GitHub.
    Downloads: 0 This Week
  • 16
    rmarkdown

    Dynamic Documents for R

    R Markdown is an R package for creating dynamic, reproducible documents that combine code (R, Python, SQL, etc.), results (figures, tables), and narrative text. Built on Knitr and Pandoc, it supports generating HTML, PDF, Word, slideshows, dashboards, and more. It’s widely used in data science and reproducible reporting workflows.
    Downloads: 7 This Week
  • 17
    mlr3

    mlr3: Machine Learning in R - next generation

    mlr3 is a modern, object-oriented R framework for machine learning. It provides core abstractions (tasks, learners, resamplings, measures, pipelines) implemented using R6 classes, enabling extensible, composable machine learning workflows. It focuses on clean design, scalability (large datasets), and integration into the wider R ecosystem via extension packages. Users can do classification, regression, survival analysis, clustering, hyperparameter tuning, benchmarking etc., often via...
    Downloads: 3 This Week
  • 18
    Statistical Rethinking 2024

    This course teaches data analysis

    The 2024 repository is the most recent version of the course, reflecting ongoing refinements in pedagogy, statistical modeling techniques, and coding practices. It provides updated notebooks, R scripts, and model examples, some streamlined and restructured compared to previous years. The 2024 repo also highlights the transition toward more robust Stan models and integration with newer Bayesian workflow practices, continuing to emphasize accessibility for learners while modernizing the tools....
    Downloads: 0 This Week
  • 19
    MHNs Data Science Examples

    Collection of data science examples.

    Downloads: 0 This Week
  • 20
    DiagrammeR

    Graph and network visualization using tabular data in R

    DiagrammeR is an R package to create, manipulate, and visualize network graphs, flowcharts, diagrams, and more using Graphviz and Mermaid syntax. Integrates with RMarkdown and Shiny apps, supports node/edge traversal, and graph analysis algorithms, making it ideal for documenting processes, causal relationships, or data pipelines.
    Downloads: 1 This Week
  • 21
    Statistical Rethinking 2023

    Statistical Rethinking Course for Jan-Mar 2023

    ...It continues to provide scripts for lectures and tutorials, while integrating refinements to examples, notation, and computational workflows introduced that year. Compared with 2022, some models are rewritten for clarity, and teaching materials reflect refinements in McElreath’s evolving presentation of Bayesian data analysis. Students following the 2023 lecture videos use this repository as their coding reference. There are 10 weeks of instruction. Links to lecture recordings will appear in this table. Weekly problem sets are assigned on Fridays and due the next Friday, when we discuss the solutions in the weekly online meeting.
    Downloads: 0 This Week
  • 22
    AI-Agent-Host

    The AI Agent Host is a module-based development environment.

    ...Being data-aware involves connecting a language model to other sources of data, enabling a comprehensive understanding and analysis of information.
    Downloads: 0 This Week
  • 23
    methylr

    a single shiny solution from sequencer data to pathway analysis

    Here we introduce methylR, a complete pipeline for the analysis of both 450K and EPIC Illumina arrays. It not only offers data visualization and normalization but also provides additional features, such as annotation of the genomic features resulting from the analysis, pairwise comparisons of DMCs with different graphical representations, and functional and pathway enrichment as downstream analysis, all packed in a minimal, elegant and intuitive graphical user interface for the analysis of array DNA methylation data.
    Downloads: 0 This Week
  • 24
    hui

    hewies user interface - 3D scientific visualisation tool

    A Python project that aims to provide a FOSS library to extract, analyse and visualise data in 3D. The instance will connect to a data source (ods sheet, csv, SQL DB, pyodbc), analyse and/or transform the data for the visualisation functionality, and then visualise the data in 3D, likely using third-party FOSS.
    Downloads: 0 This Week
  • 25
    activAnalyzer
    activAnalyzer is a Shiny app that has been developed to analyze daily physical behavior data recorded at the hip in adults using an ActiGraph accelerometer (.agd file from a GT3X, GT3X+, wGT3X+ or wGT3X-BT device). Once analysis is completed, the app allows exporting results to .csv files and generating a report of the measurement (in either an .html format or a .pdf format). All the configured inputs relevant for interpreting the results are recorded in the report.
    Downloads: 0 This Week
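
The NYC Taxi Data entry above mentions aggregation workflows such as trips per hour. As a rough illustration of that kind of aggregation, here is a minimal Python sketch using only the standard library. It is not taken from the nyc-taxi-data repository: the column name `pickup_datetime`, the timestamp layout, and the sample rows are all assumptions made up for this example.

```python
import csv
import io
from collections import Counter
from datetime import datetime


def trips_per_hour(rows, column="pickup_datetime"):
    """Count trips by hour of day (0-23) from an iterable of dict rows.

    The column name is a hypothetical default, not the project's schema.
    """
    counts = Counter()
    for row in rows:
        # Assumed timestamp layout: "YYYY-MM-DD HH:MM:SS".
        pickup = datetime.strptime(row[column], "%Y-%m-%d %H:%M:%S")
        counts[pickup.hour] += 1
    return counts


# Tiny made-up sample standing in for a real trip file.
SAMPLE_CSV = """pickup_datetime,passenger_count
2024-01-01 08:15:00,1
2024-01-01 08:45:00,2
2024-01-01 17:05:00,1
"""

hourly = trips_per_hour(csv.DictReader(io.StringIO(SAMPLE_CSV)))
for hour in sorted(hourly):
    print(f"{hour:02d}:00  {hourly[hour]} trips")
```

Because `csv.DictReader` yields rows one at a time, the same function could be pointed at a large file opened with `open(...)` without loading it all into memory, which is in the spirit of the memory-efficient processing the entry describes.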