Browse free open source Data Warehousing software and projects below. Use the toggles on the left to filter open source Data Warehousing software by OS, license, language, programming language, and project status.

  • 1
    DataCleaner

    Data quality analysis, profiling, cleansing, duplicate detection +more

    DataCleaner is a data quality analysis application and a solution platform for DQ solutions. Its core is a strong, extensible data profiling engine, which can be extended with data cleansing, transformations, enrichment, deduplication, matching, and merging. Website: http://datacleaner.github.io
    Downloads: 6 This Week
  • 2
    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    This project is dedicated to open source data quality and data preparation solutions. Data quality includes profiling, filtering, governance, similarity checks, data enrichment and alteration, real-time alerting, basket analysis, bubble charts, warehouse validation, single customer view, etc., as defined by strategy. The tool is developing into a high-performance integrated data management platform that seamlessly handles data integration, data profiling, data quality, data preparation, dummy data creation, metadata discovery, anomaly discovery, data cleansing, reporting, and analytics. It also has Hadoop (big data) support to move files to and from a Hadoop grid and to create, load, and profile Hive tables. This project is also known as "Aggregate Profiler". A RESTful API for this project is being built (beta version) at https://sourceforge.net/projects/restful-api-for-osdq/ and Apache Spark based data quality is being built at https://sourceforge.net/projects/apache-spark-osdq/
    Downloads: 1 This Week
  • 3
    BEE
    The BEE Project is a suite of tools supporting Business Intelligence project implementation, including an ETL tool, an OLAP server, and a thin client. The ROLAP server provides multipass SQL generation and powerful cache management (it uses the MySQL RDBMS).
    Downloads: 0 This Week
  • 4
    An Expert Search System for Enterprise Search based on the Information Retrieval Vector Space Model. The model builds a weighted profile for each candidate and keeps all the documents, so that people and documents can be retrieved together.
    Downloads: 0 This Week
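
    The idea of ranking people and documents in one vector space can be sketched roughly as follows. This is only an illustration, not the project's actual code: it assumes plain term-frequency weights, cosine similarity, and a candidate profile formed by summing the vectors of that person's documents. The corpus `DOCS` and all function names are hypothetical.

```python
import math
from collections import Counter

def tf_vector(text):
    """Term-frequency vector for lowercased, whitespace-tokenized text."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse term vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * \
           math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

# Hypothetical corpus of (author, text) pairs.
DOCS = [
    ("alice", "star schema design for the sales warehouse"),
    ("alice", "warehouse loading with nightly etl jobs"),
    ("bob", "office party planning notes"),
]

def build_index(docs):
    """Index every document plus one summed profile vector per author."""
    doc_vectors = [(f"doc:{i}", tf_vector(text))
                   for i, (_, text) in enumerate(docs)]
    profiles = {}
    for author, text in docs:
        profiles.setdefault(author, Counter()).update(tf_vector(text))
    person_vectors = [(f"person:{name}", vec)
                      for name, vec in profiles.items()]
    return doc_vectors + person_vectors

def search(query, index):
    """Return keys of matching people and documents, best match first."""
    q = tf_vector(query)
    scored = [(cosine(q, vec), key) for key, vec in index]
    return [key for score, key in sorted(scored, reverse=True) if score > 0]

INDEX = build_index(DOCS)
```

    Calling `search("warehouse etl", INDEX)` returns a single ranked list that mixes `doc:` and `person:` keys, since both kinds of entry live in the same index.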
  • 5
    FormatCheck screens flat files looking for violations in the format of the data. It uses a set of XML files that define the rules for each file format. The Swing front-end allows the user to run the verification, view and print the errors.
    Downloads: 0 This Week
  • 6
    Interlogy is a next-generation knowledge storage and presentation system. It offers wide functionality for knowledge evaluation. Interlogy combines a forum, wiki, chat, and social network in one.
    Downloads: 0 This Week
  • 7
    Monitors web pages for changes and emails the differences to subscribers. Supports user accounts and registration. PHP/MySQL.
    Downloads: 0 This Week
  • 8
    Metis for Oracle
    A tool to support data analysis on an Oracle database. In Greek mythology, Metis (Μῆτις) was of the Titan generation ... The Stoic commentators allegorized Metis as the embodiment of "wisdom" or "wise counsel" ... (from Wikipedia)
    Downloads: 0 This Week
  • 9
    Comprehensive, high-performance data distribution and transformation system. Features a simple, user-friendly, event-driven scripting interface that transparently generates and executes highly efficient Perl/C code. Uses: ETL, data warehousing, statistics, data cleansing.
    Downloads: 0 This Week
  • 10
    SPASE Model is a collection of tools for working with the structured data model information. Tools can convert the relational version of the data model into various expressions, including XSD, XMI and PDF documentation.
    Downloads: 0 This Week
  • 11
    SYRAH aims to discover and represent concepts expressed in natural language. Keywords: NLP, lemma, lemma dictionary, Italian, network, semantics, clustering.
    Downloads: 0 This Week
  • 12
    TopicMapTools allows the creation, manipulation and visualization of XML Topic Maps. It is based on the TM4J framework. The editor can be used in genealogy for the visualisation of e.g. family trees.
    Downloads: 0 This Week
  • 13

    openMEDIS

    open source medical device information system

    openMEDIS is a simple, flexible, and standardized tool to collect (inventory) and process (manage) information on health technology infrastructure, e.g. medical devices in low- and middle-income countries. The latest source code is available in the folder Code/trunk. There are also sub-projects of openMEDIS that contain the source code of installations in each country.
    Downloads: 0 This Week
  • 14
    Trauma registry suite; Data collection application and server scripts to build trauma data warehouse and perform web-based analysis reporting. Cross-platform compatible for Windows, Apple, Unix, or Linux.
    Downloads: 0 This Week
  • 15
    This tool acts as a preprocessor and transforms data from a database into ARFF format for Weka data mining. It performs vertical-to-horizontal transformation for association analysis. The tool can use any database for which an OLE DB adapter exists. Written in VB.NET for Win32.
    Downloads: 0 This Week
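
    A vertical-to-horizontal transformation of this kind might look like the sketch below. It is written in Python for illustration and is not the tool's own VB.NET code. It assumes the database rows have already been fetched as `(transaction_id, item)` pairs, that item names contain no spaces or special characters (ARFF would require quoting otherwise), and that each item becomes a `{t,f}` nominal attribute. The names `to_arff` and `ROWS` are hypothetical.

```python
def to_arff(rows, relation="baskets"):
    """Pivot (transaction_id, item) rows into one ARFF data row per transaction."""
    # One attribute (column) per distinct item, in a stable order.
    items = sorted({item for _, item in rows})

    # Group the vertical rows into a set of items per transaction.
    baskets = {}
    for tid, item in rows:
        baskets.setdefault(tid, set()).add(item)

    lines = [f"@relation {relation}", ""]
    lines += [f"@attribute {item} {{t,f}}" for item in items]
    lines += ["", "@data"]
    for tid in sorted(baskets):
        lines.append(",".join("t" if item in baskets[tid] else "f"
                              for item in items))
    return "\n".join(lines)

# Hypothetical vertical data: one row per (transaction, item).
ROWS = [(1, "bread"), (1, "milk"), (2, "milk"), (3, "bread"), (3, "butter")]
ARFF_TEXT = to_arff(ROWS)
```

    Each transaction ends up as a single comma-separated row with one `t`/`f` flag per item, which is the horizontal layout that association rule learners generally expect.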