Compare the Top Data Discovery Software for Linux as of January 2026

What is Data Discovery Software for Linux?

Data discovery software is a type of software tool that allows users to identify patterns, trends, and relationships in large datasets. It uses techniques such as natural language processing and machine learning to quickly analyze data and uncover insights. Data discovery software can be used in a variety of areas such as healthcare, business intelligence, fraud detection, risk management, and more. Its purpose is to give its users quick access to the most relevant data so they can make informed decisions. Compare and read user reviews of the best Data Discovery software for Linux currently available using the list below. This list is updated regularly.

  • 1
    Semarchy xDM
    Use Semarchy unified data platform to experience xDM. Discover, govern, enrich, enlighten and manage data. You can easily transform data into insights with xDM and rapidly deliver data-rich applications with automated master data management. Its business-centric interfaces provide for rapid creation and adoption of data-rich applications, while automation rapidly generates applications to your specific requirements. Use the agile platform to quickly expand or evolve data applications.
  • 2
    Omniscope Evo
    Visokio builds Omniscope Evo, complete and extensible BI software for data processing, analytics and reporting. A smart experience on any device. Start from any data in any shape, load, edit, blend, transform while visually exploring it, extract insights through ML algorithms, automate your data workflows, and publish interactive reports and dashboards to share your findings. Omniscope is not only an all-in-one BI tool with a responsive UX on all modern devices, but also a powerful and extensible platform: you can augment data workflows with Python / R scripts and enhance reports with any JS visualisation. Whether you’re a data manager, scientist or analyst, Omniscope is your complete solution: from data, through analytics to visualisation.
    Starting Price: $59/month/user
  • 3
    OvalEdge
    OvalEdge is a cost-effective data catalog designed for end-to-end data governance, privacy compliance, and fast, trustworthy analytics. OvalEdge crawls your organization’s databases, BI platforms, ETL tools, and data lakes to create an easy-to-access, smart inventory of your data assets. Using OvalEdge, analysts can discover data and deliver powerful insights quickly. OvalEdge’s comprehensive functionality enables users to establish and improve data access, data literacy, and data quality.
    Starting Price: $1,300/month
  • 4
    DashboardFox
    Dashboards, codeless reporting, interactive data visualizations, data-level security, mobile access, scheduled reports, embedding, sharing via link, and more. DashboardFox is a dashboard and data visualization solution designed for business users with a no-subscription pricing model. Pay once and you own the software for life. DashboardFox is self-hosted: install it on your own server, behind your firewall. Looking for Cloud BI? We offer managed hosting services, but you still retain ownership of your DashboardFox licenses and data. DashboardFox allows your users to drill down and interact with live data visualizations via dashboards and reports. Business users can create new visualizations in a codeless report builder without needing a technical pedigree. An alternative to Tableau, Sisense, Looker, Domo, Qlik, Crystal Reports, and others.
    Starting Price: $495 one-time payment
  • 5
    IRI DarkShield
    IRI, The CoSort Company
    IRI DarkShield is a powerful data masking tool that can simultaneously find and anonymize Personally Identifiable Information (PII) "hidden" in semi-structured and unstructured files and in database columns and collections. DarkShield jobs are configured, logged, and run from IRI Workbench or a RESTful RPC (web services) API to encrypt, redact, blur, or otherwise mask the PII found in: NoSQL and relational databases (RDBs); PDFs; Parquet; JSON, XML, and CSV; Excel and Word; and BMP, DICOM, GIF, JPG, and TIFF. DarkShield is one of 3 data masking products in the IRI Data Protector Suite, and comes with IRI Voracity data management platform subscriptions. DarkShield bridges the gap between structured and unstructured data masking, allowing users to secure data in a consistent manner across disparate silos and formats by using the same masking functions as FieldShield and CellShield EE. DarkShield also handles data in RDBs and flat files, but FieldShield offers more capabilities for those sources.
    Starting Price: $5000
  • 6
    Data Rover
    Data Rover is an advanced user data and security management solution for any data-driven organisation. It is a single solution for infrastructure and security managers that allows data users to explore, manage, process, and protect their data effectively and efficiently by addressing the two primary needs related to the use of data: cyber security and data management. Data Rover plays a key role in business asset protection and corporate data management policy definition. Data Analytics: check for security flaws, eliminate issues, and simplify the management of permissions. File Auditor: gives you proof that something was done; right or wrong is not important, just the facts. Dark Data: makes work faster and safer by optimising storage resource usage and reducing costs, and involves users in data management so they can help keep storage systems clean and efficient. Advanced Data Exchange: share business data in and out of the company safely.
  • 7
    Knovos Discovery
    A holistic eDiscovery solution, Knovos Discovery is eDiscovery made easy. Knovos’ end-to-end solution covers the entire litigation life cycle, from processing and early case assessment through review to production. Our powerful built-in analytics engine enables deep, multidimensional analysis for a 360-degree view of your data.
  • 8
    MinerEye DataTracker
    MinerEye’s DataTracker enables organizations to overcome the information governance and protection challenge. It automatically scans, indexes, analyzes, virtually labels and categorizes every piece of unstructured and dark data contained in the organization’s data repositories. With proprietary Interpretive AI™, machine learning, and computer vision, the solution locates relevant files out of the billions that are stored, accurately evaluates them, qualifies them by significance and purpose, and automatically sends alerts with next best action recommendations in cases of conflicts, duplications, or potential violations. This way, data protection is profoundly enhanced while risk and operational costs are reduced.
    Starting Price: $2000/1TB/month
  • 9
    Open Content Platform
    Open Content Platform (OCP) is an open-source IT discovery and integration platform. OCP handles discovery, integration, and modeling for solutions in the datacenter. The core framework is microservice-based, enabling process-level horizontal scalability. It’s fully extensible, designed for developers, and leverages components in the open-source space to greatly reduce product complexity. The dynamic discovery is fundamentally different from that of other ITSM vendors, and the data-driven modeling keeps Applications and Services up-to-date for downstream consumption (e.g. service-aware monitoring).
    Starting Price: $0
  • 10
    Enterprise Recon
    Ground Labs
    With Enterprise Recon by Ground Labs, organizations can find and remediate sensitive information across the broadest range of structured and unstructured data, whether it’s stored on your servers, on your employees’ devices, or in the cloud. Enterprise Recon enables organizations worldwide to seamlessly discover all data and comply with 50+ country regulations, including GDPR, PCI DSS, CCPA, HIPAA, Australian Privacy, and other data security standards that require the ability to locate and secure PII data as well as information on gender, ethnicity, and health, or even non-PII financial data. Enterprise Recon is powered by GLASS™, Ground Labs' proprietary technology that enables the quickest and most accurate data discovery across the broadest set of platforms available. Enterprise Recon natively supports sensitive data discovery on Windows, macOS, Linux, FreeBSD, Solaris, HP-UX, and IBM AIX using agent and agentless options. Additional remote options also enable scanning of almost any data stored on the network.
  • 11
    Enov8
    End-to-end “Business Intelligence” for your IT organization. Promoting transparency, control, and productivity across environments, release and data. Promote scaled agility across your IT fabric. A complete environment and release picture supporting collaboration across teams and providing the insight that organizations require today to drive competitive innovation. Improve visibility of your complex IT fabric allowing better collaboration and decision making. Manage complex computer systems & the end-to-end IT fabric through a centralized portal. Measure test environment usage to reduce IT spend and increase project productivity. Eliminate chaotic and non-repeatable operations by establishing control via centralized runbooks and using automation on regular & time consuming tasks. Manage change and contention effectively whilst providing real time health status and powerful analytics to determine business impact.
    Starting Price: $8 per month
  • 12
    Protegrity
    Our platform allows businesses to use data, including its application in advanced analytics, machine learning, and AI, to do great things without worrying about putting customers, employees, or intellectual property at risk. The Protegrity Data Protection Platform doesn't just secure data; it simultaneously classifies and discovers data while protecting it. You can't protect what you don't know you have. Our platform first classifies data, allowing users to categorize the type of data that can mostly be in the public domain. With those classifications established, the platform then leverages machine learning algorithms to discover that type of data. Classification and discovery find the data that needs to be protected. Whether encrypting, tokenizing, or applying privacy methods, the platform secures the data behind the many operational systems that drive the day-to-day functions of business, as well as the analytical systems behind decision-making.
  • 13
    Cyberquery
    Cyberscience Corporation
    Cyberscience is an international software organization that offers a Business Intelligence software suite named Cyberquery. Cyberquery is offered in both SaaS and traditional licensing models. Some of Cyberquery’s most valued features include an intuitive UI, analytics with drills, data visualization, dashboards, XLS integration, and automated content distribution. Unlike most vendors in the BI space, Cyberscience differentiates itself by offering live phone support in addition to email, with a support team averaging 15 years of industry experience. The Cyberscience support team provides same-day responses to issues, and they score very highly on customer satisfaction surveys.
  • 14
    BigID
    BigID is data visibility and control for all types of data, everywhere. Reimagine data management for privacy, security, and governance across your entire data landscape. With BigID, you can automatically discover and manage personal and sensitive data – and take action for privacy, protection, and perspective. BigID uses advanced machine learning and data intelligence to help enterprises better manage and protect their customer & sensitive data, meet data privacy and protection regulations, and leverage unmatched coverage for all data across all data stores.
  • 15
    Fasoo Data Radar
    Fasoo Data Radar (FDR) is a data discovery and classification solution that helps organizations locate, analyze, and manage sensitive unstructured data across on-premises servers, cloud storage, and endpoints. It scans and classifies files based on keywords, regex patterns, file formats, and other predefined policies, ensuring organizations maintain control over critical information. With real-time monitoring and centralized policy enforcement, FDR enhances data security by identifying risks, preventing unauthorized access, and assisting with compliance requirements such as GDPR, HIPAA, and CCPA. Its integration with enterprise security frameworks allows organizations to apply consistent data protection policies while improving operational workflows. By automating data classification and governance, FDR increases efficiency and enhances data visibility for security and compliance management.
  • 16
    IRI Voracity
    IRI, The CoSort Company
    Voracity is the only high-performance, all-in-one data management platform accelerating AND consolidating the key activities of data discovery, integration, migration, governance, and analytics. Voracity helps you control your data in every stage of the lifecycle, and extract maximum value from it. Only in Voracity can you: 1) CLASSIFY, profile and diagram enterprise data sources; 2) Speed or LEAVE legacy sort and ETL tools; 3) MIGRATE data to modernize and WRANGLE data to analyze; 4) FIND PII everywhere and consistently MASK it for referential integrity; 5) Score re-ID risk and ANONYMIZE quasi-identifiers; 6) Create and manage DB subsets or intelligently synthesize TEST data; 7) Package, protect and provision BIG data; 8) Validate, scrub, enrich and unify data to improve its QUALITY; 9) Manage metadata and MASTER data. Use Voracity to comply with data privacy laws, de-muck and govern the data lake, improve the reliability of your analytics, and create safe, smart test data.
  • 17
    SalesIntel
    Get the Best B2B Data Available with SalesIntel. SalesIntel is the top provider of accurate and affordable sales and marketing contact data, with the highest number of direct dials and human-verified contacts in the industry. SalesIntel is your resource for contact and business intelligence. Our combination of automation and researchers allows us to reach 95% data accuracy for all our published contact data while continuing to scale up our number of contacts. SalesIntel’s new method is to have every direct-dial number and email tested and re-verified every 90 days. With the most comprehensive contact & company data and our excellent customer service, SalesIntel is your partner in growth.
  • 18
    TIMi
    With TIMi, companies can capitalize on their corporate data to develop new ideas and make critical business decisions faster and easier than ever before. TIMi’s Integrated Platform brings together TIMi’s real-time AUTO-ML engine, 3D VR segmentation and visualization, and unlimited self-service business intelligence. TIMi is several orders of magnitude faster than any other solution at the 2 most important analytical tasks: the handling of datasets (data cleaning, feature engineering, creation of KPIs) and predictive modeling. TIMi is an “ethical solution”: no “lock-in” situation, just excellence. We guarantee that you can work with peace of mind and without unexpected extra costs. Thanks to an original and unique software infrastructure, TIMi is optimized to offer you the greatest flexibility for the exploration phase and the highest reliability during the production phase. TIMi is the ultimate “playground” that allows your analysts to test the craziest ideas!
  • 19
    Nyxeia Information Governance Suite
    The Information Governance Suite is a set of products aimed at helping organizations to better discover, categorize, enhance, and govern their information assets regardless of the systems in which they are managed. Products in the suite include:
      - .discover, which connects to information systems to index and categorize unstructured and structured information assets
      - .policy, which allows organizations to create full lifecycle policies for information retention and disposal
      - .preserve, for digital asset preservation near the end of the asset lifecycle
      - .process, for automating content related actions like content categorization to help records teams deal with escalating workload
    The solution helps identify sensitive information that may reduce compliance with regulations like GDPR, as well as information that may be redundant, trivial, or obsolete.