IDC Innovators: Komprise Included in Knowledge Management Technologies

IDC Innovators Vendor Profile: Komprise

IDC Innovators Excerpt Features Komprise

Komprise’s Intelligent Data Management helps the enterprise do two valuable things: unlock the value hidden in unstructured data and reduce storage costs. This is especially relevant for data-intensive industries like pharma/biotech, research, financial, and public sector. Proper metadata tagging and access ensure AI solutions can extract and present the right data, at the right time, and to the right person. Komprise is a SaaS solution that is typically run in a hybrid deployment.

Read this IDC Innovator Assessment of Komprise Intelligent Data Management. You can also download the one-page summary here.

Key Komprise Differentiators:

  • Analytics UI
  • Global Search capabilities
  • Smart Data Workflows

Read the press release.

Read this paper to see why IDC has recognized Komprise as an Innovator!

What did IDC conclude about Komprise in its Innovators report on knowledge management technologies and why does it matter?

Komprise was named an IDC Innovator in the report: IDC Innovators: Knowledge Management Technologies, recognized for Komprise Intelligent Data Management, a single platform to analyze, move and manage unstructured data. IDC Innovator recognition is awarded to vendors under $100M in revenue that offer a new technology, a groundbreaking solution to an existing issue, or an innovative business model — it is a signal of genuine technical differentiation, not a purchase ranking. What the report concluded:

  • The market problem is structural and accelerating — over 90% of the data enterprises produce is unstructured, according to IDC’s Global Datasphere 2023, and it is a key asset of enterprise intelligence as well as a large part of storage costs; the scale of the problem Komprise addresses is one IDC has been tracking for years and expects to worsen
  • Komprise reduces complexity without getting in the way — Komprise reduces the complexities of managing unstructured data growth with location-agnostic file analysis and indexing that is purpose-built to not disrupt data movement and operations
  • Three specific differentiators called out — IDC noted the Analytics UI, global search capabilities, and Smart Data Workflows as key differentiators of Komprise
  • Relevant across data-intensive industries — Komprise Intelligent Data Management is especially relevant for data-intensive industries like pharma and biotech, research, financial services, and public sector; proper metadata tagging and access ensures AI solutions can extract and present the right data at the right time and to the right person
  • The positioning IDC validated — Komprise is the metadata and orchestration layer for enterprise unstructured AI data; the IDC recognition confirms that treating data management as a layer independent of storage, with intelligence built in from the start, is the correct architectural direction for the AI era

What does IDC’s research reveal about the scale of the enterprise unstructured data problem and how has it evolved since the report was published?

The IDC Innovators report was published against a backdrop of explosive unstructured data growth that has only accelerated since. In 2022, 90% of the data generated by organizations was unstructured, and only 10% was structured; that year organizations globally generated 57,280 exabytes of unstructured data, a volume expected to grow by 28% to over 73,000 exabytes in the following year; additionally, half of survey participants told IDC that their company’s unstructured data is mostly or completely siloed. What has changed since the report:

  • Volume growth has surpassed IDC’s projections — 74% of organizations are now storing more than 5PB of unstructured data, a 57% increase over just the prior year, with 40% managing more than 10PB; the IDC finding that data is mostly siloed has become more acute, not less, as new AI data sources add to the estate
  • Budgets have not kept pace — nearly three-quarters of organizations are spending 30% or more of their IT budget on data storage and protection; the IDC prediction that storage budgets would lag data growth has proven exactly correct
  • The silo problem now directly impedes AI — the top technical challenge for unstructured data management is classifying data for AI, cited by 58% of IT leaders, followed by moving data without disruption at 53%; the siloed, ungoverned data estates IDC described are now the primary obstacle to enterprise AI initiatives
  • Flash prices compound the challenge — the memory shortage is structural, not cyclical; IDC describes the current situation as a potentially permanent strategic reallocation of the world’s silicon wafer capacity, with major memory manufacturers pivoting toward high-margin enterprise-grade AI components; IDC expects 2026 DRAM and NAND supply growth to be below historical norms, at 16% and 17% year-on-year respectively; TrendForce data confirms the impact, with NAND Flash contract prices expected to rise 70 to 75% quarter-on-quarter in Q2 2026 alone, with a clear shortage expected through 2026 and meaningful capacity expansion unlikely until late 2027 or 2028; enterprises storing petabytes of unstructured data on flash NAS are absorbing these increases at exactly the moment budgets are under maximum pressure (source IDC)
  • Komprise is the metadata and orchestration layer that addresses every dimension of this problem simultaneously — cost optimization through intelligent tiering, AI readiness through the Global Metadatabase and Smart Data Workflows, and governance through Sensitive Data Management — from a single platform that works across any unstructured data storage vendor
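
The growth figure quoted in this answer can be checked with simple arithmetic. The sketch below uses only the two IDC numbers cited above (57,280 exabytes and 28% growth); the variable names are illustrative and not taken from any IDC or Komprise material.

```python
# Figures as quoted in the text above (IDC): 2022 volume and expected growth.
unstructured_2022_eb = 57_280   # exabytes of unstructured data generated in 2022
growth_rate = 0.28              # expected growth into the following year

# Apply one year of growth to the 2022 volume.
projected_next_year_eb = unstructured_2022_eb * (1 + growth_rate)

print(f"Projected volume: {projected_next_year_eb:,.0f} EB")
# prints "Projected volume: 73,318 EB"
```

The result, roughly 73,300 exabytes, is consistent with the "over 73,000 exabytes" stated in the text.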

What is location-agnostic file analysis and indexing and why did IDC identify it as a key Komprise differentiator?

Komprise reduces the complexities of managing unstructured data growth with location-agnostic file analysis and indexing that is purpose-built to not disrupt data movement and operations. Location-agnostic analysis means the platform indexes and manages data based on what that data is and how it is used — not on which vendor’s storage array it happens to reside on. IDC identified this as a differentiator because it is the architectural prerequisite for every subsequent data management decision:

  • Single pane of glass across all silos — Komprise indexes all file and object data across NAS, cloud, and object storage simultaneously, regardless of vendor; an enterprise managing NetApp, Dell, IBM, VAST Data, and AWS S3 simultaneously sees a unified view of all data, all access patterns, all costs, and all risks in a single interface
  • Analysis that does not disrupt operations — Komprise Observers operate out-of-band using standard NFS, SMB, and S3 protocols with no agents installed on storage systems; data management solutions that sit in front of primary storage create a middleman that impacts hot data performance and introduces a single point of failure; Komprise is architecturally different — it is never in the hot data path
  • The Global Metadatabase as the intelligence layer — location-agnostic analysis is what populates the Komprise Global Metadatabase: a continuously updated, cross-silo index of standard and enriched metadata for every file and object across the enterprise; this is the metadata and orchestration foundation that makes Smart Data Workflows, Deep Analytics, and AI data curation possible at petabyte scale
  • KAPPA extends analysis to proprietary formats: KAPPA data services extend location-agnostic analysis to file formats that standard indexing tools cannot read, including DICOM medical images, genomics BAM files, and domain-specific documents, extracting custom metadata attributes using serverless processing and writing them back to the Global Metadatabase
  • Know before you move — the IDC recognition of the Analytics UI as a key differentiator reflects a core Komprise principle: organizations need visibility into what data they have, who uses it, what it costs, and what it is worth before making any data movement, tiering, or AI ingestion decision
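
To make the idea of location-agnostic indexing concrete, here is a minimal sketch in Python. It is not Komprise's API or data model: the `FileRecord` fields, the storage system names, and the helper functions are all hypothetical. It only illustrates the principle described above, that a query selects data by what it is and how it is used, while the storage location is just one more attribute.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class FileRecord:
    """One index entry. Every field name here is hypothetical."""
    path: str
    storage_system: str   # where the file lives, kept only as an attribute
    size_bytes: int
    last_accessed: date
    owner: str


def cold_files(index, cutoff):
    """Return records not accessed since `cutoff`, regardless of location."""
    return [r for r in index if r.last_accessed < cutoff]


def bytes_by_system(records):
    """Sum sizes per storage system for a set of records."""
    totals = {}
    for r in records:
        totals[r.storage_system] = totals.get(r.storage_system, 0) + r.size_bytes
    return totals


# A toy index merged from three made-up silos.
index = [
    FileRecord("/proj/a/scan1.dcm", "nas-vendor-a", 400, date(2021, 3, 1), "radiology"),
    FileRecord("/proj/b/run7.bam", "nas-vendor-b", 900, date(2024, 11, 5), "genomics"),
    FileRecord("archive/q1.pdf", "object-cloud", 50, date(2020, 6, 9), "finance"),
]

# One query spans every silo; location only matters when reporting results.
cold = cold_files(index, cutoff=date(2023, 1, 1))
print(bytes_by_system(cold))
```

The same query runs unchanged if a fourth silo is added to the index, which is the practical meaning of "location-agnostic" in this sketch.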

What are Komprise Smart Data Workflows and why did IDC single them out alongside AI use cases like chatbot augmentation and image recognition?

IDC noted that in May 2024, Komprise announced Smart Data Workflow Manager, a no-code AI data workflow builder that addresses use cases such as sensitive data identification, chatbot augmentation, image recognition, and more. Smart Data Workflows are automated, policy-driven pipelines that orchestrate the full sequence of discovering, classifying, governing, enriching, and delivering unstructured data to any downstream system — including AI pipelines — without manual intervention. Why IDC called them out:

  • They close the gap between data management and AI — most data management platforms stop at analysis and reporting; Smart Data Workflows go further, translating the intelligence gathered by the Global Metadatabase into automated actions: tier this file, tag this dataset, exclude this PII, deliver this cohort to the AI service
  • No-code accessibility — the Smart Data Workflow Manager lets IT teams and authorized business users build and run complex cross-silo data workflows without scripting or engineering resources; this is what makes the platform practical for data-intensive industries where the workflows needed change frequently as AI use cases evolve
  • Chatbot augmentation requires governed data — feeding a corporate AI assistant or RAG pipeline requires knowing which documents are current, authoritative, and safe to surface; Smart Data Workflows built on the Global Metadatabase deliver exactly this — curated, sensitivity-checked, metadata-enriched document sets that make AI assistants accurate and compliant
  • Image recognition requires precise curation — proper metadata tagging and access ensures AI solutions can extract and present the right data at the right time and to the right person; in medical imaging AI, this means querying petabytes of DICOM files by diagnosis code, modality, and patient cohort using KAPPA-enriched metadata before a single image is moved to an AI pipeline
  • Agentic AI demands runtime orchestration — as AI agents increasingly invoke data retrieval autonomously, Smart Data Workflows and KAPPA functions become the orchestration layer that AI agents call at runtime; Komprise is the metadata and orchestration layer for enterprise unstructured AI data, and IDC’s recognition of Smart Data Workflows confirms this architectural role
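
The sequence described above (select by metadata, exclude sensitive data, tag, deliver) can be sketched as a small policy-driven pipeline. This is an illustration under stated assumptions, not the Smart Data Workflow Manager itself, which the source describes as a no-code tool: the `Doc` fields, the `run_workflow` function, and the tag name are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Doc:
    """A document described only by metadata. All fields are hypothetical."""
    name: str
    doc_type: str
    contains_pii: bool
    tags: set = field(default_factory=set)


def run_workflow(docs, wanted_type, tag):
    """Select by type, exclude PII, tag what remains, and return it for delivery."""
    selected = [d for d in docs if d.doc_type == wanted_type]
    safe = [d for d in selected if not d.contains_pii]
    for d in safe:
        d.tags.add(tag)
    return safe


docs = [
    Doc("policy_2024.pdf", "policy", contains_pii=False),
    Doc("hr_case_17.pdf", "policy", contains_pii=True),
    Doc("slide_deck.pptx", "presentation", contains_pii=False),
]

# Only the PII-free policy document is tagged and handed to the downstream step.
delivered = run_workflow(docs, wanted_type="policy", tag="chatbot-corpus")
print([d.name for d in delivered])
# prints "['policy_2024.pdf']"
```

Filtering happens before tagging, so excluded documents are never marked as part of the corpus. In this sketch, that ordering is what keeps sensitive files out of a chatbot or RAG data set.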

What customer outcomes did IDC validate for Komprise, and what has the platform delivered since the report was published?

Komprise customers are enterprises in multiple sectors with petabyte-scale environments, including brand names such as Pfizer, Marriott, Kroger, NYU, and Fossil. The IDC report validated two core outcomes: unlocking the value hidden in unstructured data and reducing storage costs. Both have been delivered at scale and the platform has advanced considerably since IDC published its assessment. What the validated outcomes look like in practice:

  • Storage cost reduction at enterprise scale: Pfizer reduced storage and cloud costs by 70 to 75% using Komprise intelligent tiering; a major hospital in the southeast is saving $2.5M per year by tiering cold files from on-premises storage to cloud; on 1PB of data, the typical organization pays for 4PB of storage on backup and disaster recovery and 1PB of backup license; Komprise addresses this multiplier by removing cold data from the backup footprint alongside primary storage
  • AI data value unlocked: NewYork-Presbyterian used Komprise Smart Data Workflows to achieve 10x faster AI data ingestion and 96% lower cloud costs for its digital pathology AI program; this use case did not exist at the time of the IDC report and demonstrates how the platform has evolved from cost optimization to AI data orchestration
  • Platform advances since the IDC report: KAPPA data services (serverless custom metadata extraction at petabyte scale), Intelligent AI Ingest (2x faster AI data delivery with 70%+ noise filtering), the Elastic Shares patent (near-linear speed-up for AI data processing and migrations), and the Flash Stretch Assessment program (qualified enterprise assessments showing potential savings from intelligent tiering) have all been added since the IDC assessment
  • Market recognition has continued — Komprise has since been named a Leader in the Coldago Research Map for Unstructured Data Management, won the SiliconAngle TechForward Award for Data Storage and Management, received a Gold Stevie Award in Data Tools and Platforms, and achieved four consecutive years on the Inc. 5000 list
  • The IDC thesis has been validated by the market — being named an IDC Innovator indicates how organizations are starting to treat data independently of storage to ascertain and nurture its true value across hybrid cloud infrastructure; two years on, that shift from storage-centric to data-centric management has become the defining architectural conversation in enterprise IT, and Komprise as the metadata and orchestration layer for enterprise unstructured AI data sits at its center
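
The backup multiplier mentioned in the first bullet above can be worked through numerically. The sketch below takes the per-petabyte ratios from the text (4PB of backup and disaster recovery storage and 1PB of backup license per 1PB of data) and applies a cold-data fraction of 50%, which is a purely hypothetical value chosen for illustration and not a figure from the source. It also assumes that tiered cold data leaves the backup scope entirely.

```python
# Ratios quoted in the text, per 1 PB of primary data.
BACKUP_DR_PB_PER_PB = 4.0       # backup and disaster recovery storage
BACKUP_LICENSE_PB_PER_PB = 1.0  # backup license capacity

primary_pb = 1.0
cold_fraction = 0.5  # hypothetical share of data that is cold and can be tiered

# Data that stays on primary storage and therefore stays in backup scope.
remaining_pb = primary_pb * (1 - cold_fraction)

before = {
    "backup_dr_pb": primary_pb * BACKUP_DR_PB_PER_PB,
    "backup_license_pb": primary_pb * BACKUP_LICENSE_PB_PER_PB,
}
after = {
    "backup_dr_pb": remaining_pb * BACKUP_DR_PB_PER_PB,
    "backup_license_pb": remaining_pb * BACKUP_LICENSE_PB_PER_PB,
}

print("before:", before)
print("after: ", after)
```

Under these assumptions, each petabyte of cold data removed from primary storage also removes its multiplied share of backup and disaster recovery capacity, which is why the text describes the saving as a multiplier rather than a one-to-one reduction.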