Data Management Glossary
Global File Index
What is a Global File Index?
Komprise Deep Analytics enables precise unstructured data management at enterprise scale, creating a Global File Index, which is a metadata catalog, delivering the benefits of Global Namespace or Global File System data access without sitting in front of the hot data path. Spanning petabytes of file and object data sources, the Global File Index allows enterprise customers to find specific data sets and then create a data management policy or Smart Data Workflow to systematically take action on your data set. Unstructured data ends up in multiple silos, so an index needs to be global across different data centers, storage, backup and cloud infrastructure and it must not sit in front of the hot data path to ensure there is no impact on data storage performance.
Once you connect Komprise to your file and object storage, your data is indexed and a Global File Index, which is a global metadata catalog across disparate file and object data, is created. You do not have to move the data anywhere; but you now have a single way to query and search across your file and object stores. Say you have some NetApp, some Isilon, some Windows servers, some Pure Storage at different sites and you have some cloud file storage on AWS, Azure, and Google. You get a single index via Komprise of all the data across all these environments and now you can search and find exactly the data you need with a single console and API.
Benefits of the Global File Index
- Users only move the data they need, with the ability to create queries on countless file attributes and tags such as: data related to a specific tag or project name, projects that are no longer active, file age, user/group ID’s, path, file type (aka JPEG) and specific extensions, data with unknown owners.
- A global metadata catalog eliminates the manual effort of finding custom data sets and moving them separately from different storage silos since Komprise can create a virtual data set based on the query and systematically and continuously move data from multiple file and object silos to the target location.
- Improves IT and business collaboration around data, as data owners/users can participate in data tiering.
Watch the TechKrunch session: Deep Analytics Actions with One Global File Index
Search and Act on Unstructured Data Insights
Deep Analytics Actions provides a systematic way to find specific file and object data across hybrid cloud storage silos and move just the right subset of unstructured data for new uses such as AI/ML and cloud analytics. This gives IT and storage departments the ability to drive closer connections with end users by liberating the nuggets of useful data from petabytes of files, so that new value and customer-facing benefits can be discovered.
Smart Data Workflows take Deep Analytics Actions a step further by allowing IT users and/or storage admins to create automated workflows for all the steps required to find the right unstructured data across storage assets, tag and enrich the data and send it to external tools for analysis. This eliminates manual effort in unstructured data management and helps organizations speed time to value from cloud-native and other tools.