“We were impressed by the functionality of MetadataHub right from the start, and by how quickly new file formats were integrated for us.”
Carsten Schäuble
Head of IT - ZUSE Institut Berlin
MetadataHub provides a comprehensive overview of data stored on NAS, in the cloud, and in object storage; it analyzes file and embedded metadata and makes unstructured data efficiently usable for search, analysis, AI, and workflows.
MetadataHub analyzes, indexes, and links unstructured data regardless of its location—without altering your data. With intelligent search, tagging, and metadata analysis, accessing information
has never been easier.
The central layer for all your data sources
MetadataHub acts as a unified unstructured data layer across all your storage locations—whether on-premises, in the cloud, or in a hybrid environment.
Data doesn’t need to be moved: The Hub indexes it directly at its storage location, makes it searchable, and connects silos into a unified view. This saves end users and IT teams time and provides transparency across file servers, archive systems, object storage, and more.
Technical Fundamentals
From analysis and tagging to AI data provision: MetadataHub transforms unstructured data into actionable insights. Easy to integrate and intuitive to use—for data science, IT, and business departments alike.
Context-based search filters large volumes of data by file type, age, attributes, and content. This reduces
your processing time.
Metadata can be assigned via the UI or API—regardless of the storage system. This creates a consistent information space.
Archive systems are fully indexed and searchable. Only filtered records need to be retrieved—saving both time and money.
MetadataHub works with any storage system that can be connected via SMB, NFS, or S3. Every component is containerized.
With its modern architecture, standard protocols, and scalable design, MetadataHub can be seamlessly integrated into any enterprise environment.
System Architecture & Integration
MetadataHub is based on a containerized microservices architecture and can be flexibly integrated into any environment. It supports NAS, object storage, and cloud systems alike and indexes millions of files without requiring infrastructure changes. Search queries can be saved, automated, or transferred to third-party systems via API.
This creates structured data pipelines for analytics, AI, and compliance. At the same time, administrators retain full control over data access and analysis at all times.
Security and Management Features
From research and government agencies to industry—MetadataHub is transforming data landscapes around the world.
Here you’ll find answers to the most important questions about architecture, search, tagging, integration, and performance. Ideal for organizations that want to use unstructured data more efficiently and make their storage landscape more transparent.
MetadataHub is a platform for cataloging, analyzing, and utilizing metadata across unstructured data. It collects and indexes metadata from NAS, object storage, and cloud systems, making data discoverable and ready for analysis, AI workloads, compliance, and operational workflows.
It brings transparency to unstructured data sets, reduces search times, improves data quality and governance, and enables automated workflows for data analysis and AI models—without requiring any changes to existing storage infrastructures.
MetadataHub supports NAS shares, object storage (S3-compatible), and cloud storage systems. It analyzes and indexes metadata from a wide variety of file formats, including embedded metadata.
The solution processes millions of files using a containerized microservices architecture. This allows large volumes of data to be indexed efficiently without having to modify existing infrastructure or storage locations.
MetadataHub captures both file system metadata (e.g., name, size, modification date) and embedded metadata from various formats (e.g., EXIF, XMP, document properties, application metadata) to create a comprehensive data profile.
Centralized metadata indexing and analysis make it easy to implement retention policies, classification, and audit trails. Administrators maintain full visibility into data access and usage.
Yes. Search queries can be saved, run on a regular basis, or transferred to third-party systems via APIs to create automated processes or workflows.
MetadataHub is based on a containerized microservices architecture and can be flexibly integrated into on-premises, hybrid, or cloud environments. APIs enable integration with external tools and platforms.
Yes. High-quality data is the foundation for successful processing by AI. MetadataHub sifts through the vast amount of unstructured data to identify the right data for successful processing by AI.