What Makes a Data Catalog “Smart”? #3 – Metadata Management

Actian Corporation

February 16, 2022

A data catalog harnesses enormous amounts of very diverse information, and its volume will grow exponentially. This raises two major challenges:

  • How to feed and maintain the volume of information without tripling (or more) the cost of metadata management?
  • How to find the most relevant datasets for any specific use case?

A data catalog should be Smart in order to answer these two questions, with smart technological and conceptual features that go beyond the mere integration of AI algorithms.

In this respect, we have identified five areas in which a data catalog can be “Smart”, most of which do not involve machine learning:

  1. Metamodeling
  2. The data inventory
  3. Metadata management
  4. The search engine
  5. User experience

It is in the field of metadata management that the notion of the Smart Data Catalog is most commonly associated with algorithms, machine learning, and AI.

How is Metadata Management Automated?

Metadata management is the discipline of assigning values to the metamodel attributes of the inventoried assets. The workload required is usually proportional to both the number of attributes in the metamodel and the number of assets in the catalog.

The role of the Smart Data Catalog is to automate this activity as much as possible, or at the very least to help the human operators (Data Stewards) do so in order to ensure greater productivity and reliability.

As seen in our last article, a smart connectivity layer automates part of the metadata collection, but this automation is restricted to a limited subset of the metamodel, mostly technical metadata. A complete metamodel, even a modest one, also has dozens of metadata attributes that cannot be extracted from the source systems’ registries (because they are not there to begin with).

Several approaches can help close this gap:

Pattern Recognition

The most direct approach consists of identifying patterns in the catalog in order to suggest metadata values for new assets.

Put simply, a pattern will include all the metadata of an asset and the metadata of its relations with other assets or other catalog entities. Pattern recognition is typically done with the help of machine learning algorithms.

The difficulty in implementing this approach lies in describing the information assets in numerical form so that they can feed the algorithms and surface the relevant patterns. A simple structural analysis is not enough: two datasets can contain identical data in different structures. Comparing the data values themselves does not work either: two datasets can contain the same kind of information with different values, for example 2020 client invoicing in one dataset and 2021 client invoicing in the other.

To solve this problem, the Actian Data Intelligence Platform relies on a technology called fingerprinting. To build the fingerprint, we extract two types of features from our clients’ data:

  • A group of features adapted to the numerical data (mostly statistical indicators).
  • Features derived from word embedding models (word vectorization) for the textual data.

Fingerprinting is at the heart of our intelligent algorithms.
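
To make the idea more concrete, here is a minimal sketch of what a column fingerprint could look like. It is not the platform's implementation: the function names, the choice of statistical indicators, and the vector size are all assumptions made for illustration, and a hashed bag-of-words stands in for a real word embedding model so that the example runs with the Python standard library alone.

```python
import hashlib
import math
import statistics


def numeric_features(values):
    """Statistical indicators for a numeric column.

    The choice of indicators is an assumption made for this sketch.
    """
    return [
        statistics.fmean(values),
        statistics.pstdev(values),
        min(values),
        max(values),
        statistics.median(values),
    ]


def text_features(values, dim=16):
    """Hashed bag-of-words vector, L2-normalized.

    This is a stand-in for a word embedding model, not an embedding itself.
    """
    vec = [0.0] * dim
    for value in values:
        for token in value.lower().split():
            digest = hashlib.md5(token.encode("utf-8")).digest()
            vec[digest[0] % dim] += 1.0
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else vec


def fingerprint(column):
    """Return (kind, feature vector) for a non-empty column of values."""
    if not column:
        raise ValueError("cannot fingerprint an empty column")
    is_numeric = all(
        isinstance(v, (int, float)) and not isinstance(v, bool) for v in column
    )
    if is_numeric:
        return ("numeric", numeric_features(column))
    return ("text", text_features([str(v) for v in column]))


# Two invoice columns with different values but a similar statistical profile.
invoices_2020 = fingerprint([120.0, 80.5, 230.0])
invoices_2021 = fingerprint([130.0, 75.0, 240.0])
```

The point of such a representation is that the 2020 and 2021 invoicing columns end up with nearby feature vectors even though none of their values match, which is what a comparison of raw values would miss.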

The Other Embedded Approaches in a Suggestion Engine

While pattern recognition is indeed an efficient approach for suggesting the metadata of a new asset in a catalog, it rests on an important prerequisite: in order to recognize a pattern, there has to be one to recognize. In other words, this only works if there are a number of assets in the catalog (which is obviously not the case at the start of a project).

And it’s precisely in these initial phases of a catalog project that the metadata management load is the highest. It is, therefore, crucial to include other approaches likely to help the Data Stewards in these initial phases, when a catalog is more or less empty.

The Actian Data Intelligence Platform suggestion engine, which provides intelligent algorithms to assist with metadata management, also offers other approaches (which we enrich regularly).

Here are some of these approaches:

  • Structural similarity detection.
  • Fingerprint similarity detection.
  • Name approximation.

This suggestion engine, which analyzes the catalog content in order to determine the probable metadata values of newly integrated assets, is a permanent subject of experimentation. We regularly add new approaches, some very simple and others much more sophisticated. In our architecture, it is a dedicated service whose performance improves as the catalog grows and as we enrich our algorithms.

For the Actian Data Intelligence Platform, we have chosen lead time as our main metric for measuring the productivity of the Data Stewards (which is the ultimate objective of smart metadata management). Lead time is a notion that stems from lean management. In a data catalog context, it measures the time elapsed between the moment an asset is inventoried and the moment all of its metadata has been valued.
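
As an illustration of the lead time definition above, the following sketch computes it from two timestamps per asset. The function names, the use of `None` for assets that are not yet fully documented, and the choice of the median as the summary statistic are assumptions made for this example.

```python
from datetime import datetime
from statistics import median


def lead_time(inventoried_at, valued_at):
    """Time between inventory and full valuation of an asset.

    Returns None while some metadata is still missing (valued_at is None).
    """
    if valued_at is None:
        return None
    return valued_at - inventoried_at


def median_lead_time(assets):
    """Median lead time over the fully documented assets, or None if there are none."""
    completed = [
        lt for lt in (lead_time(start, end) for start, end in assets) if lt is not None
    ]
    return median(completed) if completed else None


# Hypothetical (inventoried_at, valued_at) timestamps for four assets.
assets = [
    (datetime(2022, 1, 3), datetime(2022, 1, 4)),
    (datetime(2022, 1, 3), datetime(2022, 1, 6)),
    (datetime(2022, 1, 5), None),  # still being documented
    (datetime(2022, 1, 10), datetime(2022, 1, 15)),
]
summary = median_lead_time(assets)
```

Tracking a median rather than a mean keeps a few long-neglected assets from dominating the figure, although either choice is compatible with the definition given here.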

For more information on how Smart metadata management enhances a Data Catalog, download our eBook: “What is a Smart Data Catalog?”.

About Actian Corporation

Actian makes data easy. Our data platform simplifies how people connect, manage, and analyze data across cloud, hybrid, and on-premises environments. With decades of experience in data management and analytics, Actian delivers high-performance solutions that empower businesses to make data-driven decisions. Actian is recognized by leading analysts and has received industry awards for performance and innovation. Our teams share proven use cases at conferences (e.g., Strata Data) and contribute to open-source projects. On the Actian blog, we cover topics ranging from real-time data ingestion, data analytics, data governance, data management, data quality, data intelligence to AI-driven analytics.