Data Intelligence

7 Lies of Data Catalog Providers #1: Not a Data Governance Solution

Actian Corporation

June 16, 2021

The Data Catalog market has developed rapidly, and it is now deemed essential when deploying a data-driven strategy. Victim of its own success, this market has attracted a number of players from adjacent markets.

 These players have rejigged their marketing positioning to present themselves as Data Catalog solutions.

The reality is that, while relatively weak on the data catalog functionalities themselves, these companies attempt to convince, with degrees of success proportional to their marketing budgets, that a Data Catalog is not merely a high-performance search tool for data teams, but an integrated solution likely to address a host of other topics.

The purpose of this blog series is to deconstruct the pitch of these eleventh-hour Data Catalog vendors.

A Data Catalog is NOT a Data Governance Solution

This is probably our most controversial stance on the role of a Data Catalog. The controversy originates with the powerful marketing messages pumped out by the world leader in metadata management, whose solution is in reality a data governance platform being sold as a Data Catalog.

To be clear, having sound data governance is one of the pillars of an effective data strategy. Governance, however, has little to do with tooling.

Its main purpose is the definition of roles, responsibilities, company policies, procedures, controls, committees. In a nutshell, its function is to deploy and orchestrate, in its entirety, the internal control of data in all its dimensions.

Let’s just acknowledge that data governance has many different aspects (processing and storage architecture, classification, retention, quality, risk, conformity, innovation, etc.) and that there is no universal “one-size-fits-all” model suited to all organizations. As in other governance domains, each organization must conceive and pilot its own landscape based on its capacities and ambitions, as well as a thorough risk analysis.

Putting effective data governance in place is not a project but a transformation program.

No commercial “solution” can replace that transformation effort.

So Where Does the Data Catalog fit into All This?

The quest for a Data Catalog is usually the result of a very operational requirement: once the Data Lake and a number of self-service tools are set up, the next challenge quickly becomes finding out what the Data Lake actually contains (both from a technical and a semantic perspective), where the data comes from, what transformations the data may have undergone, who is in charge of the data, what internal policies apply to the data, who is currently using the data and why, etc.

An inability to provide this type of information to the end user can have serious consequences for an organization, and a Data Catalog is the best means to mitigate that risk. Because this is a transverse solution involving people from many different departments, its selection is often handed to those in charge of data governance, as they appear to be in the best position to coordinate the expectations of the largest number of stakeholders.

This is where the alchemy begins. The Data Catalog, whose initial purpose was to provide data teams with a quick solution to discover, explore, understand, and exploit the data, becomes a gargantuan project in which all aspects of governance have to be solved.

The project will be expected to:

  • Manage data quality.
  • Manage personal data and compliance (GDPR first and foremost).
  • Manage confidentiality, security, and data access.
  • Propose a new Master Data Management (MDM).
  • Ensure a field-by-field automated lineage for all datasets.
  • Support all the roles as defined in the system of governance and enable the relevant workflow configuration.
  • Integrate all the business models produced in the last 10 years for the urbanization program.
  • Authorize cross-source querying on the data sources while complying with user authorizations on those same sources, as well as anonymizing the results.

Certain vendors manage to convince their clients that their solution can be this unique one-stop shop for data governance. If you believe this is possible, by all means call them; they will gladly oblige. But to be frank, we simply do not believe such a platform is possible, or even desirable. Too complex, too rigid, too expensive, and too bureaucratic, this kind of solution can never be adapted to a data-centric organization.

For us, the Data Catalog plays a key role in a data governance program. This role should not involve supporting all aspects of governance but should rather be utilized to facilitate communication and awareness of governance rules within the company and to help each stakeholder become an active part of this governance.

In our opinion, a Data Catalog is one of the components that delivers the biggest return on investment in data-centric organizations that rely on Data Lakes with modern data pipelines…provided it can be deployed quickly and is reasonably priced.

Takeaway

A Data Catalog is not a data governance management platform.

Data governance is essentially a transformation program with multiple layers that cannot be addressed by one single solution. In a data-centric organization, the best way to start, learn, educate, and remain agile is to blend clear governance guidelines with a modern Data Catalog that can share those guidelines with the end users.

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale. Actian data intelligence solutions help streamline complex data environments and accelerate the delivery of AI-ready data. Designed to be flexible, Actian solutions integrate seamlessly and perform reliably across on-premises, cloud, and hybrid environments. Learn more about Actian, the data division of HCLSoftware, at actian.com.
Data Intelligence

Data Governance Framework | S03-E02 – Start in Under 6 Weeks

Actian Corporation

June 9, 2021

This is the last episode of our third and final season of “The Effective Data Governance Framework”.

Divided into two episodes, this final season will focus on the implementation of metadata management with a data catalog.

In this final episode, we will help you start a 3-6 week data journey and then deliver the first iteration of your Data Catalog.

Season 1: Alignment

Evaluate your Data maturity

Specify your Data strategy

Getting sponsors

Build a SWOT analysis

Season 2: Adapting

Organize your Data Office

Organize your Data Community

Creating Data Awareness

Season 3: Implementing Metadata Management with a Data Catalog

The importance of metadata

6 weeks to start your data governance journey

Metadata Governance Iterations

We are using an iterative approach based on short cycles (6 to 12 weeks at most) to progressively deploy and extend the metadata management initiative in the Data Catalog.

These short cycles make it possible to quickly obtain value. They also provide an opportunity to communicate regularly via the Data Community on each initiative and its associated benefits.

Each cycle is organized in predetermined steps, as follows:

1. Identify the Goal

A perimeter (data, people), a target.

2. Deploy / Connect

Technical configuration of scanners and ability to harvest the information.

Scanners deployed and operational.

3. Conceive and Configure

A metamodel tailored to meet expectations.

4. Import the Items

Define the core (minimum viable) information to properly serve the users.

5. Open and Test

Validate whether the effort produced the expected value.

6. Measure the Gains

Fine-grained analysis of the cycle to identify what worked, what didn’t, and how to improve the next cycle.

Data Intelligence

Data Strategy: How to Break Down Data Silos

Actian Corporation

June 8, 2021

Whether it comes from product life cycles, marketing, or customer relations, data is omnipresent in the daily life of a company. Customers, suppliers, employees, partners… they all collect, analyze, and exploit data in their own way.

The risk: The appearance of silos. Let’s discover why your data is siloed and how to put an end to it.

A company is made up of different functions that coordinate their actions to establish themselves in their market and generate profit. Each of these functions fulfills specific missions and collects data. Marketing, sales, customer success teams, communication…all of these entities act on a daily basis and base their actions on their own data.

The problem is that, over the course of their relationship with the company, a customer will generate a certain amount of information.

A simple lead becomes a prospect, who then becomes a customer. The same person may be classified differently depending on which part of the business is analyzing the data.

This reality is what we call a data silo. In other words, data is poorly or never shared and therefore too often untapped. 

In a study by IDC entitled “The Data-Forward Enterprise” published in December 2020, 46% of French companies forecast a 40% annual growth in the volume of data to be processed over the next two years.

Nearly 8 out of 10 companies consider data governance to be essential. However, only 11% of them believe they are getting the most out of their data. The most common reason for this is data silos.

What are the Major Consequences of Data Silos?

Among the frequent problems linked to data silos, we find first and foremost the problem of duplicated data. Since data is used blindly by the business, what could be more natural?

These duplicates have unfortunate consequences. They distort the knowledge you can have of your products or your customers. This biased, imperfect information often leads to imprecise or even erroneous decisions.

Duplicated data also takes up unnecessary space on your servers, and that storage space represents an additional cost for your company! Beyond the impact of data silos on your company’s decisions, strategies, or finances, there is also the organizational deficit.

When your data is in silos, your teams can’t collaborate effectively because they don’t know if they’re mining the same soil.

At a time when collective intelligence is a cardinal value, this is undoubtedly the most harmful effect of data silos.

Does Your Company Suffer From Data Silos?

There are many causes for siloed data. Most often, they are associated with the history of your information systems. Over the years, these systems were built as a patchwork of business applications that were not always designed with interoperability in mind.

Moreover, a company is like a living organism. It welcomes new employees when others leave. In everyday life, spreading data culture throughout the workforce is a challenge! Finally, there is the place of data in the key processes of organizations.

Today, data is central. But 5 to 10 years ago, it was much less so. Now that you know that you are suffering from data silos, you need to take action.

How Do You Get Rid of Data Silos?

To get started on the road to eradicating data silos, you need to proceed methodically.

Start by recognizing that the process will inevitably take some time. The prerequisite is creating a detailed map of all your databases and information systems. The data they hold can be produced by different tools and solutions such as emails, CRMs, various spreadsheets, financial documents, customer invoices, etc.

It is also necessary to start by identifying all your data sources in order to centralize them in a single repository. To do this, you can, for example, create bridges between the silos by using specific connectors, also called APIs. The second option is to implement a platform on your information system that will centralize all the data.

Working as a data aggregator, this platform will also consolidate data by tracking duplicates and keeping the most recent information. A Data Catalog Solution will prevent the reappearance of data silos once deployed.
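To make the consolidation step concrete, here is a minimal sketch in Python of "tracking duplicates and keeping the most recent information". It is only an illustration: the `CustomerRecord` fields, the silo names, and the choice of a normalized email address as the matching key are all assumptions, not a description of how any particular platform works.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class CustomerRecord:
    email: str        # hypothetical matching key shared by all silos
    source: str       # hypothetical silo name, e.g. "crm" or "marketing"
    status: str       # how this silo classifies the person
    updated_on: date  # last time this silo updated the record


def consolidate(records):
    """Keep one record per normalized email: the most recently updated one."""
    latest = {}
    for record in records:
        key = record.email.strip().lower()
        current = latest.get(key)
        if current is None or record.updated_on > current.updated_on:
            latest[key] = record
    return list(latest.values())


records = [
    CustomerRecord("Ana@example.com", "marketing", "lead", date(2021, 1, 10)),
    CustomerRecord("ana@example.com ", "sales", "prospect", date(2021, 3, 2)),
    CustomerRecord("ana@example.com", "crm", "customer", date(2021, 5, 20)),
    CustomerRecord("bo@example.com", "crm", "prospect", date(2021, 4, 1)),
]

consolidated = consolidate(records)
for record in consolidated:
    print(record.email.strip().lower(), record.status, record.source)
```

In this sketch the three records for the same person, seen as a lead, a prospect, and a customer by three different silos, collapse into the single most recent one. Real-world matching is usually harder than comparing email addresses, which is one reason the human side of the project matters.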

But beware: data quality, optimized circulation between departments, and coordinated use of data to improve performance are also a human project.

Sharing best practices, training, raising awareness – in a word, creating a data culture within the company – will be the key to eradicating data silos once and for all.

Data Intelligence

Essential Keys to a Successful Cloud Migration

Actian Corporation

June 8, 2021

The recent COVID-19 pandemic has brought about major changes in the work culture, and the Cloud is becoming an essential part of that culture by offering employees access to the company’s data, wherever they are. But why migrate? How do you migrate? And for what benefits? Here is an overview:

Head in the clouds and feet on the ground, that’s the promise of the Cloud, which has proven to be an essential tool for business continuity during the health crisis.

In a study conducted by Vanson Bourne at the end of 2020, it appears that more than 8 out of 10 business leaders (82%) accelerated their decision to migrate their critical data and business functions to the Cloud after facing the COVID-19 crisis. 91% of survey participants say they have become more aware of the importance of data in the decision-making process since the crisis began.

Cloud and data. A duo that is now inseparable from business performance.

This reality is not limited to a specific market. Support for Cloud data migration is almost worldwide. The Vanson Bourne study highlights a shared awareness on an international scale, with telling figures:

  • United States (97%)
  • Germany and Japan (93%)
  • United Kingdom (92%)

Finally, 99% of Chinese executives are accelerating their plans to complete their migration to the Cloud. In this context, the question “Why migrate to the Cloud” is unequivocally answered: if you don’t, your competitors will do it before you and will definitely beat you to it.

The Main Benefits of Cloud Migration

Ensuring successful Cloud data migration is first and foremost a question of guaranteeing data availability in all circumstances. Once stated, this benefit leads to many others. If data is accessible everywhere and at all times, a company is able to meet the demand for mobility and flexibility expressed by employees.

This requirement was fulfilled during the successive lockdowns and should continue as the return to normalcy finally seems possible. Employees who are fully operational at home, in the office, or in the countryside promise not only increased productivity but also a considerable improvement in the user experience. HR benefits are not the only consequences of Cloud migration.

From a financial point of view, the Cloud opens the way to a better control of IT costs. By shifting data from a CAPEX dimension to an OPEX dimension, you can improve the TCO (Total Cost of Ownership) of your information system and your data assets. Better experience, budget control, the Cloud opens the way to optimized data availability.

Indeed, when migrating to the Cloud, your partners make commitments in terms of maintenance or backups that guarantee maximum access to your data. You should therefore pay particular attention to these commitments, which are referred to as SLAs (Service Level Agreements).

Finally, by migrating data to the cloud, you benefit from the expertise and technical resources of specialized partners who deploy resources that are far superior to those that you could have on your own.

How to Successfully Migrate to the Cloud

Data is, After Human Resources, the Most Valuable Asset of a Company

This is one of the reasons why companies should migrate to the Cloud. But the operation must be carried out in the best conditions to limit the risk of data degradation, as well as the temporary unavailability that impacts your business.

To do this, preparation is essential and relies on one prerequisite: the project does not only concern IT teams, but the entire company. 

Support, reassurance, training: the triptych that is essential to any change management process must be applied. Then make sure you give yourself time. Avoid the Big Bang mode, which could irritate your teams and dampen their enthusiasm. Even if the Cloud migration of your data should go smoothly, stack the odds in your favor by making backups of your data.

Rely on redundancy to prepare for any eventuality, including (and especially!) the most unlikely. Once the deployment on the cloud is complete, ensure the quality of the experience for your employees. By conducting rigorous long-term project management, you can easily identify if you need to make adjustments to your initial choices.

The scalability of the Cloud model is a strength that you should seize upon to constantly adapt your strategy.

Data Intelligence

Data Governance Framework | S03-E01 – Importance of Metadata

Actian Corporation

June 2, 2021

This is the first episode of our third and final season of “The Effective Data Governance Framework”.

Divided into two episodes, this final season will focus on the implementation of metadata management with a data catalog.

For this first episode, we will give you the right questions to ask yourself to build a metamodel for your metadata.

Season 1: Alignment

Evaluate your Data maturity

Specify your Data strategy

Getting sponsors

Build a SWOT analysis

Season 2: Adapting

Organize your Data Office

Organize your Data Community

Creating Data Awareness

Season 3: Implementing Metadata Management with a Data Catalog

The importance of metadata

6 weeks to start your data governance journey

In our previous Season, we gave you our tips on how to build your Data Office, organize your Data Community, and build your Data Awareness.

In this third season, you will step into the real world of implementing a Data Catalog, building on Seasons 1 and 2, which helped you specify your Data Journey Strategy.

In this episode, you will learn how to ask the right questions for designing your Metamodel.

The Importance of Metadata

Metadata management is an emerging discipline and is necessary for enterprises wishing to bolster innovation or regulatory compliance initiatives on their data assets.

Many companies are therefore trying to establish their convictions on the subject and brainstorm solutions to meet this new challenge. As a result, metadata is increasingly being managed, alongside data, in a partitioned and siloed way that does not allow the full, enterprise-wide potential of this discipline to be realized.

Before beginning your data governance implementation, you will have to cover different aspects, ask yourself the right questions and figure out how to answer them.

Our Metamodel Template is a way to identify the main aspects of data governance by asking the right questions. For each question, you decide on its relevance.

These questions can also be used as support for your data documentation model and can provide useful elements to data leaders.

The Who

  • Who created this data?
  • Who is responsible for this data?
  • Who does this data belong to?
  • Who uses this data?
  • Who controls or audits this data?
  • Who is accountable for the quality of this data?
  • Who gives access to this data?

The What

  • What is the “business” definition for this data?
  • What are the associated business rules of this data?
  • What is the security/confidentiality level of this data?
  • What are the acronyms or aliases associated with this data?
  • What are the security/confidentiality rules associated with this data?
  • What is the reliability level (quality, velocity, etc.) of this data?
  • What are the authorized contexts of use (related to confidentiality for example)?
  • What are the (technical) contexts of use possible (or not) for this data?
  • Is this data considered a “Golden Source”?

The Where

  • Where is this data located?
  • Where does this data come from? (a partner, open data, internally, etc.)
  • Where is this data used/shared?
  • Where is this data saved?

The Why

  • Why are we storing this data (rather than processing it as a flow)?
  • What is this data’s current purpose/usage?
  • What are the possible usages for this data? (in the future)

The When

  • When was the data created?
  • When was this data last updated?
  • What is this data’s life cycle (update frequency)?
  • How long are we storing this data for?
  • When does this data need to be deleted?

The How

  • How is this data structured (diagram)?
  • How do your systems consume this data?
  • How do you access this data?

Start Defining Your Metamodel Template

These questions can serve as a foundation for building your data documentation model and providing data consumers with the elements that are useful to them.
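As an illustration, the question families above can be sketched as a small documentation template in Python. This is only a sketch under stated assumptions: the attribute names (`owner`, `steward`, `retention_period`, and so on) and the sample dataset are hypothetical, and you would keep only the questions you judged relevant.

```python
from dataclasses import dataclass, field

# Hypothetical attribute names, one or two per question family.
METAMODEL_TEMPLATE = {
    "who": ["owner", "steward"],
    "what": ["business_definition", "confidentiality_level"],
    "where": ["location", "origin"],
    "why": ["current_purpose"],
    "when": ["created_on", "retention_period"],
    "how": ["access_method"],
}


@dataclass
class DatasetDocumentation:
    name: str
    answers: dict = field(default_factory=dict)

    def missing(self, template=METAMODEL_TEMPLATE):
        """Return the template attributes that have no answer yet, by family."""
        gaps = {}
        for family, attributes in template.items():
            unanswered = [a for a in attributes if not self.answers.get(a)]
            if unanswered:
                gaps[family] = unanswered
        return gaps


# Hypothetical dataset, documented only partially.
doc = DatasetDocumentation(
    name="customer_orders",
    answers={
        "owner": "Sales Operations",
        "business_definition": "One row per confirmed customer order",
        "location": "data lake, sales zone",
        "created_on": "2019-03-01",
    },
)

gaps = doc.missing()
for family, attributes in gaps.items():
    print(family, attributes)
```

Listing the unanswered attributes per family gives a simple view of what is still missing before a dataset can be considered properly documented.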

Data Intelligence

Data Governance Framework | S02-E03 – Data Awareness

Actian Corporation

May 30, 2021

This is the final episode of the second season of “The Effective Data Governance Framework” series.

Divided into three parts, this second season will focus on Adaptation. This consists of:

  • Organizing your Data Office.
  • Building a data community.
  • Creating Data Awareness.

For this third and final episode of the season, we will help you use awareness support techniques that reduce the effort needed for communication tasks, make everyone aware of what the Data Governance Team is doing, and get buy-in and alignment at all levels.

Season 1: Alignment

Evaluate your Data maturity

Specify your Data strategy

Getting sponsors

Build a SWOT analysis

Season 2: Adapting

Organize your Data Office

Organize your Data Community

Creating Data Awareness

Season 3: Implementing Metadata Management With a Data Catalog

The importance of metadata

6 weeks to start your data governance journey

In the last episode, we explained how to organize your Data Community by building your Data Chapters and Data Guilds.

In this episode, we will help you use awareness support techniques that reduce the effort needed for communication tasks and create data awareness at the enterprise level.

We advise using the SMART framework to plan and execute the Data Awareness program.

What are SMART Goals?

  • Specific: What do you want to accomplish? Why is this goal important? Who is involved? What resources are involved?
  • Measurable: Are you able to track your progress? How will you know when it’s accomplished?
  • Achievable: Is achieving this goal realistic with effort and commitment? Do you have the resources to achieve this goal? If not, how will you get them?
  • Relevant: Why is this goal important? Does it seem worthwhile? Is this the right time? Does this match efforts/needs?
  • Timely: When will you achieve this goal?

The “SMART” Method for Your Data Teams

If you think about the level of reach a team has, you can summarize it in 3 spheres:

  • The Control sphere is the one your Data Team reaches and interacts with directly.
  • The Influence sphere is the level where you can find sponsors and get help.
  • The Concern sphere consists of the C-levels who need to be informed on how things are progressing from a high-level perspective.

In other words, you will have to touch all the stakeholders involved but with different means, timing and interactions.

Spend time creating nice formats, and pay attention to the form of all your artifacts.

Examples of SMART Tasks

You will find below examples of SMART tasks:

For the Control sphere, we advise you to do the following:

  • Deliver training (for both Data Governance teams and end users).
  • Deliver presentations dedicated to teams (Strategy, OKRs, Roadmap, etc.).
  • Keep your burn-down charts and all visual management tools displayed at all times.

For the Influence sphere, we advise you to:

  • Celebrate your first milestones.
  • Organize sprint demos.
  • Display team OKRs constantly.

And for the Concern sphere, we advise you to:

  • Celebrate the end of a project.
  • Organize product demos.
  • Record videos and make them available.

Data Intelligence

Data Governance Framework | S02-E02 – Data Community

Actian Corporation

May 18, 2021

This is the second episode of the second season of “The Effective Data Governance Framework” series.

Divided into three parts, this second season will focus on Adaptation. This consists of:

  • Organizing your Data Office.
  • Building a data community.
  • Creating Data Awareness.

For this second episode, we will give you the keys to organizing an efficient and effective data community in your company.

Season 1: Alignment

Evaluate your Data maturity

Specify your Data strategy

Getting sponsors

Build a SWOT analysis

Season 2: Adapting

Organize your Data Office

Organize your Data Community

Creating Data Awareness

Season 3: Implementing Metadata Management With a Data Catalog

The importance of metadata

6 weeks to start your data governance journey

Spotify Feature Teams: A Good Practice, or a Failure?

In the last episode, we explained how to build your Data Office with Personas and the Spotify Feature Teams paradigm.

The Spotify model has been criticized because there have been failures at companies that tried to implement it.

The three main reasons were:

  • Autonomy is nice, but it does not mean that teams can do whatever they want; alignment needs to be emphasized.
  • Key results need to be defined at the leadership level, which is why building your OKRs is the right thing to do.
  • Autonomy means accountability: teams have to be measured, the increments they are working on need to be completed, and the definition of “Done” has to be specified.

In this episode, we will focus on the Chapters and Guilds and how to organize and better leverage your Data Community.

How to Organize Your Chapters and Guilds

Chapters

Collaboration in Chapters and Guilds needs specific knowledge and experience and it is wrong to assume that teams know Agile Practices.

When teams are growing, there is a need to have dedicated support and therefore, the Program Managers in charge of data related topics are accountable for the processes and organization of the Data Community.

At the highest level, organizing your data community means sharing knowledge at all levels: technological, functional, or even specific practices around data related topics.

The main drivers to focus on the Chapters organization are:

  • Teams miss information.
  • Teams miss knowledge.
  • Teams repeat mistakes.
  • Teams need ceremonies and commonly agreed agile practices.

Chapters meet regularly and often.

We advise meeting once a month. When it grows too big, a Chapter can be split into smaller groups. Even if it is a position that can change over time, a Chapter needs a leader, not a manager.

The leader is in charge of facilitating the Chapter and making it efficient by:

  • Getting the right people involved.
  • Sharing outcomes with upper level management.
  • Coordinating and moderating meetings.
  • Helping to establish transparency.
  • Finding a way of sharing and keeping available all the knowledge shared.
  • Defining the Chapter: why, for whom and what it is meant for.

A tip is to define an elevator pitch for the Chapter.

The Chapter leader is also responsible for building a backlog to avoid endless discussions with no outcome.

Typically, the backlog consists of the following topics:

Data Topics

  • Chapter data people culture.
  • Chapter data related topics in continuous improvement.
  • Chapter data practices.
  • Chapter data processes.
  • Chapter data tools.

Generic Topics

  • Chapter continuous improvement.
  • Chapter feedback collection.
  • Chapter agility practices.
  • Chapter generic tools.
  • Chapter information sharing.
  • Chapter education program.

The Chapter leader is in charge of communicating outside of their Chapter with other Chapter leaders and needs dedicated time allocated to facilitate it.

How to Start a Chapter

  • Identify the community and all members.
  • Name the Chapter.
  • Organize the first chapter meeting.
  • Define the elevator pitch.
  • Initialize the Chapter web page (and keep it updated for onboarding future members).
  • Negotiate and build the first backlog.
  • Plan the meetings.

Guilds

Guilds should be organized differently, in a self-organized way.

The reason for Guilds to exist is passion, and the teams are built only on a voluntary basis.

In order to avoid the syndrome of too many useless meetings, we advise allowing Guilds to meet only in certain circumstances, such as:

  • Training sessions and workshops, but in short formats such as BBLs (Brown Bag Lunches), on the topics the Guild was built for.
  • Q&A sessions with top executives to emphasize the Why of the data strategy.
  • Hack days to crack a topic.
  • Post mortem meetings after a major issue has occurred.

Data Intelligence

Data Governance Framework | S02-01 – Organizing Your Data Office

Actian Corporation

May 17, 2021

This is the first episode of the second season of “The Effective Data Governance Framework” series.

The series is split into three seasons; this second season focuses on Adaptation. It consists of:

  • Organizing your Data Office
  • Building a data community  
  • Creating Data Awareness

For this first episode, we will give you the keys to building your data personas and setting up a clear and well-defined Data Office. 

Season 1: Alignment

Evaluate your Data maturity

Specify your Data strategy

Getting sponsors

Build a SWOT analysis

Season 2: Adapting

Organize your Data Office

Organize your Data Community

Creating Data Awareness

Season 3: Implementing Metadata Management With a Data Catalog

The importance of metadata

6 weeks to start your data governance journey

In the first season, we shared our best practices to help you align your data strategy with your company. For us, it is essential to:

  • Assess the maturity of your data.
  • Specify your Data Strategy by building OKRs.
  • Get sponsorship.
  • Build an effective SWOT analysis.

In this first episode, we will teach you how to build your Data Office.

The Evolution of Data Offices in Companies

We believe in Agile Data Governance.

Previous implementations of data governance within organizations have rarely been successful. The Data Office often focuses too much on technical management or a strict control of data.

For data users who strive to experiment and innovate around data, Data Office behavior is often synonymous with restrictions, limitations, and cumbersome bureaucracy.

Some will have gloomy visions of data locked up in dark catacombs, only accessible after months of administrative hassle. Others will recall the wasted energy at meetings, updating spreadsheets and maintaining wikis, only to find that no one was ever benefiting from the fruits of their labor.

Companies today are conditioned by regulatory compliance to guarantee data privacy, data security, and to ensure risk management.

That said, taking a more offensive approach towards improving the use of data in an organization by making sure the data is useful, usable and exploited is a crucial undertaking.

Using modern organizational paradigms with new ways of interacting is a good way to set up an efficient, flat Data Office organization.

Below are the typical roles of a Data Office, although very often, some roles are carried out by the same person:

  • Chief data officer
  • Data-related Portfolio/Program/Project managers
  • Data Engineers / Architects
  • Data scientists
  • Data analysts
  • Data Stewards

Creating Data Personas

An efficient way of specifying the roles of Data Office stakeholders is to work on their personas.

By conducting one-on-one interviews, you will learn a lot about them: context, goals and expectations. The OKRs map is a good guide for building personas, because it helps you ask precise questions.

Here is an example of a persona template:

Some Useful Tips:

  • Personas should be displayed in the office of all Data Office team members.
  • Make it fun, choose an avatar or a photo for each team member, write a small personal and professional bio, list their intrinsic values and work on the look and feel.
  • Build one persona for each person; don’t build personas for teams.
  • Be very precise in the personas definition interviews, rephrase if necessary.
  • Treat people with respect and consider all ideas equally.
  • Print them and put them on the office walls for all team members to see.
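As a rough illustration of what a persona record could contain, here is a minimal Python sketch. The field names are assumptions derived from the interview topics (context, goals, expectations) and the tips above (avatar, bio, values); they are not an official template, and the example person is invented.

```python
from dataclasses import dataclass, field


@dataclass
class Persona:
    """One persona per person (not per team), as the tips above recommend."""

    name: str
    role: str                                   # e.g. "Data Steward"
    avatar: str = ""                            # path or URL to an avatar or photo
    bio: str = ""                               # short personal and professional bio
    values: list[str] = field(default_factory=list)        # intrinsic values
    context: str = ""                           # gathered during the interview
    goals: list[str] = field(default_factory=list)
    expectations: list[str] = field(default_factory=list)

    def to_card(self) -> str:
        """Render a plain-text card that can be printed and put on a wall."""
        lines = [
            f"{self.name} ({self.role})",
            f"Bio: {self.bio}",
            "Values: " + ", ".join(self.values),
            f"Context: {self.context}",
            "Goals: " + "; ".join(self.goals),
            "Expectations: " + "; ".join(self.expectations),
        ]
        return "\n".join(lines)


# Hypothetical example: the name and answers are invented for illustration.
steward = Persona(
    name="Alex",
    role="Data Steward",
    bio="Ten years in finance, joined the Data Office last year.",
    values=["transparency", "rigor"],
    context="Maintains the documentation of the sales datasets.",
    goals=["Document the top 20 sales datasets"],
    expectations=["A single place to find dataset owners"],
)
print(steward.to_card())
```

Keeping the record per person, rather than per team, mirrors the tip above and makes it easy to regenerate the printed cards whenever an interview is updated.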

Building Cross-Functional Teams

In order to get rid of Data and organizational silos, we recommend you organize your Data Office in Feature Teams (see literature on the Spotify feature teams framework on the internet).

The idea is to build cross-functional teams to address a specific feature expected by your company.

The Spotify Model Defines the Following Teams:

Squads

Squads are cross-functional, autonomous teams that focus on one feature area. Each Squad has a unique mission that guides the work they do.

In season 1, episode 2, in our OKRs example, the CEO has 3 OKRs and the first OKR (Increase online sales by 2%) has generated 2 OKRs:

  • Get the Data Lake ready for growth, handled by the CIO
  • Get the data governed for growth, handled by the CDO.

There would then be 2 squads:

  • Feature 1: get the Data Lake ready for growth
  • Feature 2: get data governed for growth.

Tribes

At the level below, multiple Squads coordinate with each other on the same feature area. They form a Tribe. Tribes help build alignment across Squads. Each Tribe has a Tribe Leader who is responsible for helping coordinate across Squads and encouraging collaboration.

In our example, for the Squad in charge of the feature “Get Data Governed for growth”, our OKRs map tells us that there is a Tribe in charge of “Get the Data Catalog ready”.

Chapter

Even though Squads are autonomous, it’s important that specialists (Data Stewards, Analysts) align on best practices. Chapters are the family that each specialist has, helping to keep standards in place across a discipline.

Guild

Team members who are passionate about a topic can form a Guild, which essentially is a community of interest (for example: data quality). Anyone can join a Guild and they are completely voluntary. Whereas Chapters belong to a Tribe, Guilds can span different Tribes. There is no formal leader of a Guild. Rather, someone raises their hand to be the Guild Coordinator and help bring people together.

Here is an example of a Feature Team organization:

Don’t miss next week’s S02-E02:

Building your Data Community, where we will help you adapt your organization in order to become more data-driven.

Data Intelligence

Data Governance Framework | S01-E04 – SWOT Analysis

Actian Corporation

May 9, 2021

This is the fourth episode of our series “The Effective Data Governance Framework”. Split into three seasons, this first part will focus on Alignment: understanding the context, finding the right people, and preparing an action plan for your data-driven journey. 

This episode will give you the keys to building a concrete and actionable SWOT analysis.

In our previous episode, we discussed the different means to obtain the right level of sponsorship to ensure endorsement from decision makers.

This week, we will teach you how to build a concrete and actionable SWOT analysis to assess the company Data Governance Strategy in the best possible way.

What is a SWOT Analysis?

Before we give our tips and tricks on building the best SWOT analysis possible, let’s go back and define what a SWOT analysis is. 

A SWOT analysis is a technique used to determine and define your Strengths, Weaknesses, Opportunities, and Threats (SWOT). Here are some examples:

Strengths

This element addresses the things your company or department does especially well. This can be a competitive advantage or a particular attribute of your product or service. An example of a “strength” for a data-driven initiative would be “Great data culture” or “Data shared across the entire company”.

Weaknesses

Once your strengths are listed, it is important to list your company’s weaknesses. What is holding your business or project back? Taking our example, a weakness in your data or IT department could be “Financial limitations”, “Legacy technology”, or even “Lack of a CDO”. 

Opportunities

Opportunities refer to favorable external factors that could give an organization a competitive advantage. Few competitors in your market, emerging needs for your product: all of these are opportunities for a company. In our context, an opportunity could be “Migrating to the Cloud” or “Extra budget for data teams”.

Threats

The final element of a SWOT analysis is Threats – everything that poses a risk to either your company itself or its likelihood of success or growth. For a data team, a threat could be “Stricter regulatory environment for data” for example.

How to Start Building a Smart SWOT Analysis

Building a good SWOT analysis means adopting a democratic approach that will ensure you don’t miss important topics.

There are 3 principles you should follow:

Gather the Right People

Invite stakeholders from across your Data Governance team, from Business to IT, including CDO and CPO representatives. You’ll find that different groups within your company will have entirely different perspectives that will be critical to making your SWOT analysis successful.

Throw Your Ideas Against the Wall

A SWOT analysis consists, in part, of brainstorming meetings. We suggest giving out sticky notes and encouraging the team to generate ideas on their own to start things off. This prevents groupthink and ensures that all voices are heard.

This first ceremony should be no more than 15 minutes of individual brainstorming. Then put all the sticky notes up on the wall and group similar ideas together.

You can allot additional time to enable anyone to add notes at this point if someone else’s idea sparks a new thought.

Rank the Ideas

It is now time to rank the ideas. We suggest giving a certain number of points to each participant. Each participant will rate the ideas by assigning points to the ones they consider most relevant. You will then be able to prioritize them with accuracy.
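The point-based ranking described above can be tallied in a few lines. The Python sketch below is only an illustration: the function name `rank_ideas`, the budget of 5 points per person, and the sample votes are assumptions, not part of the framework.

```python
from collections import Counter


def rank_ideas(votes: dict[str, dict[str, int]], points_per_person: int) -> list[tuple[str, int]]:
    """Sum the points each participant gave to each idea and sort by total.

    votes maps a participant to {idea: points}. A participant may not spend
    more than points_per_person points in total.
    """
    totals: Counter[str] = Counter()
    for participant, allocation in votes.items():
        if any(points < 0 for points in allocation.values()):
            raise ValueError(f"{participant} assigned negative points")
        if sum(allocation.values()) > points_per_person:
            raise ValueError(f"{participant} spent more than {points_per_person} points")
        totals.update(allocation)  # adds the points to the running totals
    # Highest total first; ties are broken alphabetically for a stable order.
    return sorted(totals.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical votes on the "Weaknesses" quadrant, 5 points per participant.
votes = {
    "business": {"Legacy technology": 3, "Lack of a CDO": 2},
    "it": {"Legacy technology": 4, "Financial limitations": 1},
    "cdo": {"Lack of a CDO": 3, "Financial limitations": 2},
}
ranking = rank_ideas(votes, points_per_person=5)
for idea, points in ranking:
    print(f"{points:>2}  {idea}")
```

Capping the points per participant is what forces each person to prioritize, and checking the cap in code avoids arguments about the tally after the meeting.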

Data Platform

Actian on Google Cloud Delivers High-Speed and Pay-for-What-You-Need

Actian Corporation

May 6, 2021

The Actian Data Platform (formerly Avalanche), with the fastest columnar analytics engine available in the market today, is now generally available on Google Cloud Platform (GCP) via Google Cloud and the Google Marketplace. This is big news for organizations that need the flexibility to scale dynamically yet pay only for the resources they need, and that want an ultra-fast, MPP columnar database engine capable of delivering faster, deeper analytical insights while making it easier than ever for end users to access the data they need to do their jobs. If ever you needed a good reason to embrace the power of the cloud, this is it.

Google Cloud is the smallest of the big three providers but is growing fast. The experience of building on Google Cloud is hard to beat as well: Google has invested a lot of time in building a great developer experience. Given the great experience Actian has had building on Google, we expect many others will find similar benefits in using Google Cloud to facilitate their move to the cloud.

Use Cases – Where Actian on GCP Shines

Interest in data analysis only grows as organizations become more sophisticated and seek greater insights—into their customers’ inclinations, the efficiency of their business processes, and their ability to seize short-lived opportunities. Organizations of all sizes, in a wide range of geographies and industries — including financial services, automotive, healthcare, and retail/e-commerce — are taking advantage of the ultra-high performance and scalability that Actian on GCP offers right out of the virtual box. Some examples are below:

  1. A Fortune 10 financial data services company uses Actian on GCP alongside their trading applications to deliver an edge for its customers, enabling them to perform ad hoc analytics on significant amounts of data returning requested results with sub-second response time.
  2. A top Western European company in the automotive industry has used Actian on GCP to accelerate the delivery — and increase the accuracy — of price quotations for prospects and customers. Data involved in risk assessment, for example, accident reports, driving records, and so forth can change rapidly, so the ability to deliver quotations with up-to-the-moment data provides more accurate risk assessments, delivering a competitive advantage in a highly competitive industry.
  3. Service providers in the healthcare claims payment space leverage Actian operational analytics to extract insights from high volumes of claim data and provide decision-makers with the most up-to-date data for analytical review to eliminate errors, abuse and fraud. Elsewhere in the healthcare sector, innovative clinical trial service units rely on Actian on GCP to analyze large volumes of clinical trial data for patterns and trial performance insights.
  4. A leading French retailer relies on the highly performant analytics engine of Actian on GCP to increase marketing effectiveness and efficiency. The company’s ability to draw insights more rapidly from its own data enables it to optimize marketing spend and gain a better ROI.

Ultra-High Performance

GigaOM, a leading independent organization known for benchmarking database performance, recently put Actian to the test against competing cloud products, including AWS Redshift, Snowflake, Azure, and BigQuery.  In terms of price/performance, Actian beat every one of them. Handily.

And you don’t need to take our word for it: Read the full GigaOM Report.

So, What’s New About Actian?

Under the hood, the Actian database and integration engines are tried and tested. They have been evolving for years in response to real-world demands. Actian on GCP is the latest iteration of this proven platform, and it incorporates cutting-edge technological advances, including advances in security and user management.

Some of the exciting new features just announced:

  1. Separation of Storage and Compute – Users of Actian on GCP can scale compute and storage resources independently and automatically. Google Cloud Storage provides all the storage you need – and you only pay for the storage you need.
  2. Google Marketplace Integration – Forget about complicated and time-consuming scripted installations. Actian is now containerized and available directly through Google Marketplace. This has some great benefits, including the ability to subscribe and set up Actian on GCP as a managed SaaS offering with just a few clicks. Because Actian can be procured through Google, existing Google commits can be retired against Actian purchases.
  3. Google SSO – Google Account users can gain rapid access to Actian on GCP via Google Single Sign On (SSO).
  4. Role-Based User Management – Actian now provides a GUI-based user management system that enables fine-grained, role-based permission management. Actian administrators can use SQL to manage even more fine-grained permissions, but the GUI-based management system provides a fast and easy way to set up role-based user profiles.
  5. Kubernetes Containerization Management – All backend Actian services are now containerized and managed by the Google Kubernetes Engine (GKE). GKE automatically handles the orchestration and management of its underlying pods/containers, which dramatically accelerates backend service execution. Operations such as deleting a warehouse from Actian on GCP can take only seconds—whereas the same operation on AWS or Azure would take minutes. Such optimizations ensure that organizations pay only for what they really need (and don’t prolong resource use unnecessarily). Kubernetes containerization also paves the way for an even better experience in Actian where patches/updates/upgrades are concerned. Read more about why Kubernetes is cool here: Take full advantage of the separation of compute and storage resources with Actian on GKE.
  6. Query Result Caching – Now enabled by default for all warehouses, query result caching accelerates access to insight. After a query is executed against persistent data, its result is placed in cache. If the same query is run again, Actian returns the cached result rather than re-running the query. This significantly improves data warehouse performance and frees up resources to support the execution of novel queries.
  7. REST API – A new REST API enables users to load Actian warehouses directly. The API has a direct link to the underlying data warehouse engine, thus enabling ultra-fast loads to a warehouse.
  8. 1AU Instance Availability – For testing purposes or those use cases that require only a single node (rather than a multi-node cluster), a single Actian Unit (1AU) instance of Actian is available on GCP. The 1AU instance includes the full Actian feature set, including the recent enhancements described above.
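To make the idea behind query result caching (item 6 above) concrete, here is a minimal Python sketch of the general technique. It is not Actian's implementation: the class `QueryResultCache`, the exact-text cache key, the `invalidate` method, and the fake engine are all assumptions made for illustration.

```python
from typing import Callable


class QueryResultCache:
    """Return a stored result when the same query text is run again."""

    def __init__(self, execute: Callable[[str], list]):
        self._execute = execute              # function that really runs the query
        self._results: dict[str, list] = {}  # query text -> cached result
        self.hits = 0
        self.misses = 0

    def run(self, sql: str) -> list:
        key = sql.strip()                    # assumption: exact text match only
        if key in self._results:
            self.hits += 1
            return self._results[key]        # served from cache, engine not called
        self.misses += 1
        result = self._execute(sql)
        self._results[key] = result
        return result

    def invalidate(self) -> None:
        """Drop all cached results, e.g. after the underlying data changes.

        How a real warehouse decides when to invalidate is not described in
        the text above; this method is an assumption of the sketch.
        """
        self._results.clear()


# Hypothetical stand-in for a warehouse engine: it records every real execution.
calls: list[str] = []


def fake_engine(sql: str) -> list:
    calls.append(sql)
    return [("row_count", 42)]


cache = QueryResultCache(fake_engine)
first = cache.run("SELECT COUNT(*) FROM sales")
second = cache.run("SELECT COUNT(*) FROM sales")   # same text: served from cache
print(f"engine calls={len(calls)} hits={cache.hits} misses={cache.misses}")
```

The second call never reaches the engine, which is the effect the feature description refers to: repeated queries return faster and the engine's resources stay free for new queries.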

Current Features of the Actian Data Platform, and a Few to Come Shortly

Amidst the excitement about the new features and benefits Actian is announcing with the GA release of Actian on GCP, it’s worth stepping back to remember all the other exciting features that have recently been announced—including a robust user interface for managing the lifecycle of data warehouses (now available across Google Cloud, Azure, and AWS). Actian also recently saw a refreshed platform UI for its built-in Query Editor—as well as the ability to use multiple tabs, improved methods for viewing information about database tables, and a host of new charting capabilities.

While Actian on GCP includes pre-installed sample data to help you kick-start the evaluation process, you can also load your own data using out-of-the-box templates designed to connect to data sources such as Salesforce or to data sources on your own desktop such as Excel spreadsheets. Getting data into Actian on GCP is easy, and connecting to Actian on GCP from your favorite BI and visualization tools (such as Looker, Tableau, and Power BI) is even easier, as Actian provides connectivity information directly in the platform.

As mentioned earlier, Actian Data Platform now utilizes Kubernetes in its backend, which is a complete rewrite of the backend infrastructure. The new backend enables both the automation of management functions, such as patching and updates, and better utilization of resources. For example, deleting a warehouse in AWS and Azure would take a few minutes, while in Actian on Google Cloud, it takes only a few seconds. These small enhancements help our customers pay only for what they actually use.

Stay tuned for more updates, as Actian is heavily investing in the Actian Data Platform and looking to bring even more great features to it very soon. Want to hear more? Join our virtual Hybrid Data Conference on May 25th in North American Eastern and May 27th in Central European time zones from 11 AM to 4 PM, respectively. I’ll go into much of what was briefly touched on above in far more detail along with many of my Actian colleagues. You can find out more information here.

Data Intelligence

What are the Differences Between a Data Analyst and a Business Analyst?

Actian Corporation

April 29, 2021

The roles of a Data Analyst and a Business Analyst are very often unclear, even though their missions are very different. Their functions are more complementary than not. Let’s have a look at these two highly sought-after profiles.

Data is now at the heart of all decision-making processes. According to a study conducted by IDC on behalf of Seagate, the volume of data generated by companies worldwide is expected to reach 175 zettabytes by 2025…

In this context, collecting information is no longer enough. What’s important is the ability to draw conclusions from this data to make informed decisions.

However, the interpretation methods used and the way to exploit data can be very different. The ever-changing nature of data has created new domains of expertise with titles and functions that are often misleading or confusing.

What separates the missions of the Data Analyst from those of the Business Analyst may seem tenuous. And yet, their functions, roles, and responsibilities are very different…and complementary.

Business Analyst & Data Analyst: A Common Ground

If the roles of a Business Analyst and of a Data Analyst are sometimes unclear, it is because their missions are inherently linked to creating value with enterprise information.

What distinguishes them is the nature of this information.

While a Data Analyst works on numerical data, coming from the company’s information systems, the Business Analyst can exploit both numerical and non-numerical data.

A data analyst must ensure the processing of data within the company to extract valuable analytic trends that enable teams to adapt to the organization’s strategy. The business analyst then provides answers to concrete business issues based on a sample of data that may exceed the data portfolio generated by the company.

A Wide Range of Skills

Data Analysts must have advanced skills in mathematics and statistics. A true expert in databases and computer languages, this data craftsman often holds a degree in computer engineering or statistical studies.

The Business Analyst, on the other hand, has a less data-oriented profile (in the digital sense of the term). While they use information to fulfill their missions, they are always in direct contact with management and all of the company’s business departments. Although the Business Analyst may have skills in algorithms or SQL databases, or even master XML, these skills are not an essential prerequisite.

A Business Analyst must therefore be able to demonstrate real know-how in communicating, listening, and understanding the company’s challenges. For a Data Analyst, on the other hand, technical skills are essential. Skills in SQL, Python, data modeling, and Power BI, along with IT and analytics expertise, allow them to put the data to operational use.

The Differences in Responsibilities and Objectives

The Data Analyst’s day-to-day work consists above all of enhancing the company’s data assets. To this end, he or she will be responsible for data quality, data cleansing and data optimization.

Their objective? To provide internal teams with usable databases in the best conditions and to identify all the improvement levers likely to impact the data project. 

The Business Analyst will benefit from the work of the Data Analyst and will contribute to making the most of it by putting the company’s native data into perspective with peripheral data and information. By reconciling and enhancing different sources of information, the Business Analyst will contribute to the emergence of new market, organizational or structural opportunities to accelerate the company’s development.

In short, the Data Analyst is the day-to-day architect of the company’s data project. The Business Analyst is the one who intervenes, in the long run, on the business strategy. To meet this challenge, he or she bases his or her action on the quality of the data analyst’s work.

Data Intelligence

Data Governance Framework | S01-E03 – Getting Sponsorship

Actian Corporation

April 28, 2021

This is the third episode of our series “The Effective Data Governance Framework”. Split into three seasons, this first part will focus on Alignment: understanding the context, finding the right people, and preparing an action plan for your data-driven journey. This third episode will give you the keys to getting good sponsorship for your data projects.

In the previous episode, we discussed how best to use OKRs to draft your enterprise data strategy, ensure focus, accountability and engagement from the stakeholders with as much transparency as possible and negotiate objectives at all levels.

To a certain extent, the OKRs should help you get good sponsorship.

In this third episode, we will share insights on how best to get sponsorship.

In order to trigger an Effective Data Governance Initiative, you will need to go through the following steps, with caution:

Step 1: Identify Potential Sponsors

The first step consists of identifying all the potential sponsors and setting up one-to-one (or one-to-many, if you involve several colleagues) meetings to secure endorsements and move forward on the Data Governance you want to put in place. You have learned a lot from the OKR meetings and now have the substance to ensure their support.

Step 2: Prepare Your Storytelling

The second step is to prepare a story for each sponsor. Again, based on the workshops you were involved in on the company Data Strategy, you should be able to draft a personalized story.

You have 3 forms of storytelling which can be combined if needed:

  • Use a testimony and real story to strengthen yours.
  • Use a metaphor to illustrate the data concepts when they feel too complex.
  • Use a “springboard” story from a specific characteristic to give the big picture.

Step 3: Present Yourself

The third step consists in getting ready to describe who you are, what you do and why you do it through the prism of every sponsor.

Step 4: Asking for Money

The fourth step consists in getting ready to ask for the money. Asking for money involves proposing different scenarios with different outcomes, a detailed analysis on the costs, a quantitative view on the financial benefits and then a ROI analysis.
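The scenario comparison can be kept very simple. The Python sketch below uses the common definition ROI = (benefit − cost) / cost; the scenario names and all figures are hypothetical and only show how the comparison could be laid out.

```python
from dataclasses import dataclass


@dataclass
class Scenario:
    name: str
    cost: float      # total estimated cost over the chosen time frame
    benefit: float   # total estimated financial benefit over the same period

    @property
    def net_gain(self) -> float:
        return self.benefit - self.cost

    @property
    def roi(self) -> float:
        """Return on investment as a ratio: (benefit - cost) / cost."""
        if self.cost <= 0:
            raise ValueError("cost must be positive to compute ROI")
        return self.net_gain / self.cost


# Hypothetical figures, invented for illustration only.
scenarios = [
    Scenario("Minimal", cost=100_000, benefit=130_000),
    Scenario("Standard", cost=250_000, benefit=400_000),
    Scenario("Ambitious", cost=500_000, benefit=700_000),
]

# Present the scenarios to the sponsor from highest to lowest ROI.
for s in sorted(scenarios, key=lambda s: s.roi, reverse=True):
    print(f"{s.name:<10} cost={s.cost:>9,.0f}  benefit={s.benefit:>9,.0f}  ROI={s.roi:.0%}")
```

Showing net gain alongside ROI helps here: the scenario with the best ratio is not always the one with the largest absolute gain, and a sponsor may care about either.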

Step 5: Commit to Deliverables

The fifth step is to commit to deliverables. There won’t be endorsement if you don’t commit to tangible deliverables, results as well as a time frame.

How to Maximize Your Chances for Getting the Sponsors Aligned:

Ask for More Than You Need

Don’t sell yourself short: expect a cut in your funding request and plan accordingly.

Get a Champion

In the list of sponsors, try to build a good relationship with one in particular and ask for help and insights to maximize your chances of winning.

Be Impeccable in All Aspects

When you’re courting a sponsor, always keep to your word, always be on time or early for an appointment. Let him or her know you are a person of integrity. Don’t forget to share the OKRs Map in which the sponsor is involved down to your own OKR.

Be Brief and Sharp

Ask for what you want, but don’t take up a lot of potential sponsors’ time doing it.

Get Commitments

At the end of the sponsorship process, you should be able to get the following outcomes:

  • Get understanding and alignment.
  • Get funding (means and resources).
  • Get help in removing impediments (and build a fast track through organizational hurdles).
  • Get a schedule to organize feedback ceremonies.