Data Analytics

RTDM in Times of Market Uncertainty Can’t be Business-as-Usual

Actian Corporation

April 22, 2020

Real-Time Decision-Making

Saying that the current business climate and market conditions are tough is about as much an understatement as saying a Banana Slug is not terribly fast. Making the right decisions too slowly or wrong decisions too quickly can have equally adverse outcomes.

In certain industries such as Retail, Travel, and Hospitality or for smaller businesses without liquidity or significant credit lines, the COVID-19 pandemic has produced an epidemic of financial hardship, regardless of company size or industry.  The only logical approach is to make the best decisions you can with the information you have.  In other words, as Donald Rumsfeld would say: “You go to war with the army you have not the one you wish you had.” For today’s organizations, our armaments are IT and, one could argue, the higher the degree of digital transformation you’ve achieved, the more “Shock and Awe” you can project.

Under normal conditions, today’s Enterprise organizations have far more visibility into their business process, their employees’ skillsets, and the knowledge of their customers than ever. Automation, the commoditization of compute resources, the democratization of application access, and data sharing have made all but the smallest mom-and-pop shop far more agile than at any point in the past. But here’s the rub, all of the investment in IT for automation, the higher productivity and knowledge level of today’s workforce through the use of technology, the vast global supply chain, and specialized B2B ecosystems thrive on business certainty – disruption is a dirty word.

OK, maybe you think you like the word because you’re in the Tech Industry.  But when we talk about disruption, we generally mean healthy technical progress, supplanting old, outdated technology pragmatically and programmatically over months or years. Sorry Geeks (takes one to know one), Category Five Hurricanes and Tornados, Trade Wars and Shooting Wars, Political Unrest and – as we’re all experiencing now, Pandemics – are not healthy disruptions.

Consequently, our sizeable and sustained IT investments – our armories – are molded by the day-to-day business process requirements, honed over decades of feedback and adjustment to market conditions and a normal business landscape. In fact, even our feedback and adjustment to our business process, to customer engagement, training our workforce – pretty much everything – is performed within these constraints.  The way we determine if our business outcomes are what we expected and IT was supportive stems from the way we use Business Intelligence, Visualization, and Reporting tools.

Natural and man-made disasters (yes combinations of them) generate market uncertainties. The business discontinuities they produce, in turn, create situations where any and everything in your business may need tweaking: customer engagement, supply chains, workforce management, credit risk assessment, and the list goes on.

Unfortunately, your normal closed and, dare I say, rigid feedback loops that you use for assessment aren’t up to the task because things are moving too fast, in ways that you have not seen before. Think bank loans from small businesses that don’t have loans from your bank or binge shopping for peanut butter in one zip code, but toilet paper in zip code – forget it – everywhere!

Collecting, aggregating, and analyzing the right data with speed and accuracy is difficult under these circumstances because your weapons portfolio is meant for doing conventional business – not crisis business. However, it’s under these circumstances that you need Real-Time Decision-Making (RTDM) strategic capabilities to navigate your business through the crisis.

Now, that I’ve got your attention, over the next few weeks, I hope you will read the rest of this six-part series where I will elaborate on and detail out a roadmap for Real-Time Decision-Making strategic capabilities:

  1. Why do you need Real-Time Decision-Making in times of market uncertainty, and what is it anyway?
  2. Should I leverage or leave behind my conventional IT systems I use for my day-to-day business?
  3. Cloud key to Real-Time Decision-Making – necessary but insufficient for strategic capabilities.
  4. In periods of market uncertainty, organizations that are further along on their digital transformation maturity curves are better prepared to apply Real-Time Decision-Making.
  5. The silver lining of Real-Time Decision-Making in periods of market uncertainty: innovation.

Find out more about Real-Time Decision-Making strategic capabilities delivered on Actian Data Platform real-time connected data warehouse here.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.
Data Integration

Connected Data Makes Compliance Easier

Actian Corporation

April 22, 2020

connected data making compliance easier

Regulatory compliance can be a headache for any company. If you do business globally, the challenge is even more significant. Governmental regulations vary greatly across countries and even within local regions. If that weren’t difficult enough, add on a continuous stream of regulatory changes, and you have a real problem. Whether you are trying to perform the compliance functions in-house or leverage an outside firm to assist, connected data is the key to keeping your company in good standing with regulators.

Regulations are Ever-Changing

It would be great if there were a universal set of global regulations that companies could follow. Unfortunately, they don’t exist (and probably never will). Each jurisdiction is empowered to pass its own laws and define its own regulations, and companies that do business in their area of control are responsible for ensuring compliance.

Many large companies have teams of compliance staff dedicated to tracking and analyzing regulatory changes to determine their impact on company operations. Smaller and mid-size companies can’t afford to do the analysis internally, so they either seek the assistance of an outside firm to advise them, or they rely on limited (and often incomplete or obsolete) regulatory guidance available on the internet. Regardless of the approach, someone must figure out what regulations apply to you and how regulatory changes impact your company’s operations.

Companies are leveraging the Actian DataConnect integration platform, and the Actian connected data warehouse to help them aggregate data from many external data sources to compile the big picture of regulations and quickly identify changes that matter to their operations. Connected data enables them to identify risks and opportunities rapidly to ensure continuous compliance as well as exploit new opportunities favorable to the company.

Compliance Issues Permeate Your Company

Regulatory compliance isn’t just a finance issue. Nearly all facets of your company are subject to a variety of regulations that must be understood and complied with. Sales and marketing are subject to GDPR and other data privacy laws. Manufacturing operations have RoHS requirements as well as various regulations governing supply chains and import/export of materials. IT is subject to Sarbanes Oxley controls. HR is responsible for demonstrating compliance with workplace safety (OSHA) and labor regulations. Finance deals with tax, trade, and securities regulations. The list could continue indefinitely.

Although each business function has unique regulations governing their activities, all of them face the same challenges of understanding what the current rules are, applying them correctly, and demonstrating compliance. Compliance monitoring, control, reporting, and audit support require reliable data that span your companies operations. Sarbanes Oxley compliance requires data that spans many of your IT systems. Tax compliance requires data about things like assets, sales transactions, where operations are taking place, and the type of work your employees are performing. Manufacturing compliance requires tracking the flow of materials through complex supply and distribution networks. Managing each of these compliance facets requires accurate and connected data.

How a Data Integration Platform can Help Your Compliance Efforts

Ensuring continuous compliance requires understanding what regulations apply to you and analyzing data about your operations to understand how you are conforming with regulatory requirements. Doing this effectively requires a lot of data from a lot of different sources. Managing connections with each of the needed data sources one by one isn’t practical – the effort required to establish connections is burdensome, and there is little way to ensure that you are seeing the full picture (and not missing something).

A data integration platform, like Actian DataConnect, enables you to connect all your data sources (internal and external) in a consistent, centrally managed way. This then allows you to aggregate disparate data together for analysis and reporting. If you are doing the compliance tasks in-house, the connected data that DataConnect can help you assemble will provide an effective channel to monitor regulatory changes, monitor your company’s activities for compliance, and ensure you are using accurate data for reporting and audit support. If you are using an external firm to assist in compliance, DataConnect can serve as a window into your organization, enabling your compliance partner to monitor your activities and provide you with informed compliance guidance.

Actian is a leading provider of data management solutions used by companies across industries and geographies. Learn more about Actian solutions for financial services and compliance.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.
Data Intelligence

WhereHows: A Data Discovery and Lineage Portal for LinkedIn

Actian Corporation

April 20, 2020

linkedin-wherehows

Metadata is becoming increasingly important for modern data-driven enterprises. In a world where the data landscape is increasing at a rapid pace, and information systems are more and more complex, organizations in all sectors have understood the importance of being able to discover, understand, and trust in their data assets.

Whether your business is in the streaming industry, such as Spotify or Netflix , the ride-sharing industry, such as Uber or Lyft, or even the rental business like Airbnb, data teams need to be equipped with the right tools and solutions that allow them to innovate and produce value with their data.

In this article, we will focus on WhereHows, an open-source project led by the LinkedIn data team that works by creating a central repository and portal for people, processes, and knowledge around data. With more than 50 thousand datasets, 14 thousand comments, and 35 million job executions and related lineage information, it is clear that LinkedIn’s data discovery portal is a success.

LinkedIn Key Statistics

Founded by Reid Hoffman, Allen Blue, Konstantin Guericke, Eric Ly, and Jean-Luc Vaillant in 2003 in California, the firm started out very slowly. In 2007, they finally became profitable, and in 2011 had more than 100 million members worldwide.

As of 2020, LinkedIn has significantly grown:

  • More than 660 million LinkedIn members worldwide, with 206 million active users in Europe.
  • More than 80 million users on LinkedIn Slideshare.
  • More than 9 billion content impressions.
  • 30 million companies registered worldwide.

LinkedIn is definitely a must-have professional social networking application for recruiters, marketers, and even sales professionals. So, how does the Web Giant keep up with all of this data?

How it Started

Like most companies with a mature BI ecosystem, LinkedIn started out with a data warehouse team, responsible for integrating various information sources into consolidated golden datasets. As the number of datasets, producers and consumers grew, the team increasingly felt overwhelmed by the colossal amount of data being generated each day. Some of their questions were:

  • Who is the owner of this data flow?
  • How did this data get here?
  • Where is the data?
  • What data is being used?

In response, LinkedIn decided to build a central metadata repository to capture their metadata across all systems and surface it through a unique platform to simplify data discovery: WhereHows.

What is WhereHows?

WhereHows integrates with all data processing environments and extracts metadata from them.

Then, it surfaces this information via two different interfaces:

  1. A web application that enables navigation, searching, lineage visualization, discussions, and collaboration.
  2. An API endpoint that empowers the automatization of other data processes and applications.

This repository enables LinkedIn to solve problems around data lineage, data ownership, schema discovery, operational metadata mashup, data profiling, and cross-cluster comparison. In addition, they implemented machine-based pattern detection and association between the business glossary and their datasets, and created a community based on participation and collaboration that enables them to maintain metadata documentation by encouraging conversations and pride in ownership.

There are three major components of WhereHows:

  1. A data repository that stores all metadata.
  2. A web server that surfaces data through API and UI.
  3. A backend server that fetches metadata from other information sources.

How Does WhereHows Work?

The power of WhereHows comes from the metadata it collects from Linkedin’s data ecosystem. It collects the following metadata:

  • Operational metadata, such as jobs, flows, etc.
  • Lineage information, which is what connects jobs datasets together.
  • The information catalogued such as the dataset’s location, its schema structure, ownership, create date, and so on.

How They Use Metadata

WhereHows uses a universal model that enables data teams to better leverage the value from the metadata; for example, by conducting a search across the different platforms based on different aspects of datasets.

Also, the metadata in a dataset and the job operational metadata are two endpoints. The lineage information connects them together and enables data teams to trace from a datasets/jobs to its upstream/downstream jobs/datasets. If the entire data ecosystem is collected into WhereHows, they can trace the data flow from start to finish.

How They Collect Metadata

The method used to collect metadata depends on the source. For example, Hadoop datasets have scraper jobs that scan through HDFS folders and files, reads the metadata, then stores it back.

For schedulers such as Azkaban, they connect their backend repository to get the metadata, aggregate it and transform it to the format they need, then load it into WhereHows. For the lineage information, they parse the log of a MapReduce job and a scheduler’s execution log, then combine that information together to get the lineage.

What’s Next for WhereHows?

Today, WhereHows is actively used at LinkedIn as not only a metadata repository, but also to automate other data projects such as automated data purging for compliance. In 2016, they integrated with systems down below:

In the future, LinkedIn’s data teams hope to broaden their metadata coverage by integrating more systems such as Kafka or Samza. They also plan on integrating with data lifecycle management and provisioning systems like Nuage or Goblin to enrich the metadata. WhereHows has not said its final word.

Sources:

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.
Data Integration

In Healthcare, Connected Data Drives Holistic Patient Care

Actian Corporation

April 16, 2020

Healthcare using connected data

Providing high-quality patient care is a team effort. Data is what enables healthcare professionals to understand the full scope of issues patients are facing, share diagnostic data, and coordinate care across providers to ensure the best quality treatment recommendations and follow-up care. Over the past decade, healthcare companies have drastically expanded their use of technology both in the direct care of patients and in the management of healthcare operations. When the technology works and the data is flowing, great things can happen. When the technology is disjointed, the effects can be deadly.

Realtime Data is Essential for Making Critical Healthcare Decisions

Whether it is a clinician waiting for test results, an administrator coordinating insurance coverage, or a pharmacy distributing medications – in healthcare, real-time data is essential. Centralized patient records systems have made a tremendous impact on providers’ ability to share information. However, there are still many independent data sources, records management systems, and 3rd party interfaces that slow down the overall healthcare process. Delays in sharing data not only slow the pace of care for patients but also lower the efficiency of clinicians (meaning they see fewer patients) and increase the risk of errors in diagnosis and treatment. These are the reasons why healthcare companies are placing an intense focus on connected data, interoperability of systems, and real-time data-sharing upgrades to their IT systems.

Sharing Data With Government Organizations and Health Networks

The government and health networks play an essential role in orchestrating data sharing between hospitals, independent providers, and pharmacies. The goals of data sharing are twofold.

First is ensuring the best patient care possible. If a patient from Iowa seeks medical attention care while on vacation in Florida, the treating physician and pharmacy need to be able to receive electronic medical records (EMR) from the patient’s primary care doctor and any specialists they have received treatment from to ensure the appropriate treatment is provided. Pharmacies need the ability to check drug interactions for new prescriptions to avoid adverse reactions that could be fatal.

The second reason data orchestration and interoperability across healthcare networks is important is healthcare monitoring. Government organizations play a significant role in tracking infectious diseases (like COVID-19), drug reactions, and other situations that could impact either public health or the safety of various treatments during clinical trials. This data orchestration is often coordinated centrally but implemented by individual hospitals and clinics within their patient care management systems.  In addition to centralized monitoring, the data that is collected is made available to clinicians as a source of research – helping them understand what other providers are seeing, how they are treating various conditions, and the efficacy of those treatments.

How a Data Integration Platform Enables Better Healthcare

The one thing healthcare companies care most about is providing the best care possible for their patients. It doesn’t matter if you are looking at a hospital, private clinic, insurance company, device manufacturer, drug manufacturer, or pharmacy. Patient care is job #1. Actian DataConnect is a data integration platform that enables companies to connect anything, anytime, anywhere. Healthcare companies are using DataConnect to aggregate data from their diagnostic equipment, share test results and imaging across departments, access government data for research, and engage with 3rd party insurance companies and pharmacies. Leveraging a data integration platform, these companies can optimize operations, improve the productivity of clinicians, and provide better care to their patients. Actian DataConnect delivers a consistent and secure way to manage and control the flow of data across your organization – ensuring the right people have access to the right data at the right time.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.
Data Intelligence

Data Management: Don’t Neglect Your Metadata

Actian Corporation

April 15, 2020

data-management

Data management can be defined as the process of ingesting, storing, organizing and maintaining all data created and collected by an organization to help drive operational decision-making and strategic planning.

It won’t be a surprise if we tell you that data topics are constantly evolving and becoming more complex within organizations. As a result, any organization considering these large-scale data and analytics initiatives is increasingly faced with high-volume data of various types, formats, and distributed environments.

In an attempt to maximize its value, metadata is a response that provides knowledge about where the data is located, what attributes it has, or how it is linked (also called a knowledge graph). Yet, most organizations do not yet have a formal approach to metadata management.

Let us convince you of its necessity in this article.

The Challenges of Metadata in a Next-Gen Data Management

In an increasingly dispersed and complex technology environment, Data Managers or Chief Data Officers are tasked with providing and simplifying a consistent data environment that can be activated by their teams.

Among our clients who have taken the gamble of initiating metadata management, we see a common objective: to ensure the visibility of different data sources and initiatives and to involve new players who do not necessarily have technical profiles.

In short, the need to align semantics across multiple data silos is driving an increased demand for metadata governance capabilities.

In this new discipline of data management, a lever to better describe your data, by including information on its location to facilitate the use and/or protection in diverse environments and sources.

Here is an excerpt of the questions your metadata will be able to answer:

  • Who created this data?
  • Who is responsible for this data?
  • In what applications is it used?
  • What is the level of reliability (quality, speed, etc.) of this data?
  • What are the permitted contexts of use (e.g. confidentiality)?
  • Where is the data located?
  • Where does this data come from (a partner, open data, internally, etc.)?

Our Recommendations to Data Management Stakeholders

For those who are today approaching metadata management as part of data management strategies, we advise to:

  • Progressively deploy an enterprise data catalog by adopting metadata management practices. The use of data catalogs will allow, among other things, to inventory all forms of metadata – technical, but also increasingly business, operational and social – in order to improve the visibility of data management activities.
  • Work with suppliers who are able to accept this diversity in their systems and operate in distributed, independent and increasingly cloud-connected data management infrastructures.
  • Identify metadata management use cases that can be easily activated in order to quickly prove its value. The solution providers selected should be those that automate the discovery, profiling and inventorying of metadata or at least the most tedious of tasks.
actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.
Data Analytics

Connected Data: Reducing Customer Churn in a Telecom Industry

Actian Corporation

April 13, 2020

Connected data

Customer loyalty is the key to profitability in the telecom industry. Because telecom providers manage large fixed infrastructures that must be offset by revenue, customer churn (attrition) is particularly problematic in this industry.

Low switching costs for customers (supported by government regulations) mean that customer loyalty is the only real tool that telecom companies must reduce their churn rates. Connected data, used to improve service quality, dynamically adjust pricing/promotions, and offer personalized content to consumers, enable telecom providers to influence customer loyalty and increase customer retention directly.

What Causes Customer Churn in the Telecom Industry?

There are four key factors that lead customers to change telecommunications providers. Customer loyalty can be developed by companies directly addressing these factors.

  1. Service quality.
  2. Availability of features and content.
  3. Lower-cost substitutes from competitors.
  4. Negative customer service experiences.

The good news for telecom companies is that all these factors are measurable and addressable. If you can’t measure something, managing it is nearly impossible. Since these factors don’t fall in that group and are measurable, the challenge becomes turning massive amounts of raw data into actionable business insights and decisions that impact customer perceptions.

What Types of Data Are Telecom Companies Connecting?

Telecom providers (both Communication Service Providers (CSPs) and content providers) have a unique opportunity to access rich customer data that isn’t available to many other industries. This is due to the nature of their products/services and the visibility they have to the end-to-end supply chain of communication services. They can see content and service usage through web services and centralized systems. By accessing data from cell towers and deployed infrastructure, companies can add a location dimension to the data. Reaching into individual consumer devices, these companies gain visibility to the last mile of the supply chain and can access data about the types of users/viewers of their services and telemetry on end-user service performance.

How Are Telecom Companies Using Connected Data to Reduce Customer Churn?

When you connect the different layers of data available in the field with company data about subscriptions, billing history, network utilization, and the cost of content, telecom companies have all the pieces to develop a genuinely comprehensive 360-degree view of individual customers. The data about a single customer is interesting but may not be very actionable. By analyzing the data of all customers (or using statistical sampling), telecom companies can identify trends, patterns, and conduct correlation analysis to understand what factors drive service usage behavior and influence customer satisfaction. For example, they will be able to see what types of content are most popular with customers of a specific age or cultural demographic. This can help the company acquire and provide content that increases customer engagement. Aggregated trends against connected usage data are also used for infrastructure planning.

While individual usage may be highly variable, aggregated usage metrics can tell a company how much capacity is being used during different times during a typical week.  These insights can then be used to drive decisions that improve service quality and customer satisfaction. Is there enough capacity in the network and content delivery services to support peak demand? If the overall usage is trending upward, infrastructure upgrades may be needed. If overall usage is trending downward, additional marketing efforts and/or service improvements may be necessary. These insights can help a telecom company to address service quality and competitive pricing issues in broad strokes, but content/feature availability and customer service experiences require a more personal approach.

Using Connected Data to Deliver Personalized Services

Telecommunications services are, to a great extent, commodities. There are multiple cell phone carriers, internet service providers, streaming content providers, and voice services carriers available in most markets. Differentiation between these companies in the eyes of customers comes from the availability of unique content and features as well as the quality of customer service experiences. Telecom companies can use the connected data collected from customers and internal business processes to identify what services and content to recommend to end customers to personalize the service experience.

For example, service usage data for a family may indicate that during afternoon hours, teens and children are the primary consumers of services (after school). During this time, streaming content offerings might be tailored to suggest kid-appropriate shows. Usage data in the late morning may indicate an adult working from home and attending meetings using VoIP. During this period, telecom companies might adjust QOS rules on the network to prioritize this type of traffic, so collaboration apps perform better.

Company performance and profitability are aggregate problems. Customer loyalty is an individual problem that requires a personalized approach for each customer. Connected data about customers can enable telecom companies to address both. Actian can help with a full suite of data management solutions that enable telecom companies to connect all the pieces of customer data together, regardless of their source. Processing and analysis can either take place in edge devices deployed in the network or centralized in the cloud using the Actian Data Platform – Actian’s connected data warehouse solution. With the data management tools from Actian, telecom companies can leverage more data, identify more actionable insights, and transform those insights into actions that directly address the causes of customer churn.

Actian’s solutions deliver the promise of real-time decision-making by enabling a customer service agent to know the call they are on is with a customer who is a flight risk due to poor past service and reduced usage. They need to be armed with the latest retention offers to boost usage and improve their relationship with the carrier.

To learn more about solutions for the telecommunications industry, including customer stories and solution overviews, visit https://www.actian.com/solutions/by-industry/communications-media-entertainment/

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.
Data Intelligence

How do I Start Metadata Management?

Actian Corporation

April 10, 2020

start metadata management

Metadata management is an emerging discipline that is necessary for enterprises seeking to enhance innovation or regulatory compliance initiatives for their data assets.

Many of them are trying to establish their convictions on the subject and brainstorm solutions to meet this new challenge. As a result, metadata is increasingly being managed, alongside data, in a partitioned and siloed way that does not allow the full, enterprise-wide potential of this discipline.

To value this “new” discipline in your organization, you need to demonstrate its ability to deliver value from the outset. We offer strong data catalog support to produce value in a very short timeframe, in most cases, within a matter of a few weeks.

In this article, we deliver an approach facilitated by a solution like the Actian Data Intelligence Platform: connected, agile, and agnostic about the technologies used in enterprises.

Set a Milestone

For each milestone, you will be asked to identify several elements:

  • What are the Problems: Increasing and sharing knowledge must solve a problem. It can be of various types: compliance for a scheduled audit, centralization and uniformity of a particular piece of information to satisfy a group of collaborators in a difficult situation, for example.
  • What data: It is important to focus efforts on data directly related to the identified problem. Trying to deal with too large a data set will lengthen the time frame and ultimately extend the time at which the achievement of the objective, and therefore the production of value, could be measured.
  • Who are my data users: On the first iteration, which may be longer than any other, the mobilized users will be structuring. They must be able to free up enough time to invest themselves in achieving the objective, but they must also have the motivation to set up metadata management in place. These users will be your first ambassadors moving forward.
  • What time do I have: This iteration must be completed within a reasonably short period of time. On an enterprise-wide basis, we recommend a timeframe between 4 weeks to 3 months maximum, depending on the bandwidth of the people in charge of the subject. This duration should also help to qualify whether a particular problem is appropriate or should be subdivided, or simply discarded.

Sometimes, even before the first milestone is identified, a preliminary introspective exercise on the enterprise’s data governance maturity is carried out.

We suggest this through workshops during which the company, with the help of our maturity matrix, will be able to define its positioning. This type of exercise is of particular interest when it is carried out regularly (e.g. every year). It allows a global assessment of the benefits of deploying your governance program.

Launch the First Milestone

Typically, the chronological sequence for the onboarding phase supported by our metadata management tool is the following:

Our desire is to anchor the launch in a value-producing reflex. Each iteration must bring the company tangible benefits that address your issues. This first iteration includes elements that will not appear anymore, or at least much less, in the following iterations, in particular the technical aspects related to the implementation of the solution.

We suggest by default iterations of 6 weeks. This duration, which is fairly arbitrary, corresponds relatively well to the time generally required to produce significant value while not disrupting too much the activity of the people involved. Indeed, it is necessary to keep in mind that it is rare that the mobilized collaborators have full time to deal with the subject.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.
Data Analytics

Financial Services Companies Need All the Data Analytics They Can Get

Actian Corporation

April 8, 2020

Financial services companies

There are times when financial markets trend up. There are times when markets trend down. And then there are times when the markets go crazy and the only thing predictable is volatility. When markets encounter these volatile periods, financial services companies become highly reliant on data analytics to determine where the real market forces are coming from and determine how best to react. There are five key analytics capabilities that financial services companies need to perform effectively in a volatile market environment.

Connected Data

The more data you have available to analyze, the more accuracy you can gain. There is a whole science around correlation analysis that we won’t go into here, but in general, adding more diverse data sets to your analysis gives you more variables to analyze. This, in turn, increases the likelihood of discovering strong correlations between market forces and market performance. The challenge that financial services companies encounter (and the capability they need to develop) is integrating many data sources together to feed their analytics algorithms.

Robust Analytics

Modern financial analytics isn’t done manually; they leverage advanced technology. Financial services companies must fully harness Artificial Intelligence (AI) and Machine Learning (ML) with real-time connected data warehousing of data from a disparate range of customer, business partner, and even governmental sources to fully optimize growth, profitability, and business risk. Your analytics algorithms are what comb through the data, looking for trends, relationships, and meaningful outliers that can then be transformed into actionable market insights. More robust analytics capabilities enable you to analyze more data and discover more meaningful insights.

Engagement Tools

Most financial services companies don’t operate in isolation. They are part of a greater services value chain, building on the work of upstream suppliers, providing value-add services, and providing capabilities to a group of downstream customers. During volatile market times, it is essential that these companies have the capabilities to deliver critical news, information, and analytics to the global financial community and their customers – enabling transactions and connecting communities of trading, investing, financial and corporate professionals.

Fraud Detection and Prevention

Market turmoil is distracting for financial services companies and their customers. Hackers and thieves know this and won’t hesitate to exploit the opportunity to attack. Distractions increase the risk of fraud, so ensuring robust fraud detection and prevention mechanisms is essential. The key is making your fraud systems adaptive, leveraging core Artificial Intelligence and Machine Learning capabilities, and feeding them the right data for training and query. AI and ML systems have powerful capabilities for identifying data anomalies, unusual behavior, and executing automated responses. With AI guarding your operations, equipped with robust, real-time data, fraudsters don’t stand a chance.

Data Processing at Enterprise Scale

The most important capability that financial services companies need to navigate through the murky waters of a volatile market is high-performance data processing. You can have access to all the data in the world, the best algorithms, tools for communicating with your customers, and state of the art fraud monitoring, but if you don’t have the processing power to support these things, you have a real problem. That is where Actian comes in.

Actian Data Platform is a connected data warehouse that is designed for massively parallel processing of streaming data in real-time. With the Actian Data Platform, you can monitor news, social feeds, market performance, customer transactions, competitor actions, and more – analyzing these data sources in real-time to determine what is noise and what is important.

Because you’re dealing with streaming data about the market forces and the market conditions that are changing rapidly, it is essential that your analytics system can operate at an enterprise scale with near-zero latency. Actian can deliver.

Learn more about Actian solutions for the financial services industry.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.
Data Management

Making the Transition From Flat Files to SQLite

Actian Corporation

April 7, 2020

puzzle piece missing to depict going from flat files to sqlite

In December, January, and February, I posted a short series of blogs on Flat file systems. The series of blogs focuses on the continued use of flat files and why they are no longer viable for use in the future.

The first installment focused on flat files and why embedded software application developers readily adopted them. Then in the second installment, I discussed why embedded developers are reluctant to use databases. The third installment looked at why developers cling to the flat file systems. The final installment provided a checklist for embedded developers migrating off flat files.

For this next series, I’d like to turn my attention to the next stepping-stone in many embedded developers’ paths away from Flat Files, SQLite. While it’s progress, I’d like to convince you that you should step right over this stone or jump off it if you’re still teetering on it.

Perhaps we should start with the point of why and how it’s a positive step up from flat files. That, of course, would have to be with respect to where and how it’s applied to real-world requirements. According to the SQLite folks themselves, as stated on the SQLite website, Appropriate uses for SQLite: “SQLite does not compete with client/server databases. SQLite competes with fopen().” This statement is tied to their fundamental tenant that they aren’t meant to be compared with SQL database engines because SQL database engines are intended to be a shared repository for Enterprise data. To be a shared repository, according to the SQLite site and therefore the community, an SQL database engine needs: Scalability, Concurrency, Centralization, and Control. SQLite, on the other hand, is focused on local data storage for individual applications and devices where there is a requirement for zero database administration (they then provide the standard laundry list of typical IoT devices).

The SQLite folks also say it would be great in embedded environments if you could have a single compact file format to support cross-platform data transfer and as well as ad-hoc, home-grown data sets of multiple data types. In fact, SQLite has done benchmarking to show they are 35% faster than file systems and reading and writing this type of data through fread() or fwrite()1.

We agree with most of what SQLite folks are saying: yes, SQL database engines should meet the requirements listed above. Yes, there is a need for local data management that is portable across platforms and can support multiple data types; and, finally, there is definitely no way to avoid a zero-administration environment at the edge for mobile and IoT.

So, with that being said, “U” know what’s coming next, that three-letter word that begins with a “B” and ends with a “T”: BUT. But what if you could have your cake and eat it too? But what if you could get all the features and advantages of Enterprise shared datastore characteristics from an SQL database engine, yet have the ability to scale it down and embed it directly into a local Mobile or IoT application, supporting multiple data types, portability across platforms and required zero database administration?

The answer you should give is yes. However, you may push back with the following reasonable positions:

Firstly, SQLite and file systems are free and popular. Open source SQL database engines (MySQL, Postgres, MariaDB, etc.) aren’t able to run scaled down to an IoT or Mobile footprint. Neither can I run them embedded in my applications.

Secondly, why do people use such a stupid analogy like you can have your cake and eat it too? It’s more like I need a car, not a truck, that’s overkill, so why bother?

Well, my response to you would be that’s so 2015 (by the way, that’s the last time SQLite updated their “appropriate use” page). The fact is, in 2020 and definitely, in 2025, you will need the functionality of a car and a truck with any application you build anyway. Yes, you need to store and analyze far more data locally, even with WiFi-6 and 5G bandwidth. Yet, the sheer increase in the volume of data and the need to handle peer-to-peer and edge-to-cloud device management, sharing of contextual data for governance, common operational pictures, and the like will dictate as much as possible and will still need to take place locally to avoid latency.

Furthermore, many peer-to-peer and edge-to-cloud operations – not to mention gateway operations where you’d take in data from multiple downstream sources – require concurrency for, and control of, those downstream data sources. Gateways and edge datastores will also require scalability such that you can use the same architecture and data portability across platforms. Finally, as you move more functionality to that gateway that would have been in the data center or the cloud, what was considered centralization functionality needs to move there as well.

So, think of this as your car needs two-wheel drive in most instances, but it needs the option of all-wheel drive in the rest. Your car also may need towing capacity. But, also think of this in the reverse, your truck isn’t always towing a boat or being used for off-roading, and maybe it’s carrying additional passengers, and you want many of the comforts of a luxury car.

To summarize, with respect to data management and this analogy, the edge in 2020 and into the next five years requires everything that was needed in the data center for an SQL Data Store (your truck requirements). But also, all that was needed for local device data management when it was standalone (your car requirements).

In essence, you need an SUV that scales from a very small CRV up to a monster size one. This is, unfortunately, not what SQLite is capable of doing, and invariably, when you use SQLite, you’re forced to bolt it on to some other database on the other end. We’ll discuss the drawbacks of this forced marriage in the next blog.

Ready to reconsider SQLite, learn more about Actian Zen. Or, you can just kick the tires for free with Zen Core which is royalty-free for development and distribution.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.
Data Analytics

Healthcare Analytics Improve Operational Efficiency

Actian Corporation

April 6, 2020

Healthcare analytics

Healthcare is a data-driven industry with data analytics needs that make “big data” seem tiny. For healthcare companies to operate efficiently, they need a high-power data engine to crunch the numbers, analyze the data, and get it into the hands of decision-makers quickly. The healthcare industry is the epitome of business agility, and data analytics is what makes that possible. Here are four examples of how healthcare analytics can help healthcare companies improve operational efficiency.

Healthcare Analytics for Diagnosis and Patient Care

If a patient walks into a clinic with a fever, dry cough, and respiratory symptoms, do they have seasonal allergies that require some antihistamine and a box of tissues or do they have a highly contagious disease like COVID-19 requiring quarantine?

To enable the clinic to diagnose and treat the patient effectively, they need the ability to combine the observations being made with this patient against available information from other providers around the world. They also need to follow guidelines from government organizations, research from drug companies, and many other sources to systematically work through a diagnostic process and determine what the patient’s real problem is and how best to care for them. Healthcare analytics, running behind the scenes at the clinic, in healthcare networks, and in government organizations, provide clinicians with the tools they need to do their job effectively.

The diagram below provides a glimpse of how disparate data sources feed the healthcare analytics systems that are used to improve operations and outcomes:
healthcare analytics diagram

Hospital Operations

Hospitals are massive logistics operations with lots of moving parts. From staffing to patient scheduling to stocking of drugs and supplies, ensuring these operations run smoothly requires high-power data analytics. How many nurses are needed for next Tuesday’s day shift? Are there ICU beds available to handle post-operative care for the people undergoing surgery tomorrow? Does the hospital have enough ventilators and beds to treat COVID-19 patients in addition to the typical patient load? Does the hospital blood bank have enough supply on hand to support the demand for the next couple of weeks?

These are the real questions that hospital administration and operations staff need to answer every day. The answers to these questions come from careful analysis of past trends, current stock levels and patient loads, seasonal forecasts of demand, and modeling of potential “unexpected events.” Failure isn’t an option. You can’t just say to a patient, “I’m sorry, we’re temporarily out of stock of ventilators, but don’t worry, we’ll have more in next month.” Healthcare doesn’t work that way. Healthcare companies use data analytics to plan for the unknown so they can be adequately prepared.

Clinical Trials

The development of new drugs, medical devices, and treatment procedures is a collaborative effort between manufacturers, CROs, healthcare providers, and governmental organizations. Clinical trials are a necessary part of the healthcare process to ensure the safety and efficacy of medical care. Clinical trials involve robust data collection and protocol adherence to ensure the confidence of the results and data that are produced. Data analytics plays an essential part in the clinical trial process. It is used to aggregate data being collected in the trial to analyze and interpret findings and share results with the stakeholders participating in the clinical trials. Data analytics are also employed to investigate adverse reactions, isolate testing anomalies, and quantify the risks of new procedures, drugs, and devices.

Insurance Claim Processing

Processing insurance claims accurately and claims accuracy efficiently leads to higher reimbursement rates and faster payments to hospitals and providers. For small clinics and independent providers, efficient claims processing with insurance companies is essential for maintaining cash flow to keep your business operating. For hospitals and large organizations, the complexity of reconciling the activities taking place over many departments with claims approval and processing spanning multiple insurance companies can be a logistical nightmare.

What do all healthcare companies have in common? Frustration. Data integration and analytics can help healthcare providers to improve their claims processing accuracy and reduce the processing time. Automating the flow of data and performing real-time analysis on patient records to identify potential errors and issues in the insurance reporting process so they can be addressed at the time of treatment instead of needing to be reconciled and fixed days (or months) later.

If you want to do healthcare analytics right, you will need a high-power analytics engine designed for the scale, speed, and performance that the healthcare industry demands. Actian Data Platform provides the analytics capabilities healthcare companies need at an enterprise scale at an affordable price.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.
Data Architecture

Data Warehouse Appliances are Becoming History

Actian Corporation

April 6, 2020

Data warehouse appliances

The era of data warehouse appliances is coming to an end, rapidly being replaced with a new generation of cloud services. The first data warehouse appliance was introduced onto the market in 2003. For nearly 15 years, this market grew with several large technology players, including IBM, Oracle, and Teradata developing or acquiring database appliance offerings.

These self-contained hardware devices enabled companies to scale their data warehouse infrastructure to support the expanding needs of business analytics. We are now entering a new era when the hardware solution to the data warehouse scaling problem is being replaced with cloud services – enabling even greater elasticity, scale, speed, and performance while lowering the capital costs of hardware infrastructure.

Data Warehouse Appliances Served Their Purpose

A data warehouse appliance is a stand-alone set of hardware (servers, memory, storage, and I/O channels), loaded with an operating system, database management system, and administrative tools) designed to support a multi-node database deployment. They were sold as a self-contained unit that could be installed in a data center, came pre-configured for redundancy and high availability, and often included service and support from the vendor that manufactured them.

Data warehouse appliances did an excellent job of solving the data warehouse scaling problem in the early 2000s for companies that wanted to run enterprise-scale analytics on their OLTP datasets but didn’t want to design, build and operate data warehouse infrastructure themselves.

Data warehouse appliances gave companies a “ready to install” solution instead of a “box of parts” with some assembly required.

The big challenge with these systems is that they were expensive to acquire and costly to operate. Scaling via hardware isn’t something that can be done quickly. There are lead times to procure, install and configure new equipment, and capacity increases couldn’t be made in small increments – meaning companies often had to purchase (in advance) capacity that they wouldn’t need for a few quarters. At the time, however, this was the best option on the market, and companies were happy to have data warehouse appliances available as their time-to-value was faster than a do-it-yourself approach.

Hardware Refresh Cycles and the Shift to the Cloud

Technology changes quickly, and (fortunately) new capabilities are being introduced every day that offer companies computing options with increased performance, capacity, and scalability along with decreasing costs.

These new developments cause existing systems to lose relative value, driving the need (and desire) for hardware refresh cycles. Data warehouse appliances (because they are hardware products) have a finite useful life before it is advantageous for companies to replace them with faster, cheaper alternatives. In most cases, the expected life of a hardware data warehouse appliance is about ten years. That is important because the peak of the database appliance market was the period from 2008-2013, and those systems are now due for refresh and replacement.

Over the past decade (since many database appliances were installed), companies have adopted and embraced cloud services as a viable enterprise computing option.  Cloud data warehouses (like Actian Data Platform) go beyond the “ready to install” capabilities of database appliances and provide “ready to use” services that lower operating and maintenance costs even further. Instead of simply upgrading to a newer version of their hardware data warehouse appliances, many companies are evaluating cloud data warehouses as a preferable alternative. In addition to ease of administration, cloud data warehouse solutions also provide three key benefits over hardware data warehouse appliances.

  1. No capital outlay costs – you pay for what you need when you need it.
  2. Dynamic scalability – with hardware, you have the capacity you have. With the cloud, you can scale up or down, depending on your business needs.
  3. Continuous technology refresh – you don’t have to wait ten years to upgrade when new capabilities are available, you can quickly adopt them.

Data warehouse appliances are becoming history. It is time to move to a better, cheaper, faster alternative. If you are a company that is using data warehouse appliances to support your data warehouse today, you should consider a shift to the cloud or hybrid-cloud at your next refresh cycle. Actian provides the next generation of hybrid cloud data warehouse capabilities designed for the needs of modern business. Actian provides cloud-scale performance, availability, and resiliency with a cost model that better aligns with the needs of today’s companies.

To learn more, visit www.actian.com/data-platform.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.
Data Intelligence

Data Ops Rules to Avoid Data Oops

Actian Corporation

April 6, 2020

data oops to data ops

Data Ops is a new way to address the deployment of data and analytics solutions.

The success of this methodology is based on techniques that promote faster, more flexible, and more reliable data delivery. To deliver on this promise, let’s take a moment and analyze this sentence: “The focus is not just on building the systems right, but also building the right systems”.

Many different definitions, interpretations, and publications address DataOps as a concept, but it is much more than just that. It is a way of understanding, discovering, producing analysis, and creating actionable intelligence with data. In a changing world that revolves around data, latencies in data products or their analysis are no longer acceptable.

The entire organization must be put to work to support the deployment and improvement of data and analysis projects!

Data Oops Definition

The concept of DataOps emerged in response to the challenges of failing data systems and failing data project implementations, but also the fragility, friction, or even fear when it comes to the use of data. If you are experiencing this situation, then don’t look too far…you are in the middle of a Data Oops!

In this context of Data Oops, you will agree that your data teams are struggling to achieve the speed and reliability of directed projects.

The main reasons are that companies have too many roles, are too complex, and have constantly changing requirements or objectives, making tasks difficult to frame and deliver.

This complexity is exacerbated by a lack of confidence in data, even to the point of “fearing” it. This occurs when we observe limited or inconsistent coordination between the different roles involved in the construction, deployment, and maintenance of data flows. We are convinced that an organization that does not know its data is doomed to fail.

How to Succeed in Your DataOps

Simply put, DataOps is a collaborative data management practice that aims to improve communication, integration and automation of data flows between data managers and data consumers within an organization. It is based on the alignment of objectives confronted by results. DataOps accepts failure and is built through continuous experimentation.

Here’s a list of principles for successful DataOps:

  1. Learn from DevOps, through their techniques for developing and deploying agile applications in your data and analysis work.
  2. Identify quantifiable, measurable and achievable business objectives. You will then be able to communicate more regularly, progress towards a common goal and adjust more easily.
  3. Start by identifying and mapping your data (type, format, who, when, where, why, etc.) using data catalog solutions.
  4. Encourage collaboration between different data stakeholders by providing communication channels and solutions for sharing metadata.
  5. Take care of your data, as it may produce value at any given time. Clean it, catalog it, and make it a part of your enterprise’s key assets, whether it is valuable now or not.
  6. A model may work well once, but not on the next batch of data. Over-specifying and over-engineering a model will likely not be applicable to previously unseen data or for new circumstances in which the model will be deployed.
  7. Maximize your chances of success of introducing a DataOps approach by selecting data and analysis projects that are struggling due to a lack of collaboration or are struggling to keep pace. They will allow you to better demonstrate its value.
  8. Keep it agile, short designed, develop, test, release, and repeat! Keep it lean and build on incremental changes. Continuous improvement is found when a culture of experimentation is encouraged and when people learn from their failures. Remember, data science is still science!

What are the Benefits of DataOps?

DataOps helps your business move at the speed of data – keeping pace to deliver the right data. It focuses data activities to be aligned with business objectives, and not on the analytic inputs (big data hype). DataOps also focuses on delivering value from all your data activities, from even the smallest of these can inspire cultural changes needed for other implementations to come.

Adopting DataOps in a culture of experimentation is good data practice and empowers the innovators across the organization to start small and scale fast. It is the path to good business practices.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.