Data Management

Worried About SQLite Security Vulnerabilities?

Actian Corporation

December 19, 2018

SQLite - Transactions

Face the Inevitable: Local Persistent Data at the Edge Will Happen

It’s indisputable that edge intelligence will grow, whether that’s mobile applications running on smartphones, IoT applications running in smart cars (or the underlying sensors), the entertainment center, navigation systems, etc. There will be countless mobile and IoT – taken as a whole, edge scenarios – where a native application will be a better approach than a web-based application, or where it would be inefficient/potentially less secure to send raw data back from IoT collection points – rather than process the data and locally store or erase the input data.

The complexity of process and workflow at the edge, the ability to run analytics at the point of action, and working in disconnected modes or with spotty connections are all examples of why you will need local data storage and therefore a local database. It’s a foregone conclusion that data associated with these applications will mushroom.

Unfortunately, what’s equally unavoidable is that security vulnerabilities and opportunistic attacks that target these weaknesses will increase for the foreseeable future.  There have been several studies and surveys undertaken over the last few years that clearly show a far larger number of security vulnerabilities in IoT and mobile device-based software than on more mature desktop or laptop platforms, let alone software running on servers in the data center. Let’s not forget that 10 years ago, each security breach in the cloud generated a sense of panic and perhaps slowed the adoption of cloud services. This could very well be where we are now with localized and embedded data management for edge devices.

Case in point, over the weekend a very serious security vulnerability was discovered in SQLite and the web-bundled version of SQLite in Chromium (the Open Source roots for Google Chrome). While this is not the first or the largest potential breach point found in Open Source data management – after all, the Heartbleed virus in 2014 that took advantage of OpenSSL probably holds both of these records – because this vulnerability is associated with SQLite, a database that is near ubiquitous in mobile native and web-based apps, and its APIs, we should brace for the knee-jerk reaction: perhaps data shouldn’t be stored locally on edge devices and everything should be done in the cloud, where it’s assumed to be more secure (my, how times have changed).

Retrenchment Would Be An Overreaction

First off, SQLite is far better than a combination of temporary memory allocation and flat file systems, an approach I’d never recommend to anyone I call a friend. Why? Unlike memory allocation and the use of flat files, which provide little standardization, built-in indexing, or other real data manipulation, SQLite provides baseline database support for edge intelligence.

SQLite is able to run on a device to support fully optimized use of local compute resources, providing an application with the ability to handle local data management – yet offering the same set of APIs calls for a web-based version of that same app, or even work on both the native and web components of a more complex app. It handles most SQL API calls, so it’s also standard.

Settling Would Be An Equally Poor Choice

However, SQLite has many drawbacks compared to a commercial, enterprise-grade embedded database.  Most notably, it doesn’t have built-in encryption for data at rest or in transit, let alone at 128-bit or above. It also can only embed in a single application and single instance, therefore can’t be scaled up to support multiple users that need to send or receive data from that SQLite image.

For example, if you were to put SQLite on a gateway and then have multiple downstream IoT devices attempt to write data to that SQLite instance, there is no way to manage more than one client (downstream IoT device) writing to the SQLite database at a time – a requirement in an IoT environment often with tens, hundreds, or even thousands of devices downstream. However, client-server databases are capable of handling hundreds or thousands of active downstream clients; therefore, flat file and SQLite users must always pair their applications that send or receive data with MS SQL, mySQL, Oracle, or some other client-server database. This pairing guarantees that data reformatting or ETL (Extract, Transform, Load) is a necessary evil.

There are three major drawbacks of ETL that we find most data architects and developers struggle with: integration cost, performance and data security. I’ll save the cost and performance penalties for another blog, but data security is worth discussing here. In the absence of a single architecture across client and server database management, even if you had built-in encryption, you would have no choice but to decrypt and re-encrypt so that you could perform ETL functions – even if you had no other data manipulation to perform. The requirement to decrypt means your data payloads are – even if temporarily – exposed to hackers.

A Superior Way to Securely Manage Data at the Edge

The Actian Zen database family is based on a single, scalable secure architecture that allows Zen to run on VMs in the cloud, virtually any operating environment, from Windows, Linux, and Mac OS as a full-fledged client-server database to Windows IoT Core, Raspbian Linux distributions, Android, and iOS as a pared down 2MB client-only edge data management platform. Since Actian Zen runs on virtually anything with completely transferrable APIs (you can use SQL directly or NoSQL/SQL APIs programmatically from most popular programming languages), engine, and underlying file storage, it requires Zero-ETL. It also has 192-bit encryption at rest and in transit thereby removing both integration cost, data security vulnerabilities, and boost performance.

Summary

When it comes to SQLite and the recent security vulnerabilities uncovered, the response must be to plug the security vulnerabilities and reduce the risk by fixing SQLite or going to a superior enterprise class solution like Actian Zen. The answer is not to avoid or severely constrain placement of data on local devices. These constraints will throttle innovation and improved outcomes that will undoubtedly come from intelligence embedded at the point of action. Cloud security has seen marked improvements because vendors, industry customers and standards bodies, as well as government (NIST specifications, FEDRamp, etc.) have taken on the challenge, not run back to legacy environments. There is always going to be risk, but the point is to manage that risk by moving from static, reactionary and periodic checks on security to a risk-based, continuous diagnostics and monitoring approach. Expect nothing less over time for Mobile and IoT data security as vendors – like Actian – work together to help customers stay calm and keep their data at the Edge secure.

Ready to reconsider SQLite, learn more about Actian Zen.  Or, you can just kick the tires for free with Zen Core which is royalty-free for development and distribution.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.
Insights

When Fresh Data Matters

Actian Corporation

December 9, 2018

Data collection with AI

How quickly does your business environment change? Are your leaders using outdated information, and do they know it? Do you find that your data is slowing your decision-making processes and preventing you from being truly agile? Imagine what you could do if you were to harness the power of real-time data.

Modern businesses operate in a constantly changing, intensely complex, and data-rich environment. The term “dynamic” doesn’t come close to expressing how fast things are changing. Business environments are fluid – as soon as one thing moves, everything else shifts to adapt to the change. Just because you haven’t recognized the change yet, doesn’t mean it isn’t happening. This is important for leaders to understand because it means the information they use for decision-making, unless it is real-time data, doesn’t likely reflect the current reality. If they don’t have accurate information about how things are presently, then how can they be expected to make informed decisions about the future? Guessing is a scary strategy to run a business.

New Technology for Managing Operational Data

For many years, the method companies used to move data from one place to another was through batch processing and data warehousing. Data warehousing emerged as a response to three technical constraints:

  1. Running analytics on operational systems slowed critical transactional performance.
  2. Data needed to be consolidated from different operational sources to become a single source of truth.
  3. Analytic workloads had different performance-tuning requirements than operational systems.

The good news is that many of these technical constraints have now been relieved through advances in IT infrastructure, increased computing capacity, and modern analytics tools. Unfortunately, even modern data warehousing tools have their shortcomings.

  • Batch data loads lead to delays in the current data.
  • IT change-management policies meant to ensure data quality and security increase the development time for new insights.
  • Tuning optimized for batch reporting doesn’t address ad-hoc query performance for discovery.

Tools like Vector from Actian can now enable you to access, integrate, and analyze your operational data in near real-time – creating an operational data warehouse with the scale and economy of a data lake and the consistency and performance of a data warehouse. Unique features, such as vector processing on commodity servers, multi-cloud deployment and zero-performance overhead updates, make Vector the most capable foundation for an operational data warehouse. Instead of waiting for overnight batch processing, your business processes and decision makers can access fresh data to help them understand what is occurring in your company now.

New Capabilities for Driving Increased Impact

It may not be surprising that technology has advanced and enabled some new capabilities, which weren’t available a few years ago – this is the case in all industries and all facets of business. What is exciting is the impact these capabilities can have on your company:

  • Accelerate business-process execution by avoiding data-replication delays.
  • Monitor real-time service availability and performance to prevent business disruption.
  • Fine-tune operations with real-time optimization to see immediate productivity and quality impacts.
  • Increase security and exposure to risk through real-time threat analysis.
  • Become more responsive and proactive with improved data for decision-making.

The speed of change in your business environment is accelerating. To succeed in a fluid, complex and data-rich environment, your company needs the tools to manage your operational data more effectively, so you can transform it into meaningful information and actionable insights. By removing the data-processing delay, leaders can direct quick, informed and decisive actions that enable you to minimize risks and make the most of opportunities. Fresh data is the key to becoming a truly agile organization – you can’t reach that goal if you must wait 12–24 hours for your data to refresh.

Actian is the industry leader in hybrid data management, data integration and analytics. These solutions enable you to connect seamlessly and manage your operational and analytics data for superior performance, insights.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.
Data Integration

Data Integration: The Connective Tissue of Business

Actian Corporation

December 7, 2018

data integration with Actian

People, products, processes and systems may come and go, but, regardless, of the structural components of your business, data is the one thing that brings all the pieces together, so you can perform as a cohesive unit.

There is no question modern businesses are evolving quickly. Market opportunities are short-lived, technology advancements are happening at a startling pace and customer preferences are constantly shifting –requiring companies to become more agile in how they identify and respond to both opportunities and threats.

Business strategies, supply-chain relationships, organizational structures and business processes must all be nimble to adapt to whatever changes the company encounters.

Modern Businesses Need Agility to Survive

With all this change occurring, it isn’t surprising most companies and employees experience what seems like endless churn. People and processes that were critical yesterday are not needed tomorrow. IT systems enter and exit the company’s technology environment like a revolving door. Supplier and partner relationships continuously change, with new players appearing and others disappearing almost daily.

With all this churn, it raises the question: “How can I match the pace of change?” Companies must constantly change and adapt to survive in the age of digital disruption. They must integrate diverse sources of data, ever-growing volume, and velocity and veracity of data types. New data sources are emerging from on-premises applications, cloud and IoT, and companies need to access this data.

Agility in connecting to diverse data sources and integrating them so you can transform, manage and syndicate data to everyone who needs it is essential.

If Data is the New Oil, Then Integration is the Pipeline That Will Deliver it

It was once thought data existed to support people and processes and as an artifact of IT systems. This understanding was backward – data is the “thing” that makes companies run. People, processes and systems play the supporting role –creating, changing, merging, analyzing, moving and transforming data.

Data is the object or asset, the rest are the tools to help data flow throughout the company, so decisions can be made, products can be produced and company goals can be achieved. Think of it like cars on a highway supporting the goal of helping people travel from their various starting points to their desired destinations. The starting and ending points may change, a bridge or a stoplight may be replaced with something else, but what doesn’t change is a bunch of people on the move.

Data is what fuels your company growth and integration is what connects all the parts of your company.

How Data Can Help You Manage Change

The operational data your company produces (every second of every day) contains a wealth of information about what changes are occurring both within your business and in the environment where you operate. This information can give you insights into how you may need to adapt to take advantage of opportunities and protect your company from threats.

People, process and systems are constantly changing anyway (or so it seems) – data can help you direct this change in structured and impactful ways. Some of this data is sourced from inside your company through your day-to-day operations while other data is gathered from your environment.

The Need for a “Universal Connectivity Platform” to Connect Anything, Anywhere and Anytime

With the great diversity of IT systems your company operates directly, accesses through cloud providers, or connect to as part of the broader business ecosystem (partners, suppliers, customers, governments, banks, etc.), it can be difficult to manage all of the pieces that are coming and going to ensure your employees and leaders have access to all the data they need to do their jobs. That’s why Actian developed the DataConnect integration solution.

DataConnect is that Universal Connect™ platform that empowers anyone to integrate anything, anywhere and anytime. On-premise software, cloud services, partner systems, IoT devices and other data sources can be connected to reporting, analytics and decision-support systems, enabling seamless sharing of data across your company.

As your business evolves to match the dynamic needs of the marketplace, old systems can be removed, new systems can be added and business processes and supplier relationships can change, but your data will remain, to help you understand where you’ve been, where you are now and where you must move during the future.

Actian is the industry leader in hybrid data management, integration and analytics.

These solutions enable you to manage seamlessly and connect your operational and analytics data for superior performance, insights and business outcomes. Actian DataConnect is the universal connection platform that can help you manage your data – the connective tissue of your business, so you are better prepared to respond to the changes ahead. To learn more, visit www.actian.com.

 

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.
Data Integration

What’s New in Actian DataFlow Version 6.6.1?

Actian Corporation

November 26, 2018

Actian logo

Actian DataFlow is a parallel workflow platform for end-to-end data access, transformation, preparation, and predictive analysis that eliminates performance bottlenecks in your data-intensive applications. Complimentary to the Actian Vector analytic database, DataFlow leverages concurrency, parallelism, and pipelining to accelerate data movement between locations in your data architecture, creating faster results. DataFlow eliminates memory constraints, as well as the need for data movement into specific data stores before analytics are run. DataFlow understands the available resources before breaking up the execution into smaller chunks that can be run in parallel to take maximum advantage of horizontal and vertical scaling.

DataFlow combines the KNIME (open source data mining platform) drag-and-drop visual workflow environment with the underlying Actian DataFlow platform to provide greater control over the entire process of reading the data, performing the transformation and analytic functions, and writing the results.

What’s new in version 6.6.1?

This update to DataFlow certifies new releases of the most popular Hadoop distributions from Apache, Cloudera, Hortonworks, and MapR, as well as improves parallel load features with the most recent releases of Vector for Linux, Windows, and Hadoop. There is also support for five new data types with DataFlow to handle a larger variety of data formats.  Here is the specific list:

  • Added support for Apache Hadoop 3.0.1+.
  • Added support for MapR 6.0.1.
  • Verified Support for MapR 5.2.2.
  • Updated support for Hortonworks HDP 2.6.
  • Updated support for Cloudera CDH 5.15.
  • Updated direct vector loader to work with latest Vector & VectorH (5.0 & 5.1).
  • Added support for 5 new base types: Money, ip4, ip6, uuid, and Intervals (Periods and Durations).
  • Added support for latest AWS authentication library (1.11 previously 1.3).
  • Added support for logical paths in Hadoop.

DataFlow Results:

For data preparation, data onboarding, and ETL use cases, DataFlow can accelerate the process by factors of 20X or more. For one customer who previously required 17 hours to prepare and cleanse data to onboard new customers, DataFlow reduced the time to 43 minutes, almost 24 times faster. Another customer used DataFlow to parallelize and pipeline their data transfer process to reduce their claims processing times from more than 30 hours to just 20 minutes, which is 90 times faster. In a third example DataFlow demonstrated that it could load 500,000 records into an Oracle database in seven seconds, versus a previous runtime of more than 3 minutes. Contact results@actian.com if you are interested in seeing a demonstration.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.
Data Architecture

What is an Operational Data Warehouse? Why Does it Matter?

Actian Corporation

October 23, 2018

the next big thing

Data warehouses have been around for decades, and have established themselves as reliable reporting systems with consistent value. They have also evolved into data marts, specialized appliances, and EDW variants to meet emerging needs, but all of these solutions have their drawbacks when it comes to meeting today’s business demands. Some of the common pitfalls to avoid include stale data, slow query performance, long development cycles, and high costs associated with these variants. There is a better approach. An operational data warehouse (ODW) addresses the need for operational data analytics with the characteristics listed below, without any of the pitfalls:

1. Current:
As the demands for organizations to operate in real-time or in the moment increase, data warehouses need to deliver ever more current data. SQL Hadoop databases commonly fail to handle continuous streams of updates as the file system is optimized for infrequent batch updates, with a moving window of historical data. Lack of current data can mean businesses fail to respond to threats and opportunities fast enough to stay competitive.

2. Fast:
Built on an underlying architecture optimized for analytic query performance, requiring little or no tuning in anticipation of certain workloads (like indexing or aggregations), maximizing the variety of workloads it can support.

3. Scalable:
Scales to large data capacities with an economical and flexible storage layer, connecting to a variety of existing legacy and new sources of data.

4. Secure:
Offers multiple data protection mechanisms to meet enterprise security requirements and comply with tough regulatory environments.

5. Flexible:
Offers flexible deployment options, on-premises and multi-cloud options.

6. Robust:
Delivers enterprise-level resiliency and manageability.

Such an ODW solution provides a database system that can deliver near-real-time insights for ad hoc self-service data discovery and analytics using the most current operational data storage.

actian avatar logo

About Actian Corporation

Actian empowers enterprises to confidently manage and govern data at scale, streamlining complex data environments and accelerating the delivery of AI-ready data. The Actian data intelligence approach combines data discovery, metadata management, and federated governance to enable smarter data usage and enhance compliance. With intuitive self-service capabilities, business and technical users can find, understand, and trust data assets across cloud, hybrid, and on-premises environments. Actian delivers flexible data management solutions to 42 million users at Fortune 100 companies and other enterprises worldwide, while maintaining a 95% customer satisfaction score.