Data Warehousing News, Trends, Analysis

Hardware and software that support the efficient consolidation of data from multiple sources in a Data Warehouse for Reporting and Analytics include ETL (Extract, Transform, Load), EAI (Enterprise Application Integration), CDC (Change Data Capture), Data Replication, Data Deduplication, Compression, Big Data technologies such as Hadoop and MapReduce, and Data Warehouse Appliances.

Future Facts and Type 2 Dimensions

One creates the potential for some interesting anomalies when building a star schema wherein the fact table contains future-dated metrics and any of the dimensions are Type 2.  A Type 2 dimension tracks changes to the data items contained within it. Effectively, each dimension contains a surrogate key, a natural key with a start and stop date, and additional descriptor columns. If any of the descriptor column values change, the existing dimension row has the stop date populated while a new row is inserted with the same natural key, new start date, and new descriptor values. 

Posted May 13, 2020

6 Steps to Win with Data: First, Get Out of Denial

What more companies need today is a “data lab” to create ideas from data and a “data factory” to turn those ideas into products. Google, Amazon, and other data-driven giants already work like this. So should companies outside of technology. 

Posted May 13, 2020

It’s 2020—Do You Know Where Your Data Security Gaps Are?

Threats to data security are constantly evolving and the array of data privacy laws keeps expanding too, making it critical for companies to understand where their data is stored across the enterprise and safeguard it properly. Download the free Cybersecurity Sourcebook to learn about the pit­falls to avoid and the key approaches and best practices to embrace when tackling data security, governance, and regulatory compliance issues.

Posted May 13, 2020

Announcing Data Summit Connect 2020

With most people working from home offices and travel plans on hold, it’s more important than ever to stay connected. That’s why the annual Data Summit conference is going digital this year with Data Summit Connect, a free 3-day series of webinars presented by DBTA and Big Data Quarterly from Tuesday, June 9, to Thursday, June 11. Registration is now open.

Posted May 13, 2020

After the Pandemic: IT Leaders Discuss What Will Change

Business and IT have, in the past, used times of crisis to adapt and transform themselves for the better. While the COVID-19 pandemic has undeniably had a devastating effect on business, and life itself, it may also be a catalyst for change, compelling organizations to rethink their long-term operations and spending amidst the short-term crisis brought on by the health emergency. Industry leaders recently discussed what “the new normal” may look like.

Posted May 13, 2020

MemSQL to Leverage $50 Million in New Capital to Grow Innovations

MemSQL, the No-Limits Database for operational analytics and cloud-native applications, is receiving $50 million in new capital after signing a debt facility agreement with Hercules Capital, the largest non-bank venture debt provider with more than $2.4 billion in total assets, who served as underwriter for the financing.

Posted May 11, 2020

Cloudera Expands Machine Learning Abilities for MLOps

Cloudera, the enterprise data cloud company, is releasing an expanded set of production machine learning capabilities for MLOps, now available in Cloudera Machine Learning (CML). Organizations can manage and secure the ML lifecycle for production machine learning with CML’s new MLOps features and Cloudera SDX for models.

Posted May 06, 2020

Oracle Gains FedRAMP High Authorization for its Cloud Infrastructure-Government Cloud Offering

Oracle Cloud Infrastructure-Government Cloud has achieved FedRAMP High Authorization. “Government customers rely on Oracle Cloud to run their most critical workloads. With FedRAMP High and Impact Level 5 authorizations, we are able to support the highest levels of security standards for unclassified workloads across local, state, and federal government, as well as the Department of Defense,” said Scott Twaddle, vice president, Regulated Markets, Oracle Cloud Infrastructure.

Posted May 06, 2020

Database Performance Monitoring with SolarWinds

As the popularity of open source databases such as MySQL, PostgreSQL, and MongoDB grows, so does the need for “enterprise class” performance monitoring and tuning solutions. Leading online, SaaS, and digital commerce companies now run these databases as the backbone of their applications.

Posted May 06, 2020

Dremio Data Lake Engine Now Available in AWS Marketplace

Dremio has announced the free availability of the Dremio AWS Edition, a self-service data lake engine highly optimized for Amazon Web Services (AWS) and available in AWS Marketplace. “Dremio brings interactive BI and data science to Amazon S3, and with our new AWS Edition we are dramatically lowering the cost per query and making data lake insights accessible to data consumers in organizations of any size,” said Tomer Shiran, chief product officer, Dremio.

Posted May 05, 2020

VAST Data Enhances the Universal Storage Platform

VAST Data, a storage company, is releasing Version 3 of its Universal Storage architecture, introducing more than 20 new features – including support for Windows and MacOS applications, cloud data replication, and native encryption. These latest Universal Storage updates allow enterprises to now marry all-flash performance with archive economics and scale to enable mission-critical and data-intensive enterprise production environments to consolidate their workflows and bring the power of flash storage and fast access to all of their data.

Posted May 01, 2020

What to Look for When Modernizing the Data Lake

Data lake adoption has more than doubled over the past three years. The technologies and best practices surrounding data lakes continue to evolve – and so do the challenges. Currently in use by 45% of DBTA subscribers to support data science, data discovery and real-time analytics initiatives, data lakes are still underpinned by Hadoop in many cases, although cloud-native approaches are on the rise. From data governance and security, to data integration and architecture, new approaches are required for success.

Posted April 30, 2020

Kong Releases Open Source API Design Editor

Kong Inc., a cloud connectivity company, is releasing a new open source tool called Insomnia Designer, offering a collaborative API design editor. Building on Insomnia Core, which Kong acquired in 2019, the software works natively with Insomnia’s testing capabilities to accelerate the development, performance and stability of REST and GraphQL services, the communications backbone of the modern applications and services people rely on each day.

Posted April 30, 2020

Latest Swarm64 Release Accelerates PostgreSQL Performance

Swarm64, a provider of database acceleration solutions for the PostgreSQL open source database, is releasing Swarm64 DA 4.0, database acceleration software that extends PostgreSQL with the ability to analyze data orders of magnitude faster than usual.

Posted April 23, 2020

The Decade of Data

To get a full appreciation for the incredible pace of change in business technology, look at the past 6 years. In 2014, IDC published a report that said that, by 2020, the digital universe would contain nearly as many digital bits as there are stars in the universe, and the data we create and copy annually would reach 44 zettabytes, or 44 trillion gigabytes. Guess what? It’s 2020. And it turns out IDC was correct in assuming that we were about to endure a data deluge.

Posted April 22, 2020

Circonus Announces Availability of Spring 2020 Release with Kubernetes and Cloud Monitoring

Circonus, provider of a machine data intelligence platform, has announced its Spring 2020 release. The release includes a Kubernetes monitoring solution that provides health-based alerting and horizontal pod auto-scaling, cloud monitoring, GCP Marketplace availability, performance improvements, and a more comprehensive Terraform integration. 

Posted April 21, 2020

VAST Data Receives $100 Million in Latest Funding Round

VAST Data, a storage company, has raised $100 million in Series C funding which will be used to drive global expansion and accelerate the company’s next phase of growth.

Posted April 16, 2020

Pepperdata Releases Streaming Spotlight Platform

Pepperdata, a provider of Analytics Stack Performance (ASP) solutions, is releasing Streaming Spotlight, a new product in Pepperdata’s data analytics performance suite enabling Kafka integration. The suite is purpose-built for IT operations teams, giving them a single, comprehensive view of their analytics stack, both in the cloud and on premises.

Posted April 14, 2020

Maximizing the Value of Digital Data

Even before the IT elements of data optimization begin, aligning organizational culture around a data-driven mindset will be a major challenge. Making the case for data optimization is important. Even before the IT elements of data optimization begin, aligning organizational culture around a data-driven mindset will be a major challenge. Making the case for data optimization is important.

Posted April 08, 2020

What Businesses Need to Know About Data Migration in Mergers and Carveouts

With $3.6 trillion in mergers and acquisitions completed in 2019 alone, M&A activity has been booming. However, a merger or acquisition isn’t just a business decision and a business process. It’s also a massive undertaking on the IT side, as you figure out how to migrate and integrate business applications and business data.

Posted April 08, 2020

How Organizations Can Drive Better ROI Through DevOps Testing

The hype around DevOps and its potential to drive greater ROI across a wide range of enterprise operations increased substantially in the last decade. However, as these expectations carry into 2020, organizations will start to take a more sober approach to DevOps implementations. While DevOps was initially seen as a widespread solution to all sorts of enterprise IT issues, the implementation of DevOps approaches is now shaping up to become more strategic and focused, with much of the emphasis on how to maximize the ultimate return on investment.

Posted April 08, 2020

The Next Great Frontier: Automating Data and Application Deployments

DevOps, DataOps, AI, and containers all lead to one important innovation for enterprises seeking to be more data-driven—and that is greater automation. Data-driven enterprises cannot function if data resources and applications are in any way being manually administered, deployed, remediated, or upgraded.

Posted April 08, 2020

Developers Increasingly See Databases as Applications That Need DevOps

When it comes to DevOps, developers increasingly recognize databases to be code sets that require ongoing integration and deployment. They are “another code deployment which can and should be managed, tested, automated, and improved with the same robust, reliable methodologies applied to application code,” according to the authors of a recent survey of 2,000 developers.

Posted April 08, 2020

Startups to Watch in 2020

Cutting-edge startups are constantly emerging to address new challenges and problems in ways never thought possible. Many of these young, innovative companies have fresh approaches that tap into blockchain, quantum computing, advanced analytics, AI, DevOps methodologies, containerization, and data security advancements. To shine a spotlight on some of the ways innovation in IT is being reflected today, here, DBTA presents 28 companies we think are worth watching in 2020.

Posted April 08, 2020

Neo4J Creates Platform for Graph Data Science

Neo4j, a provider of graph technology, is launching Neo4j for Graph Data Science, a data science environment built to harness the predictive power of relationships for enterprise deployments. Neo4j for Graph Data Science helps data scientists leverage highly predictive, yet largely underutilized relationships and network structures to answer unwieldy problems.

Posted April 08, 2020

Talend Extends Partnership with Databricks

Talend, a provider of in cloud data integration and data integrity, is bolstering its partnership with Databricks. With the Winter ’20 release of Talend Data Fabric, including Stitch Data Loader for data ingest, Talend now supports Delta Lake. The comprehensive support enables data ingestion into lakehouse environments where data warehouse management features are combined with low-cost storage.

Posted April 08, 2020

Talend Collaborates with Developers to Provide ETL Tool for COVID-19 Data

Talend is joining the fight against COVID-19 by collaborating with developers from the Singer open source community and Bytecode to create an ETL tool for COVID-19 datasets. Talend standardizes the data, augments it with metadata, then routes the results to a data warehouse or data lake: Amazon Redshift, Amazon S3, Snowflake, Microsoft Azure Synapse Analytics, Delta Lake for Databricks, or Google BigQuery.

Posted April 07, 2020

New Public-Private Consortium Targets U.S. Supercomputing Resources at Fighting COVID-19

The White House has announced the launch of the COVID-19 High Performance Computing Consortium to provide COVID-19 researchers worldwide with access to the world’s most powerful high performance computing resources that can significantly advance the pace of scientific discovery in the fight to stop the virus. The public-private consortium, spearheaded by the White House, the U.S. Department of Energy, and IBM, includes government, industry, and academic leaders who have volunteered free compute time and resources on their machines.

Posted April 06, 2020

Oracle Offers New Cloud Developer Certification and Free Training Resources

Oracle has announced a new Developer Associate certification for Oracle Cloud Infrastructure. The Developer Associate certification is intended for developers who have 6 months of experience in developing and maintaining applications. With this addition, Oracle now offers five distinct certifications for architects, operators, and developers on Oracle Cloud Infrastructure.

Posted March 26, 2020

New LogDNA Features Emphasize Speed and Scalability

LogDNA, a provider of multi-cloud log management solutions, has introduced performance and usability updates that enable developers to more easily query, filter, and gain insight from their log data. “The complexity of developing, deploying, and scaling applications is exponentially more complicated today than even just a few months ago, and the amount of data even small teams deal with on a daily basis is becoming untenable,” said Peter Cho, vice president of product management at LogDNA.

Posted March 25, 2020

Pure Storage Launches Next Generation Flash Array to Accelerate Innovation

Pure Storage, a data solutions provider delivering a modern data experience, is releasing its third-generation all-NVMe FlashArray//X, providing customers with higher performance. With Pure Storage’s Evergreen Storage model, customers can enjoy access to continuous innovation from Pure Storage that includes these and future updates to its product and solutions suite. 

Posted March 25, 2020

Oracle Announces Fiscal 2020 Third Quarter Results

Oracle last week announced strong results for fiscal 2020 Q3. Total revenues were $9.8 billion, up 2% in USD and 3% in constant currency compared to Q3 last year. Cloud services and license support revenues were $6.9 billion, up 4% in USD and 5% in constant currency. “Subscription revenues, made up of cloud services and license support revenues, grew 5% in constant currency. These consistently growing and recurring subscription revenues now account for 71% of total company revenues,” said Safra Catz, Oracle CEO.

Posted March 18, 2020

Discover How InfluxDB Can Support Streaming Data with Flux

Stream processing is gaining prominence within organizations, it unifies applications and analytics by processing data as it arrives, in real-time, and detects conditions within a short period of time from when data is received. The key strength of stream processing is that it can provide insights faster, often within milliseconds to seconds. With that being said, stream processing naturally fits with time series data, as most continuous data series are time series data.

Posted March 17, 2020

Rockset Releases Platform to Hasten Data Application Development

Rockset, a real-time database in the cloud, is releasing Query Lambdas, enabling developers to build data applications faster than ever before. As a real-time database in the cloud, Rockset eliminates roadblocks and, with Query Lambdas, allows developers to use their own data as an API to quickly build modern data applications.

Posted March 12, 2020

DH2i Platform now Available for Linux in AWS Marketplace

DH2i, a provider of multi-platform Software Defined Perimeter (SDP) and Smart Availability software, announced that its DxEnterprise for SDP-enhanced Microsoft SQL Server Availability Groups (AGs) is now available for Linux on RHEL and Ubuntu in AWS Marketplace.

Posted March 11, 2020

Pages

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

Source