Data lake
What is a data lake?

A data lake is a repository where data is ingested in its original form without alteration. Unlike data warehouses or silos, data lakes use flat architecture with object storage to maintain the files’ meta data. It is most useful when it is part of a greater data management platform and integrates well with existing data and tools for more powerful analytics. The goal is to uncover insights and trends while being secure, scalable, and flexible.

Abstract image of data slice.
  • Data lakes explained
  • Why do organizations choose data lakes?
  • Benefits of a data lake
  • Data lake vs. data warehouse
  • What are data lake platforms?
  • How are data lakes used today?
  • HPE and data lakes
Data lakes explained

Data lakes explained

A data lake is used to hold a large amount of data in its native, raw format in a central location—typically the cloud. By leveraging inexpensive object storage, open formats, and cloud scalability, a variety of applications can take advantage of the wealth of data contained in a data lake.

  • All types of qualitive data, including unstructured (often called big data) and semi-structured data can be stored—which is critical for today’s machine learning and advanced analytics use cases.
  • In the networking space, think of infrastructure and endpoint telemetry being used as descriptors or classifiers that feed AI/ML models and algorithms to identify baselines and anomalies.
  • As a customer, your infrastructure and endpoint clients feed the data lake, and your networking vendor maintains it to deliver AI-based tools that help IT operate your network more efficiently.
Data lake explained.
TAP IMAGE TO ZOOM IN

Related products, solutions or services

HPE Data Solutions

Related topics

Data Lakehouse

What is AIOps?

Delta Lake