Touro GST Search
Go to Top of Touro GST website

Let’s Go Jump in a Data Lake Understanding Data Storage for Future Analysts

Prof. Jacqueline Spiegel-Cohen

2026-02-25


Explore databases, data warehouses, and data lakes in simple terms—and discover why storage shapes the questions data analysts can answer.

A four-panel infographic illustrating data storage concepts: a glowing

By Jacqueline Spiegel-Cohen (and Google Gemini)


If you’re thinking about studying data analytics, you’ll hear a lot of new phrases tossed around—databases, warehouses, lakes, pipelines. It can sound intimidating, like there’s some secret room everyone else already knows how to enter. There isn’t. Let’s demystify it.


Data Is Just Stuff—Until You Decide What to Do With It


At its heart, data is simply recorded facts: names, dates, prices, clicks, temperatures, grades, heartbeats. What matters isn’t just having data—it’s how it’s stored, because storage shapes what questions you can ask later. Think of data storage as different kinds of containers for different purposes.



The Spreadsheet: The Kitchen Table


This is where many people start. A spreadsheet is flat, visible, familiar. Rows and columns. Easy to open, easy to break.


Spreadsheets are wonderful for learning patterns and small-scale analysis—but they don’t scale well, don’t enforce rules, and don’t like multiple people touching them at once. Great for home cooking. Not great for feeding a city.



The Database: The Filing Cabinet


A database is organized, structured, and rule-bound. Tables relate to each other. Data types matter. You can’t just put anything anywhere.


This is where operational data lives: customer records, orders, grades, inventory. Databases are optimized for accuracy and speed—What is the current balance? Who is enrolled right now? If spreadsheets are casual, databases are disciplined.



The Data Warehouse: The Library


A data warehouse is designed for analysis, not daily operations. It collects data from many sources, cleans it up, and reshapes it so patterns are easy to see.


Warehouses answer questions like:

  • What changed over time?
  • How did last year compare to this year?
  • What trends are emerging?

If a database is about now, a warehouse is about history.



The Data Lake: The Wild Swimming Hole


And now—yes—the data lake.


A data lake stores everything:

  • structured data (tables),
  • semi-structured data (logs, JSON),
  • unstructured data (text, images, audio).

Raw. Untamed. Often messy.


You don’t need to decide upfront how you’ll use the data. You just keep it. That flexibility is powerful—but dangerous. Without care, a data lake can turn into a data swamp. Hence the phrase: Go jump in a data lake. Refreshing if you know what you’re doing. Risky if you don’t.



Why This Matters for a Future Data Analyst


Learning analytics isn’t just learning tools—it’s learning judgment:

  • What data belongs where?
  • What questions can this structure answer?
  • What assumptions are hidden in the storage itself?

Storage is not neutral. It shapes truth.


When you understand how data is stored, you begin to understand why some questions are easy, some are expensive, and some are impossible. And that’s when you stop feeling like data is happening to you—and start using it intentionally.



So… Why Step Into All This?


Because data lives everywhere now.


Every decision—medical, financial, educational, environmental—leaves a trail. But raw data doesn’t speak on its own. It needs people who are curious, careful, and thoughtful enough to ask the right questions of it.


Data analytics sits at a rare crossroads:

  • logic and creativity,
  • structure and exploration,
  • discipline and discovery.

Some days you are organizing the filing cabinet. Other days, you are standing at the edge of the lake, deciding where to dive.


You don’t need to have all the answers when you start. You learn by seeing how data behaves in different environments—what it reveals when it’s tightly structured, and what it whispers when it’s left raw and wide open.


If you enjoy patterns, puzzles, storytelling, or simply making sense of complexity, this field invites you in. It’s dynamic, evolving, and deeply human at its core—because behind every dataset is a real-world question that matters to someone.


So yes—go jump in a data lake. Just bring curiosity, respect for the water, and a willingness to learn how to swim.


More Posts