MSDN Community | Big Data Topics

Apache Hadoop

Explore the core components of Hadoop, including HDFS, MapReduce, and YARN, for distributed storage and processing of large datasets.

Category: Frameworks | Last Updated: 2023-10-27

Learn about Spark's in-memory processing capabilities, its APIs (Scala, Python, Java, R), and how it accelerates big data analytics.

Category: Processing Engines | Last Updated: 2023-10-26

Understand Kafka's role as a distributed event streaming platform, essential for real-time data pipelines and stream processing.

Category: Streaming | Last Updated: 2023-10-25

Discover various types of NoSQL databases (e.g., MongoDB, Cassandra, Redis) and their applications in handling unstructured and semi-structured data.

Category: Databases | Last Updated: 2023-10-24

Explore the concepts of data lakes and data warehouses, their differences, and best practices for data storage and management.

Category: Data Management | Last Updated: 2023-10-23

Learn how MLOps practices apply to big data scenarios, covering model deployment, monitoring, and lifecycle management.

Category: Operations | Last Updated: 2023-10-22