Cloud Academy – Big Data Analytics on Azure

Cloud Academy – Big Data Analytics on Azure
English | Size: 1.03 GB
Category: Tutorial


Microsoft Azure provides robust services for analyzing big data. One of the most effective ways is to store your data in Azure Data Lake Storage Gen2 and then process it using Spark on Azure Databricks.

Pluralsight – Big Data LDN A GDPR Retrospective

Pluralsight – Big Data LDN A GDPR Retrospective-NOLEDGE
English | Size: 120.09 MB
Category: Tutorial


Big Data LDN 2019 | A GDPR Retrospective: Implementation by a Large-scale Data Organization in Reality | Morri Feldman The date May 25, 2018 was a fateful day for many companies that process and store client data, particularly across the EU. On this day, GDPR went into effect and no one really knew quite what its effects would be. This talk will take you through our company’s journey to compliance – the indexers we used to append & delete client data, and a retrospective of how this affected our data processing operations. This will walk you through the design through implementation, as well as expectation vs. real demand. Eventually, what we imagined would be requested by hundreds of clients at best ended up being requested by tens of thousands and growing. Learning how to manage this new compliance demand alongside our day to day data engineering tasks and processes was no easy feat

Linux Academy – Big Data Fundamentals

Linux Academy – Big Data Fundamentals-BiFiSO
English | Size: 607.48 MB
Category: Tutorial

If you’re completely new to big data and aren’t quite sure what it is, why it’s neccessary, and how it works, then this is the course for you! We are going to clarify what big data is (and isn’t), while also defining some other related terms around data characterization and analysis methods. Then, we will talk about some architectural problems with big data and how we solve them with cluster computing, distributed storage, and cluster managment. Lastly, we will cover some of the popular technologies and illustrate how big data is used in the real world to hopefully shine a light on how big data is already impacting your daily life – whether you realize it or not. Let’s get started!

Cloud Academy – AWS Big Data Specialty Data Collection

Cloud Academy – AWS Big Data Specialty Data Collection-STM
English | Size: 560.28 MB
Category: Tutorial

In course one of the AWS Big Data Specialty Data Collection learning path we explain the various data collection methods and techniques for determining the operational characteristics of a collection system. We explore how to define a collection system able to handle the frequency of data change and the type of data being ingested. We identify how to enforce data properties such as order, data structure, and metadata, and to ensure the durability and availability for our collection approach Intended audience:

Cloud Academy – AWS Big Data Athena

Cloud Academy – AWS Big Data Athena-STM
English | Size: 409.74 MB
Category: CBTs

In this course, we will perform an in-depth review of the Amazon Athena service. We will review and explain fundamental AWS Athena storage and querying concepts. We will highlight suitable use cases in which Athena can be applied effectively. You will be introduced to the basic underlying technology that Athena has been built on. We spend time discussing the process of creating and setting up Athena databases, tables, and partitions. We examine the process in which Athena SQL queries are authored and how they are managed. We review current Athena limitations and pricing. Finally, we will provide a demonstration in which we publish CloudTrail logs into an S3 bucket. We make some ad-hoc security group changes to generate a few CloudTrail events – and finally we ll use Athena to search and find the captured security group API update calls