Manning Publication – The Ultimate Introduction to Big Data

Manning Publication – The Ultimate Introduction to Big Data
English | Size: 4.40 GB
Category: Tutorial


Designed for data storage and processing, Hadoop is a reliable, fault-tolerant operating system. The most celebrated features of this open source Apache project are HDFS, Hadoop’s highly-scalable distributed file system, and the MapReduce data processing engine. Together, they can process vast amounts of data across large clusters. An ecosystem of hundreds of technologies has sprung up around Hadoop to answer the ever-growing demand for large-scale data processing solutions. Understanding the architecture of massive-scale data processing applications is an increasingly important and desirable skill, and you’ll have it when you complete this liveVideo course!
[Read more…]

SKILLSHARE BIG DATA ANALYSIS WITH APACHE SPARK PYSPARK PYTHON

SKILLSHARE BIG DATA ANALYSIS WITH APACHE SPARK PYSPARK PYTHON
English | Size: 337.27 MB
Category: Tutorial

Spark can perform up to 100x faster than Hadoop MapReduce Data processing framework, Which makes apache spark one of most demanded skills.

The top companies like Google, Facebook, Microsoft, Amazon, Airbnb using Apache Spark to solve their big data problems!. Data analysis, on huge amount of data is one of the most valuable skills now a days and This course will teach such kind of skills to complete in big data job market. [Read more…]

Packt Hands-On PySpark for Big Data Analysis

Packt Hands-On PySpark for Big Data Analysis
English | Size: 658.47 MB
Category: Tutorial

Data is an incredible asset, especially when there are lots of it. Exploratory data analysis, business intelligence, and machine learning all depend on processing and analyzing Big Data at scale.

How do you go from working on prototypes on your local machine, to handling messy data in production and at scale?

This is a practical, hands-on course that shows you how to use Spark and it’s Python API to create performant analytics with large-scale data. Don’t reinvent the wheel, and wow your clients by building robust and responsible applications on Big Data. [Read more…]

Cisco NetFlow LiveLessons Big Data Analytics for Cyber Security

Cisco NetFlow LiveLessons: Big Data Analytics for Cyber Security
English | Size: 6.28 GB
Category: Cisco | E-learning | Security | others

Cisco NetFlow LiveLessons: Big Data Analytics for Cyber Security
Copyright 2016
Edition: 1st
Online Video
ISBN-10: 0-13-446985-2
ISBN-13: 978-0-13-446985-0

More than 6 hours of video training covering everything you need to know to deploy, configure, and troubleshoot NetFlow in many different Cisco platforms and learn big data analytics technologies for cyber security. [Read more…]

Big Data Principles and Best Practices of Scalable Realtime Data Systems

Big Data Principles and Best Practices of Scalable Realtime Data Systems
English | Size: 252.36 MB
Category: Audio boook

Big Data teaches you to build big data systems using an architecture designed specifically to capture and analyze web-scale data. This book presents the Lambda Architecture, a scalable, easy-to-understand approach that can be built and run by a small team. You’ll explore the theory of big data systems and how to implement them in practice. In addition to discovering a general framework for processing big data, you’ll learn specific technologies like Hadoop, Storm, and NoSQL databases. [Read more…]

Packtpub – Hands-On Big Data Analysis with Hadoop 3

Packtpub – Hands-On Big Data Analysis with Hadoop 3
English | Size: 450 MB
Category: Programming | E-learning | others

This course is your guide to performing real-time data analytics and stream processing with Spark. Use different components and tools such as HDFS, HBase, and Hive to process raw data. Learn how tools such as Hive and Pig aid in this process.
In this course, you will start off by learning data analysis techniques with Hadoop using tools such as Hive. Furthermore, you will learn to apply these techniques in real-world big data applications. Also, you will delve into Spark and its related tools to perform real-time data analytics, streaming, and batch processing on your application.
Finally, you’ll learn how to extend your analytics solutions to the cloud. [Read more…]

Udemy – Taming Big Data with Apache Spark and Python – Hands On (Feb-2018)

Udemy – Taming Big Data with Apache Spark and Python – Hands On (Feb-2018)
English | Size: 1.42 GB
Category: Tutorial

Created by Sundog Education by Frank Kane, Frank Kane
Last updated 2/2018
English
What Will I Learn?
Frame big data analysis problems as Spark problems
Use Amazon’s Elastic MapReduce service to run your job on a cluster with Hadoop YARN
Install and run Apache Spark on a desktop computer or on a cluster
Use Spark’s Resilient Distributed Datasets to process and analyze large data sets across many CPU’s
Implement iterative algorithms such as breadth-first-search using Spark
Use the MLLib machine learning library to answer common data mining questions
Understand how Spark SQL lets you work with structured data
Understand how Spark Streaming lets your process continuous streams of data in real time
Tune and troubleshoot large jobs running on a cluster
Share information between nodes on a Spark cluster using broadcast variables and accumulators
Understand how the GraphX library helps with network analysis problems
Requirements [Read more…]

Lynda – Architecting Big Data Applications – Real-Time Application Engineering

Lynda – Architecting Big Data Applications – Real-Time Application Engineering
English | Size: 103.64 MB
Category: Tutorial

Real-time systems have guaranteed response times that can be sub-seconds from the trigger. Meaning that when a user clicks a button, your app better respond-and fast. Architecting applications under real-time constraints is an even bigger challenge when you’re dealing with big data. Excessive latency can cost you money, in terms of system resources consumed and customers lost. Luckily, big data technology and efficient architecture can provide the real-time responsiveness your business needs. In this course, you can learn about use cases and best practices for architecting real-time applications with technologies such as Kafka, Hazelcast, and Apache Spark.
[Read more…]

Lynda – Architecting Big Data Applications – Batch Mode Application Engineering

Lynda – Architecting Big Data Applications – Batch Mode Application Engineering
English | Size: 178.14 MB
Category: Tutorial

Batch mode consolidates data-related operations in order to reduce the load on networks. Batch mode helps software architects build big data applications that operate smoothly and efficiently under real-world conditions. In this course, you can learn about use cases and best practices for architecting batch mode applications using technologies such as Hive and Apache Spark.
[Read more…]

SQL on Hadoop – Analyzing Big Data with Hive

SQL on Hadoop – Analyzing Big Data with Hive
English | Size: 406.67 MB
Category: Tutorial

This course will teach you the Hive query language and how to apply it to solve common Big Data problems. This includes an introduction to distributed computing, Hadoop, and MapReduce fundamentals and the latest features released with Hive 0.11
[Read more…]

Skip to toolbar