Apache Kafka Series – Learn Apache Kafka for Beginners

Apache Kafka Series – Learn Apache Kafka for Beginner
English | Size: 1.03 GB
Category: CBTs

Learn about Apache Kafka Ecosystem, Architecture, Core Concepts and Operations
Master Fundamental Concepts behind Apache Kafka Like Topics, Partitions, Brokers, Producers, Consumers and Many More
Start a personal Kafka Cluster for development purposes (we’ll install and use Docker for this)
Create and Configure Topics and Start Writing Data to and Reading Data from Topic
Integrate with popular programming frameworks, such as Spark, Spark Streaming, Akka Actors, Akka Stream, Scala and Apache NiFi
Lehrplan für diesen Kurs
[Read more…]

Packt Apache Kafka Series Learn Apache Kafka for Beginners-XQZT

Packt Apache Kafka Series Learn Apache Kafka for Beginners-XQZT
English | Size: 3.87 GB
Category: Tutorial

Apache Kafka has become the leading data-streaming enterprise technology. Kafka is used in production by over 2000 companies like Netflix, Airbnb, Uber and LinkedIn. To learn Kafka easily, step-by-step, you have come to the right place! Apache Kafka and its ecosystem: In this section, we will learn about the Apache Kafka ecosystem, and see how some target architectures may look. This high-level section helps you to set context around Kafka! Apache Kafka Core concepts: In this section, we will learn about all the fundamental concepts of Kafka like topics, partitions, replication, brokers, producers, consumers, consumer groups, Zookeeper, delivery semantics, and more! Docker and Kafka Setup: In this section, we will learn how to install Docker on your machine and get started with Apache Kafka, in the simplest way possible. Apache Kafka Hands-on Practice: In this section, we will gain some practical experience by learning how the various command lines tool work, as well as how to use the Kafka Topics UI, and create your very first producer and consumer in Java. Code Examples – Libraries Integrations: In this section, we will learn about some more advanced code examples, and understand where to find the libraries to integrate with frameworks such as Spark, Spark Streaming, Akka Streams, Scala, Actors, Apache NiFi. Advanced Topic Configuration: In this section, we will understand the main configurations for your topics, learn about log compaction, and understand exactly what your partitions are made of! [Read more…]

Packt Introduction to Apache NiFi (Hortonworks DataFlow HDF 2 0)-XQZT

Packt Introduction to Apache NiFi (Hortonworks DataFlow HDF 2 0)-XQZT
English | Size: 2.00 GB
Category: CBTs

Apache NiFi was initially used by the NSA so they could move data at scale and was then open sourced. Being such a hot technology, Onyara (the company behind it) was then acquired by Hortonworks, one of the main backers of the big data project Hadoop, and then Hadoop Data Platform. Apache NiFi is now used in many top organisations that want to harness the power of their fast data by sourcing and transferring information from and to their database and big data lakes. It is a key tool to learn for the analyst and data scientists alike. Its simplicity and drag and drop interface make it a breeze to use! You can start building flows between Kafka and ElasticSearch, an FT,P and MongoDB, and so much more! Your imagination is the limit This course will take you through the Apache NiFi technology. It will help you understand its fundamental concepts, with theory lessons that walk you through the core concepts of Apache NiFi. You will also have hands-on labs to get started and build your first data flows. You will learn how to set up your connectors, processors, and how to read your FlowFiles to make the most of what NiFi has to offer. The most important configuration options will be demonstrated so you will be able to get started in no time. We will also analyse a template picked from the web and understand how to debug your flows as well as route your data to different processors based on outcomes through relationships. We will finally learn about the integrations between NiFi and Apache Kafka or MongoDB. Lots of learning ahead! [Read more…]

Docker, Apache Mesos & DCOS Run and manage cloud datacenter

Docker, Apache Mesos & DCOS Run and manage cloud datacenter
English | Size: 1.23 GB
Category: CBTs

Docker is open source engine that can help you automate the deployment of applications inside software containers. Is was released in March 2013 and has been gaining popularity ever since. It has over 100 million downloads, and over 75000 applications are running as dockerized applications – that is a LOT! Apache Mesos is an open source cluster manager that provides efficient resource isolation and sharing across distributed applications or frameworks. Mesosphere DC/OS is an enterprise-grade datacenter-scale operating system, providing a single platform for running containers, big data, and distributed apps in production. DC/OS is built on the Apache Mesos core and provides newer technology including the native container-orchestration, Marathon application platform, intuitive user interfaces and much more. Knowledge and experience about Docker, Apache Mesos, and DC/OS could be very valuable for your career. [Read more…]

Udemy – Taming Big Data with Apache Spark and Python – Hands On (Feb-2018)

Udemy – Taming Big Data with Apache Spark and Python – Hands On (Feb-2018)
English | Size: 1.42 GB
Category: Tutorial

Created by Sundog Education by Frank Kane, Frank Kane
Last updated 2/2018
English
What Will I Learn?
Frame big data analysis problems as Spark problems
Use Amazon’s Elastic MapReduce service to run your job on a cluster with Hadoop YARN
Install and run Apache Spark on a desktop computer or on a cluster
Use Spark’s Resilient Distributed Datasets to process and analyze large data sets across many CPU’s
Implement iterative algorithms such as breadth-first-search using Spark
Use the MLLib machine learning library to answer common data mining questions
Understand how Spark SQL lets you work with structured data
Understand how Spark Streaming lets your process continuous streams of data in real time
Tune and troubleshoot large jobs running on a cluster
Share information between nodes on a Spark cluster using broadcast variables and accumulators
Understand how the GraphX library helps with network analysis problems
Requirements [Read more…]

Packt Publishing – Cloud Computing with Apache CloudStack – Run your own cloud

Packt Publishing – Cloud Computing with Apache CloudStack – Run your own cloud
English | Size: 2.69 GB
Category: CBTs

This course provides detailed demos of installation and configuration of Apache CloudStack. Thus it will equip you well for future use of this technology.

Did you know cloud computing is one of the leading industries around the world which is experiencing astounding growth year over year? When you talk about cloud computing – what names or terms come to mind? AWS, Azure? Well, that is great – but what about building your own private cloud infrastructure? Do you ever wonder how do these public cloud provider companies stand up and maintain such massive infrastructure? Do you ever wonder how YOU can build a private cloud instance for your project or company? [Read more…]

Lynda – Apache Spark Essential Training – Big Data Engineering

Lynda – Apache Spark Essential Training – Big Data Engineering
English | Size: 368.79 MB
Category: Tutorial

In order to construct data pipelines and networks that stream, process, and store data, data engineers and data-science DevOps specialists must understand how to combine multiple big data technologies. In this course, discover how to build big data pipelines around Apache Spark. Join Kumaran Ponnambalam as he takes you through how to make Apache Spark work with other big data technologies. He covers the basics of Apache Kafka Connect and how to integrate it with Spark for real-time streaming. In addition, he demonstrates how to use the various technologies to construct an end-to-end project that solves a real-world business problem. [Read more…]

Packt Publishing – Real Time Streaming using Apache Spark Streaming

Packt Publishing – Real Time Streaming using Apache Spark Streaming
English | Size: 208.13 MB
Category: CBTs

Spark is the technology that allows us to perform big data processing in the MapReduce paradigm very rapidly, due to performing the processing in memory without the need for extensive I/O operations.

Recently, the streaming approach to processing events in near real time became more widely adopted and more necessary. In this course, you will learn how to handle big amount of unbounded infinite streams of data. You will analyze data and draw conclusions from it. Furthermore, we will look at common problems when processing event streams: sorting, watermarks, deduplication, and keeping state (for example, user sessions). You will also implement streaming processing using Spark Streaming and analyze traffic on a web page in real time. [Read more…]

Apache CloudStack – Install, build and run IaaS cloud

Apache CloudStack – Install, build and run IaaS cloud
English | Size: 505.55 MB
Category: Tutorial

Did you know cloud computing is one of the leading industries around the world which is experiencing astounding growth year over year?
Did you know cloud computing is one of the leading industries around the world which is experiencing astounding growth year over year?
When you talk about cloud computing – what names or terms come to mind? AWS, Azure?
Well, that is great – but what about building your own private cloud infrastructure?
Do you ever wonder how do these public cloud provider companies stand up and maintain such massive infrastructure? [Read more…]

Lynda – Installing Apache, MySQL, and PHP

Lynda – Installing Apache, MySQL, and PHP
English | Size: 324.92 MB
Category: Tutorial

This course describes how to install and configure Apache HTTP Server, MySQL database server, and PHP, known collectively as the AMP stack, on a local development computer. David Gassner covers different installation approaches, including installing the components separately on Windows, macOS, and Linux and installing the prepackaged WampServer and MAMP bundles. Plus, learn how to troubleshoot port conflicts and other AMP-related issues.
[Read more…]

Skip to toolbar