Apache Spark Streaming with Python and PySpark | SMARTYBRO
16 Apr, 2018
Requirements
Python Programming Skills
A computer with decent memory & internet connection
Ubuntu, macOS, or Windows as an operating system
Description
What is this course about: This course teaches you how to use the Python bindings for Apache Spark’s data streaming capabilities. These skills are valuable to anyone working in data science today.
What will you learn from this lecture: In this course, you’ll learn how to use Apache Spark for data streaming, and how to use it with the lingua franca of data science: Python. You’ll see demos of how to handle and manipulate many different types of data, and get hands-on experience with exercises such as building a Twitter analytics tool.
You’ll also learn how to use PySpark with other popular streaming tools like Apache Kafka (used by companies such as LinkedIn for their data streaming) and AWS tools like Kinesis.
Why should you learn Apache Spark streaming: Spark streaming is becoming incredibly popular, and with good reason. According to IBM, ninety percent of the data in the world today has been created in the last two years alone. Our current output of data is roughly 2.5 quintillion bytes per day. The world is being immersed in data, more so each and every day.
As a result, analyzing static dataframes of unchanging data is becoming a less practical approach to more and more problems. This is where data streaming comes in: the ability to process data almost as soon as it’s produced, while taking the time-dependency of the data into account.
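To make the contrast with static analysis concrete, here is a minimal sketch in plain Python. It does not use Spark or the PySpark API; the function name `running_word_counts` and the sample data are hypothetical and chosen only for illustration. It treats the input as a sequence of small batches, one per time window, and updates the result as each batch arrives instead of waiting for the complete dataset.

```python
from collections import Counter
from typing import Dict, Iterable, Iterator, List


def running_word_counts(batches: Iterable[List[str]]) -> Iterator[Dict[str, int]]:
    """Yield the cumulative word counts after each micro-batch arrives."""
    totals: Counter = Counter()
    for batch in batches:
        for line in batch:
            totals.update(line.lower().split())
        # Emit a snapshot right away instead of waiting for the full dataset.
        yield dict(totals)


# Hypothetical input: each inner list stands for the lines received in one time window.
sample_batches = [
    ["spark streaming", "spark"],
    ["python streaming"],
]

snapshots = list(running_word_counts(sample_batches))
for i, snapshot in enumerate(snapshots, start=1):
    print(f"after batch {i}: {snapshot}")
```

A streaming engine applies the same idea at scale, with the batches coming from a live source rather than an in-memory list.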
What programming language is this course taught in: Python 3 (with heavy use of Jupyter Notebooks)
Who is the target audience?
Python Developers looking to get better at Data Streaming
Managers or Senior Engineers in Data Engineering Teams