Learning Hadoop Preview

Learning Hadoop

Start my 1-month free trial

Course details

Hadoop is indispensable when it comes to processing big data—as necessary to understanding your information as servers are to storing it. This course is your introduction to Hadoop; key file systems used with Hadoop; its processing engine, MapReduce, and its many libraries and programming tools. Developer and big-data consultant Lynn Langit shows how to set up a Hadoop development environment, run and optimize MapReduce jobs, code basic queries with Hive and Pig, and build workflows to schedule jobs. Plus, learn about the depth and breadth of available Apache Spark libraries available for use with a Hadoop cluster, as well as options for running machine learning jobs on a Hadoop cluster.

Instructor

  • Click here to view Lynn Langit’s instructor page

    Lynn Langit

    Cloud Architect, Developer, Angel Investor

    Lynn Langit is a cloud architect who works with Amazon Web Services and Google Cloud Platform.

    Lynn specializes in big data projects. She has worked with AWS Athena, Aurora, Redshift, Kinesis, and the IoT. She has also done production work with Databricks for Apache Spark and Google Cloud Dataproc, Bigtable, BigQuery, and Cloud Spanner.

    Lynn is also the cofounder of Teaching Kids Programming. She has spoken on data and cloud technologies in North and South America, Europe, Africa, Asia, and Australia.

Skills covered in this course

Viewers of this course

36,484 people watched this course

Contents