AccountIcon BigDataIcon BlogIcon default_resource_icon CartIcon checkmark_icon cloud_devops_icon computer_network_admin_icon cyber_security_icon gsa_schedule_icon human_resources_icon location_icon phone_icon plus_icon programming_software_icon project_management_icon redhat_linux_icon search_icon sonography_icon sql_database_icon webinar_icon

Search UMBC Training Centers

Big Data Analytics

Hortonworks HDP Operations: Administration Foundations

+ View more dates & times
  • Overview

    This course is intended for systems administrators who will be responsible for the design, installation, configuration, and management of the Hortonworks Data Platform (HDP). The course provides in-depth knowledge and experience in using Apache Ambari as the operational management platform for HDP. This course presumes no prior knowledge or experience with Hadoop.

  • Who Should Take This Course

    AUDIENCE

    Linux administrators and system operators responsible for installing, configuring and managing an HDP cluster.

    PREREQUISITES

    Students must have experience working in a Linux environment with standard Linux system commands. Students should be able to read and execute basic Linux shell scripts. Basic knowledge of SQL statements is recommended, but not a requirement. In addition, it is recommended for students to have some operational experience in data center practices, such as change management, release management, incident management, and problem management.

  • Schedule
  • Course Outline

    DAY 1 OBJECTIVES

    • Describe Apache Hadoop
    • Summarize the Purpose of the Hortonworks Data Platform Software Frameworks
    • List Hadoop Cluster Management Choices
    • Describe Apache Ambari
    • Identify Hadoop Cluster Deployment Options
    • Plan for a Hadoop Cluster Deployment
    • Perform an Interactive HDP Installation using Apache Ambari
    • Install Apache Ambari
    • Describe the Differences Between Hadoop Users, Hadoop Service Owners, and Apache Ambari Users
    • Manage Users, Groups and Permissions
    • Identify Hadoop Configuration Files
    • Summarize Operations of the Web UI Tool
    • Manage Hadoop Service Configuration Properties Using the Apache Ambari Web UI
    • Describe the Hadoop Distributed File System (HDFS)
    • Perform HDFS Shell Operations
    • Use WebHDFS
    • Protect Data Using HDFS Access Control Lists (ACLs)

    DAY 1 LABS

    • Setting Up the Environment
    • Installing HDP
    • Managing Ambari Users and Groups
    • Managing Hadoop Services
    • Using HDFS Storage
    • Using WebHDFS
    • Using HDFS Access Control Lists

    DAY 2 OBJECTIVES

    • Describe HDFS Architecture and Operation
    • Manage HDFS using Ambari Web, NameNode and DataNode UIs
    • Manage HDFS using Command-line Tools
    • Summarize the Purpose and Benefits of Rack Awareness
    • Configure Rack Awareness
    • Summarize Hadoop Backup Considerations
    • Enable and Manage HDFS Snapshots
    • Copy Data Using DistCP
    • Use Snapshots and DistCP Together
    • Identify the Purpose and Operation of Heterogeneous HDFS Storage
    • Summarize the Purpose and Operation of HDFS Centralized Caching
    • Configure HDFS Centralized Cache
    • Define and Manage Cache Pools and Cache Directives
    • Identify HDFS NFS Gateway Use Cases
    • Recall HDFS NFS Gateway Architecture and Operation
    • Install and Configure an HDFS NFS Gateway
    • Configure an HDFS NFS Gateway Client

    DAY 2 LABS

    • Managing HDFS Storage
    • Managing HDFS Quotas
    • Configuring Rack Awareness
    • Managing HDFS Snapshots
    • Using DistCP
    • Configuring HDFS Storage Policies
    • Configuring HDFS Centralized Cache
    • Configuring an NFS Gateway

    DAY 3 OBJECTIVES

    • Describe YARN Resource Management
    • Summarize YARN Architecture and Operation
    • Identify and Use YARN Management Options
    • Summarize YARN Response to Component Failure
    • Understand the Basics of Running Simple YARN Applications
    • Summarize the Purpose and Operation of the YARN Capacity Scheduler
    • Configure and Manage YARN Queues
    • Control Access to YARN Queues
    • Summarize the Purpose and Operation of YARN Node Labels
    • Describe the Process used to Create Node Labels
    • Describe the Process Used to Add, Modify and Remove Node Labels
    • Configure Queues to Access Node Label Resources
    • Run Test Jobs to Confirm Node Label Behavior

    DAY 3 LABS

    • Managing YARN Using Ambari
    • Managing YARN Using CLI
    • Running Sample YARN Applications
    • Setting Up for Capacity Scheduler
    • Managing YARN Containers and Queues
    • Managing YARN ACLs and User Limits
    • Working with YARN Node Labels

    DAY 4 OBJECTIVES

    • Summarize the Purpose of NameNode HA
    • Configure NameNode HA Using Ambari
    • Summarize the Purpose of ResourceManager HA
    • Configure ResourceManager HA using Apache Ambari
    • Identify Reasons to Add, Replace and Delete Worker Nodes
    • Demonstrate How to Add a Worker Node
    • Configure and Run the HDFS Balancer
    • Decommission and Re-commission a Worker Node
    • Describe the Process of Moving a Master Component
    • Summarize the Purpose and Operation of Apache Ambari Metrics
    • Describe the Features and Benefits of the Apache Ambari Dashboard
    • Summarize the Purpose and Benefits of Apache Ambari Blueprints
    • Recall the Process Used to Deploy a Cluster Using Ambari Blueprints
    • Recall the Definition of an HDP Stack and Interpret its Version Number
    • View the Current Stack and Identify Compatible Apache Ambari Software Versions
    • Recall the Types of Methods and Upgrades Available in HDP
    • Describe the Upgrade Process, Restrictions and Pre-upgrade Checklist
    • Perform an Upgrade Using the Apache Ambari Web UI

    DAY 4 LABS

    • Configuring NameNode HA
    • Configuring Resource Manager HA
    • Adding, Decommissioning and Re-commissioning a Worker Node
    • Configuring Ambari Alerts
    • Deploying an HDP Cluster Using Ambari Blueprints
    • Performing an HDP Upgrade – Express
  • FAQs
    Is there a discount available for current students?

    UMBC students and alumni, as well as students who have previously taken a public training course with UMBC Training Centers are eligible for a 10% discount, capped at $250. Please provide a copy of your UMBC student ID or an unofficial transcript or the name of the UMBC Training Centers course you have completed. Online courses are excluded from this offer.

    What is the cancellation and refund policy?

    Student will receive a refund of paid registration fees only if UMBC Training Centers receives a notice of cancellation at least 10 business days prior to the class start date for classes or the exam date for exams.

Contact Us