AccountIcon BigDataIcon BlogIcon default_resource_icon CartIcon checkmark_icon cloud_devops_icon computer_network_admin_icon cyber_security_icon gsa_schedule_icon human_resources_icon location_icon phone_icon plus_icon programming_software_icon project_management_icon redhat_linux_icon search_icon sonography_icon sql_database_icon webinar_icon

Search UMBC Training Centers

Big Data Analytics

Python for Data Science

+ View more dates & times
  • Overview

    This course introduces the Python language to students who want to use Python as a tool for their data science initiatives. The goal is to become proficient enough with the Python language to leverage powerful Data Science packages such as Pandas and matplotlib.

    This is a comprehensive introduction to Python programming with a focus on understanding and using the Pandas library for storing data in DataFrames and plotting portions of the data with matplotlib. In addition to data visualization, you will learn how to use the Pandas library to import and filter data. Typical data science skills such as data interpretation and analysis will be addressed.

  • Who Should Take This Course

    Audience

    This course is suitable for: Data analysts, Data scientists, Data engineers, and Developers.

    PREREQUISITES

    Students should have a basic proficiency in some programming language. Prerequisite language skills include understanding of datatypes, Boolean logic, control flow and basics of collections, such as arrays or hash tables. An understanding of using Excel for data manipulation is helpful.

  • Schedule
  • Course Outline

    Getting Around in Python

    · Using Python at the Command Line

    · Running the Interactive Shell

    · Using Jupyter Notebooks

    Jupyter Notebook Basics

    · Cell Types

    · Edit and Command Mode

    · Running cells

    · Output

    · Restarting the Kernel

    · Exporting the Notebook

    · Cell and Line Magics

    Python Basics

    · Comments, Indenting, print()

    · Variables

    · Types

    · Operators

    · Control Flow

    Collections

    · Lists

    · Tuples

    · Sets

    · Dictionaries

    Comprehensions

    · List and Set Type Comprehensions

    · Comprehensions as Generator Expressions

    Functions and Lambda Expressions

    · Built-in Functions

    · User-defined functions

    · Anonymous in-line functions

    Using Modules

    · Importing and Selective Importing

    · Properties

    · Methods

    · random and math Modules

    Data Sources and Formats

    · CSV, TSV

    · JSON

    · SQL

    · Others: XML, YAML, Splunk

    Using NumPy

    · ndarray

    · Indexing and Slicing

    · Masking and Broadcasting

    · Sorting

    Pandas Basics

    · Why Pandas?

    · Series

    · DataFrames

    · Populating DataFrames

    · Importing CSV, Excel, SQL Data

    · DataFrame Columns and Cells

    · DataFrame Retrieval

    Pandas and Data Analysis

    · Functions on DataFrames

    · Mapping

    · Using Lambdas

    · Sorting

    · Statistics

    · Merging and Concatenating DataFrames

    · Data Cleaning

    · Data Analysis

    · Groupby

    · Aggregate Functions

    Data Visualization

    · Plotting with matplotlib

    · Enhancing Visualizations with seaborn

  • FAQs
    Is there a discount available for current students?

    UMBC students and alumni, as well as students who have previously taken a public training course with UMBC Training Centers are eligible for a 10% discount, capped at $250. Please provide a copy of your UMBC student ID or an unofficial transcript or the name of the UMBC Training Centers course you have completed. Online courses are excluded from this offer.

    What is the cancellation and refund policy?

    Student will receive a refund of paid registration fees only if UMBC Training Centers receives a notice of cancellation at least 10 business days prior to the class start date for classes or the exam date for exams.

Contact Us