In this intermediate-level course, individuals learn how to solve a real-world use case with Machine Learning (ML) and produce actionable results using Amazon SageMaker. This course walks through the stages of a typical data science process for Machine Learning from analyzing and visualizing a dataset to preparing the data, and feature engineering. Individuals will also […]
This course introduces students to the Cloud Computing value proposition; Cloud Computing solution models, and core Amazon Web Services (AWS) services and foundational technologies. Course attendees are provided with insights that will enable them to intelligently translate their organization’s business requirements into Cloud and AWS-based IT solutions. Topics covered include: Articulate the Cloud Computing Business […]
Big Data on AWS introduces you to cloud-based big data solutions such as Amazon Elastic MapReduce (EMR), Amazon Redshift, Amazon Kinesis and the rest of the AWS big data platform. In this course, we show you how to use Amazon EMR to process data using the broad ecosystem of Hadoop tools like Hive and Hue. […]
This course provides an in-depth overview of the choices you have in processing Big Data. It introduces Big Data, the types of data you might have, approaches to working on and processing the data, and the capabilities, strengths, and weaknesses of those approaches. Topics covered include: NewSQL Databases NoSQL Overview Hadoop and MapReduce Apache Pig […]
This objective of this course is to fully explore the uses of Microsoft Excel 365 as a Data Analytics tool. Most business professionals are familiar with the core functionality of Excel. This course explores some of the additional capabilities and advanced features of Excel for analyzing, manipulating and visualizing data.
Success of many organizations depends on their ability to derive business insights from massive amount of raw data coming from various sources. Apache Spark offers many engineering improvements over the traditional MapReduce programming model as implemented in Hadoop by providing multi-pass in-memory processing of data which boosts the overall performance of your ETL and machine-learning […]
Matplotlib is a data visualization library for Python. As part of the SciPy data analysis library it is widely used to create data graphics. However, Matplotlib is older than the pandas library, the most common Python library for data frame manipulation. The Matplotlib library requires some extra steps when plotting data from pandas data frames […]
This course introduces Tableau with an emphasis on creating powerful visualizations with your data. We will connect to data sources and perform basic filtering before displaying the data. The class is a mixture of lecture and hands-on labs. Topics covered include: Tableau Overview Connecting to data sources Data Visualization Plotting Bar charts, pie charts Heat […]
Data Warehousing on AWS introduces you to concepts, strategies, and best practices for designing a cloud-based data warehousing solution using Amazon Redshift, the petabyte-scale data warehouse in AWS. This course demonstrates how to collect, store, and prepare data for the data warehouse by using other AWS services such as Amazon DynamoDB, Amazon EMR, Amazon Kinesis […]
This training course introduces the students to Apache Hadoop and key Hadoop ecosystem projects: Pig, Hive, Sqoop, and Spark. This training course is supplemented by a variety of hands-on labs that help attendees reinforce their theoretical knowledge of the learned material and gain practical experience of working with Apache Hadoop and related Apache projects.
We are constantly faced with a vast amount of complex information – often more than we can handle. Well-designed visual interpretations of data improve comprehension, communication, and decision making. This workshop introduces data methods and techniques that increase the understanding of complex data. The focus is on conveying ideas effectively with visually appealing charts, graphs and […]
In recent years industry, not just academia, has found that creating powerful data models provides the next level of value past traditional business intelligence. This course focuses on state of the art machine learning techniques combined with a practical approach designed to teach you to process your data and build models using Python’s scikit-learn. In […]