Hadoop is a mature Big Data environment, and Hive is the de facto standard for its SQL interface. Today, computations in Hadoop are usually done with Spark, which offers an optimized compute engine that covers batch processing, real-time streaming, and machine learning. This course covers Hadoop 3, Hive 3, and Spark 3.
This course is a survey of processes and tools commonly used in applications that rely heavily on data analysis. The course will describe data pipelines deployed by data engineers and data scientists to ingest data for use in an application and to manipulate that data for use by analysts. Hands-on activities will include a combination […]
We are constantly faced with a vast amount of complex information – often more than we can handle. Well-designed visual interpretations of data improve comprehension, communication, and decision making. This workshop introduces data methods and techniques that increase the understanding of complex data. The focus is on conveying ideas effectively with visually appealing charts, graphs and […]
This course introduces participants to both supervised and unsupervised learning algorithms and discusses which kinds of datasets lend themselves to each ML technique. Hands-on labs, all done in Jupyter Notebooks, are designed to help the learner understand the concepts. Where necessary, background material in Linear Algebra, Probability, and Python will […]
This course covers the various methods and best practices that are in line with business and technical requirements for modeling, visualizing, and analyzing data with Power BI. The course will show how to access and process data from a range of data sources, both relational and non-relational. Additionally, this course will discuss […]
In this course, you will learn about the process of planning and designing both relational and nonrelational databases. You will learn the design considerations for hosting databases on Amazon Elastic Compute Cloud (Amazon EC2). You will learn about our relational database services including Amazon Relational Database Service (Amazon RDS), Amazon Aurora, and Amazon Redshift. You […]
This intensive hands-on training introduces the audience to the core aspects of scalable data processing using Python on the Apache Spark platform. The students will learn the essentials of Python with the primary focus being on the capabilities of the Apache Spark platform and its Machine Learning module. The students will be introduced to the […]
41.7% of Developers Use Python. 73.1% of Developers Love Using Python. Fastest Growing Language Today! This course introduces the Python language to students who want to use Python as a tool for their data science initiatives. The goal is to become proficient enough with the Python language to leverage powerful Data Science packages such as […]
This course teaches many concepts and capabilities of the R programming language. Topics include importing data, data visualization using ggplot2, built-in R datatypes & structures, and general R syntax. Upon completion of the course, students will be able to import, analyze, and summarize large, complex data sets using R.
This course provides you with an overview of Structured Query Language (SQL) so that you can quickly begin working with and analyzing data with other data science tools. Before you can analyze data, you need to have the correct data. Many organizations store their data in structured databases and SQL is the language of choice to […]
Advanced Architecting on AWS is intended for individuals who are experienced with designing scalable and elastic applications on the AWS platform. Building on concepts introduced in Architecting on AWS, this course covers how to build complex solutions that incorporate data services, governance, and security on AWS. This […]
The Advanced Developing on AWS course uses the real-world scenario of taking a legacy, on-premises monolithic application and refactoring it into a serverless microservices architecture. This three-day advanced course covers advanced development topics such as architecting for a cloud-native environment; deconstructing on-premises, legacy applications and repackaging them into cloud-based, cloud-native architectures; and applying the tenets […]