Available courses

Stat 286 - Data Science Ecosystems

Introduction to the basics of the Linux operating system. Command line interface is used to explore Linux/Unix utilities and tools. Fundamental concepts of python programming. Create and run python scripts from the command line. Introduction to database management systems, the basic structure of relational databases, and how to manage databases with Structured Query Language (SQL). Read and write simple and complex SQL statements. Integrate R, Python and SQL skills.  Introduction to Big Data Hadoop and Spark ecosystems.

An Introduction to 

 

  1. Visualization
  2. Hadoop, HDFS, Ecosystem that includes MapReduce, Spark, HBase, Pig, Hive, NoSQL, ...