Data and Visual Analytics

less than 1 minute read

Description:

Four assignments demonstrating techniques and tools for analyzing and visualizing data at scale. Languages & Frameworks used: JavaScript (D3.js), Python, Java, Scala, Pig, SQL, HTML, CSS, Hadoop, Spark, and AWS.

Data and Visual Analytics Assignments:

  1. HW1-Python, SQLite, D3, Gephi, and OpenRefine
  2. HW2-D3 Graphs and Visualization
  3. HW3-Hadoop, Spark, Pig, AWS and Azure
  4. HW4-Scalable PageRank, Random Forest, Weka

Languages:
Javascript, Python (2.7), Java, Scala, Pig, SQL, HTML, CSS

Libraries:
D3.js, TopoJSON, NumPy, SciPy

Frameworks:
Flask, Hadoop, Spark

Relational Database Management System:
SQLite3

Software/Applications:
Weka, Gephi, OpenRefine, Tableau, VirtualBox (with VM image from CDH), Azure ML Studio

Cloud Services:
Amazon Web Services (AWS):

  • Storage (S3)
  • Elastic Cloud Computing (EC2)
  • Elastic MapReduce (EMR)

Microsoft Azure:

  • Azure Blob Storage
  • HDInsight (Linux Cluster)