Introduction to big-data using PySpark: Glossary

Key Points

Introduction to UIO galaxy eduPortal
  • Use UIO Galaxy eduPortal to start a pySpark jupyter notebook

  • Start a pySpark jupyter notebook from the UIO Galaxy eduPortal.

Map-filter-Reduce in python
  • Use python for writing map, filter and reduce

Introduction to (Py)Spark
  • Spark and RDD

Introduction to Spark SQL
  • Spark SQL select

Glossary

FIXME