# Additional Help for SQL++ and Mondial The [AsterixDB documentation on SQL++](https://asterixdb.apache.org/docs/0.9.4/sqlpp/manual.html) is a very valuable resource. This guide is only intended to give more tangible examples and explanation as to how SQL++ works and point out probably useful features of SQL++ for the homework. Completing this HW is not at all dependent on the material covered here. You are allowed to use the entirety of SQL++ to complete the assignment. Understanding Your Data (`mondial.adm`)
2021-08-24 14:28:36 78.99MB CSE414 HW6 axBase
CSE 414 Homework 7: Parallel Data Processing and Spark Objectives: To write distributed queries. To learn about Spark and running distributed data processing in the cloud using AWS. What to turn in: Your Spark code in the sparkapp.py file. Spark Programming Assignment (75 points) In this homework, you will be writing Spark and Spark SQL code, to be executed both locally on your machine and also using Amazon Web Services. We will be using a similar flight dataset used in previous homework. This time, however, we will be using the entire data dump from the US Bureau of Transportation Statistics, which consists of information about all domestic US flights from 1987 to 2011 or so. The data is in Parquet format. Your local runs/tests will use a subset of the data (in the flights_small directory) and your cloud jobs will use the full data (stored on Amazon S3)
2021-08-24 14:28:09 2.34MB CSE414 HW7 pySpark