A systematic Benchmarking on the performance of Spark-SQL for processing Vast RDF datasets
This project is maintained by DataSystemsGroupUT
These figures show the comparative representation of Storage file formats (i.e. HDFS [CSV,AVRO, PARQUET, ORC]) for 100M, 250M, and 500M respectively.