Processing a large text file.
Sigiloso
You can load the text file into a Java Spark RDD and then process it with map-reduce operations. You need to be aware of three things: 1) the heap memory allocated to Spark; 2) the number of CPU cores allocated for processing (for example, it must be fewer than 8 on an 8-core local machine); 3) whether objects are persisted in memory or on disk. Choosing the persistence level well optimizes intermediate data storage and can improve performance.
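As a rough illustration of the three points above, here is a minimal sketch of a word count over a text file, assuming Spark 2.x or later with the Java API on the classpath. The class name `LargeFileWordCount`, the helper `countWords`, the `local[4]` master and the `4g` heap size are all assumptions made for the example, not values taken from the question.

```java
import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaPairRDD;
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext;
import org.apache.spark.storage.StorageLevel;
import scala.Tuple2;

import java.util.Arrays;
import java.util.Map;

public class LargeFileWordCount {

    // Hypothetical helper: counts how often each word occurs in the file at `path`.
    static Map<String, Integer> countWords(String path) {
        SparkConf conf = new SparkConf()
                .setAppName("LargeFileWordCount")
                // 2) CPU cores: run locally on 4 threads (assumed value,
                //    chosen to stay below the cores of an 8-core machine).
                .setMaster("local[4]");
        // 1) Heap memory: in local mode the driver heap has to be set before
        //    the JVM starts, e.g. `spark-submit --driver-memory 4g ...` or
        //    `java -Xmx4g ...` (4g is an assumed value).

        try (JavaSparkContext sc = new JavaSparkContext(conf)) {
            // The file is read lazily, partition by partition.
            JavaRDD<String> lines = sc.textFile(path);

            // Map step: split each line into words.
            // 3) Persistence: keep the words in memory and spill to disk if
            //    they do not fit, because the RDD is used twice below.
            JavaRDD<String> words = lines
                    .flatMap(line -> Arrays.asList(line.split("\\s+")).iterator())
                    .filter(word -> !word.isEmpty())
                    .persist(StorageLevel.MEMORY_AND_DISK());

            // First action: materializes and caches `words`.
            System.out.println("Total words: " + words.count());

            // Reduce step: sum the occurrences per word, reusing the cache.
            JavaPairRDD<String, Integer> counts = words
                    .mapToPair(word -> new Tuple2<>(word, 1))
                    .reduceByKey(Integer::sum);

            // Brings the result to the driver; only sensible if it is small.
            return counts.collectAsMap();
        }
    }

    public static void main(String[] args) {
        countWords(args[0]).forEach((word, n) -> System.out.println(word + "\t" + n));
    }
}
```

If the result itself is large, writing it out with `counts.saveAsTextFile(...)` instead of `collectAsMap()` avoids pulling everything into the driver's heap.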