Spark has many advantages, but it also has certain limitations. Some of them are:
No automatic optimization: Apache Spark developers need to optimize their code manually (for example, choosing partition counts and deciding what to cache), since this tuning is not done for them automatically.
No file system: Spark does not have a file system of its own; it relies on Hadoop (HDFS) or object storage to store data.
Small file issue: Like Hadoop, Spark suffers from the small files problem. Its performance degrades when it has to process many small files.
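One common way to soften the small files problem is to compact many small inputs into fewer, larger files. The snippet below is only a minimal sketch of the arithmetic involved: the helper name `target_file_count` and the 128 MiB target size are assumptions chosen for illustration, not something this article prescribes.

```python
import math

MIB = 1024 * 1024


def target_file_count(file_sizes_bytes, target_bytes=128 * MIB):
    """Estimate how many output files a compaction job should write.

    file_sizes_bytes: iterable of input file sizes in bytes.
    target_bytes: desired size of each output file (assumed: 128 MiB).
    """
    if target_bytes <= 0:
        raise ValueError("target_bytes must be positive")
    total = sum(file_sizes_bytes)
    # Always write at least one file, even for empty or tiny inputs.
    return max(1, math.ceil(total / target_bytes))


# Example: 10,000 files of 64 KiB each (about 625 MiB in total).
small_files = [64 * 1024] * 10_000
print(target_file_count(small_files))  # prints 5
```

In PySpark, a count like this could be passed to `DataFrame.repartition(n)` or `DataFrame.coalesce(n)` before writing the data back out, so that later jobs read a handful of large files instead of thousands of small ones.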
Expensive: Because Spark is an in-memory computation engine, it requires a large amount of RAM for processing, which adds cost.
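To get a feel for how quickly the RAM requirement grows, the following rough sketch estimates the total memory requested by a set of executors. The function name `cluster_memory_bytes` and the example cluster size are hypothetical; the 10% overhead factor and 384 MiB minimum are assumptions that mirror Spark's documented defaults for executor memory overhead, and driver memory is left out.

```python
MIB = 1024 * 1024
GIB = 1024 * MIB


def cluster_memory_bytes(num_executors, executor_memory_bytes,
                         overhead_factor=0.10,
                         min_overhead_bytes=384 * MIB):
    """Rough total memory requested by all executors, in bytes.

    Each executor asks for its heap plus a non-heap overhead, taken here
    as the larger of a fixed minimum and a fraction of the heap size.
    """
    overhead = max(min_overhead_bytes,
                   int(executor_memory_bytes * overhead_factor))
    return num_executors * (executor_memory_bytes + overhead)


# Hypothetical cluster: 20 executors with 16 GiB of heap each.
total = cluster_memory_bytes(num_executors=20,
                             executor_memory_bytes=16 * GIB)
print(f"{total / GIB:.1f} GiB")  # roughly 352 GiB
```

Even this modest hypothetical setup asks for several hundred GiB of RAM, which is where much of the cost comes from.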
Latency: Apache Spark has higher latency than native stream-processing engines such as Apache Flink.
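One frequently cited reason for this latency is that Spark's streaming traditionally processes data in micro-batches rather than record by record. The article itself does not give this explanation, so treat it as an assumption. The toy simulation below is not Spark code; the name `batch_wait_ms` and the arrival times are made up. It only shows how long each event waits for the next batch boundary.

```python
def batch_wait_ms(arrival_ms, batch_interval_ms):
    """Time an event waits until the next micro-batch boundary.

    Events that arrive exactly on a boundary are assumed to wait 0 ms.
    """
    if batch_interval_ms <= 0:
        raise ValueError("batch_interval_ms must be positive")
    return (-arrival_ms) % batch_interval_ms


# Made-up event arrival times (ms) and a 1-second batch interval.
arrivals = [100, 400, 900, 1200, 1900]
waits = [batch_wait_ms(t, 1000) for t in arrivals]
print(waits)                    # [900, 600, 100, 800, 100]
print(sum(waits) / len(waits))  # 500.0
```

With evenly spread arrivals, the average wait comes out near half the batch interval, before any processing time is added. A record-at-a-time engine does not have this built-in delay.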