What is Spark?
→ Apache Spark is an open-source cluster computing framework. It is a general-purpose engine for large-scale data processing and handles both batch workloads and data generated in real time (streaming).
→ Spark is a big data processing framework.
→ Spark itself is written mainly in Scala.
→ Spark applications can be written in Java, Scala, Python, or R.
→ Spark is a processing engine only. It has no storage layer of its own, so it reads from and writes to external storage systems such as HDFS.
→ Spark can process structured, semi-structured, and unstructured data.
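→ Spark was developed as a faster alternative to Hadoop MapReduce and can run on top of Hadoop (for example, on YARN with data stored in HDFS). It is optimized to keep intermediate data in memory, whereas MapReduce writes intermediate results to disk between stages. As a result, Spark often processes data much faster than MapReduce, particularly for workloads that reuse the same data many times.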
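
The difference between handing results over through disk and keeping them in memory can be pictured with a small toy sketch. This is plain Python, not Spark or Hadoop code: the function names (tokenize, count, run_with_disk_handoff, run_in_memory) and the sample sentences are made up for illustration. It only shows that both approaches give the same answer while one of them pays for an extra write and read between stages.

```python
import json
import os
import tempfile


def tokenize(lines):
    """Stage 1: split each line into lowercase words."""
    return [word for line in lines for word in line.lower().split()]


def count(words):
    """Stage 2: count how often each word occurs."""
    totals = {}
    for word in words:
        totals[word] = totals.get(word, 0) + 1
    return totals


def run_with_disk_handoff(lines):
    """MapReduce-style toy: stage 1 output is written to disk, then read back."""
    fd, path = tempfile.mkstemp(suffix=".json")
    try:
        with os.fdopen(fd, "w") as handle:
            json.dump(tokenize(lines), handle)  # intermediate result goes to disk
        with open(path) as handle:
            words = json.load(handle)  # the next stage has to reload it
        return count(words)
    finally:
        os.remove(path)  # clean up the temporary file


def run_in_memory(lines):
    """Spark-style toy: stage 1 output goes straight to stage 2 in memory."""
    return count(tokenize(lines))


# Hypothetical sample input, used only for this illustration.
sample = ["Spark keeps data in memory", "MapReduce writes data to disk"]
disk_result = run_with_disk_handoff(sample)
memory_result = run_in_memory(sample)
```

Both functions return the same word counts. In a real cluster the disk handoff is repeated between every stage of a job, which is where the in-memory approach is expected to save most of its time.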