Archives for Apache Spark - Page 2

24 Dec

Top 8 Alternatives To Apache Spark

Launched in the year 2009, Apache Spark is an open-source unified analytics engine for large-scale data processing. With more than 28k GitHub stars, this analytics engine can be said as one of the most active open-sourced big data projects and is popular for its various intuitive features. Some of its features include ease of writing…

The post Top 8 Alternatives To Apache Spark appeared first on Analytics India Magazine.

24 Dec

Top 8 Alternatives To Apache Spark

Launched in the year 2009, Apache Spark is an open-source unified analytics engine for large-scale data processing. With more than 28k GitHub stars, this analytics engine can be said as one of the most active open-sourced big data projects and is popular for its various intuitive features. Some of its features include ease of writing…

The post Top 8 Alternatives To Apache Spark appeared first on Analytics India Magazine.

31 Aug

Python Vs Scala For Apache Spark

Apache Spark is a popular open-source data processing framework. This widely-known big data platform provides several exciting features, such as graph processing, real-time processing, in-memory processing, batch processing and more quickly and easily. With the expansion of data generation, organisations have started utilising these vast amounts of data to gain meaningful insights. Big data tools…

The post Python Vs Scala For Apache Spark appeared first on Analytics India Magazine.

30 Dec

Top 11 Tools For Distributed Machine Learning

image-9199
image-9199

There are two fundamentally different and complementary ways of accelerating machine learning workloads:  By vertical scaling or scaling-up, where one adds more resources to a single machine  Or  2. By horizontal scaling or scaling-out, where one adds more nodes to the system But when it comes to the degree of distribution within a machine learning…

The post Top 11 Tools For Distributed Machine Learning appeared first on Analytics India Magazine.

11 Sep

Python Vs Scala: Which Language Is Best Suited For Data Analytics?

image-7047
image-7047

Python and Scala are two of the most popular languages used in data science and analytics. These languages provide great support in order to create efficient projects on emerging technologies. In this article, we list down the differences between these two popular languages. Python  Python continues to be the most popular language in the industry.…

The post Python Vs Scala: Which Language Is Best Suited For Data Analytics? appeared first on Analytics India Magazine.

04 Apr

4 Programming Languages Every Big Data Enthusiast Must Ace

image-3760
image-3760

Every programming language has its own set of features. If one wants to pioneer in any particular domain of technology, it is very crucial to have a strong command over any programming language. Coding can be said as the most initial and primary thing in a developer’s toolkit. It is widely known that Python is…

The post 4 Programming Languages Every Big Data Enthusiast Must Ace appeared first on Analytics India Magazine.

29 Mar

Top 10 Books For Learning Apache Spark

image-3627
image-3627

Apache Spark is one of the most active open-sourced big data projects. It is fast, flexible, and scalable, which makes it a very popular and useful project. In this article, we jot down the 10 best books to gain insights into this general-purpose cluster-computing framework. 1| Beginning Apache Spark 2: With Resilient Distributed Datasets, Spark…

The post Top 10 Books For Learning Apache Spark appeared first on Analytics India Magazine.

29 Mar

Top 10 Books For Learning Apache Spark

image-3628
image-3628

Apache Spark is one of the most active open-sourced big data projects. It is fast, flexible, and scalable, which makes it a very popular and useful project. In this article, we jot down the 10 best books to gain insights into this general-purpose cluster-computing framework. 1| Beginning Apache Spark 2: With Resilient Distributed Datasets, Spark…

The post Top 10 Books For Learning Apache Spark appeared first on Analytics India Magazine.

28 Mar

Apache Spark Turns 10: The Secret Sauce Behind One Of The World’s Most Popular Open Source Projects

image-3601
image-3601

It was the changing nature of big data technology and architectural models, that wrote the story for Hadoop. The infrastructure architecture moved towards edge computing, IoT and cloud computing and especially containers where the market is seeing an increase in Kuberenetes workload. With analytical and machine learning workloads increasing, there was an increased need for…

The post Apache Spark Turns 10: The Secret Sauce Behind One Of The World’s Most Popular Open Source Projects appeared first on Analytics India Magazine.