Recursive Spark SQL

Spark is an efficient and easy-to-use framework for solving big data problems, and its Dataframe API allows solutions to be formatted in a familiar SQL syntax. However, the Dataframe API does not support the recursive SQL functionality, which is often used for a class of graph and tree algorithms. Let’s look at an example problem and how we might solve it using recursive SQL, and then think about how to translate this into a Spark program....

January 4, 2023 · 9 min · 1781 words · Bradley Stevanus