Posts about Apache Spark
![Subreddit Icon](http://web.archive.org./web/20230306222111im_/https://styles.redditmedia.com/t5_2qh37/styles/communityIcon_kge80txqeqf11.png)
![Subreddit Icon](http://web.archive.org./web/20230306222111im_/https://styles.redditmedia.com/t5_36en4/styles/communityIcon_t74nv7kttaz61.png)
![Subreddit Icon](http://web.archive.org./web/20230306222111im_/https://styles.redditmedia.com/t5_2sptq/styles/communityIcon_fdj8zurrifa71.png)
![Subreddit Icon](http://web.archive.org./web/20230306222111im_/https://styles.redditmedia.com/t5_2fwo/styles/communityIcon_1bqa1ibfp8q11.png)
![Subreddit Icon](http://web.archive.org./web/20230306222111im_/https://styles.redditmedia.com/t5_2r3gv/styles/communityIcon_kilpomt3l5c51.png)
![Subreddit Icon](http://web.archive.org./web/20230306222111im_/https://styles.redditmedia.com/t5_2qh0y/styles/communityIcon_h9cdwd9m75a51.png)
![Subreddit Icon](http://web.archive.org./web/20230306222111im_/https://styles.redditmedia.com/t5_2qh84/styles/communityIcon_pc026nky6a221.png)
![](http://web.archive.org./web/20230306222111im_/https://www.redditstatic.com/desktop2x/img/renderTimingPixel.png)
I am checking stack overflow survey 2022 (https://survey.stackoverflow.co/2022/#top-paying-technologies-other-frameworks-and-libraries), and I see that apache spark is the highest paying framework under other frameworks category. I want to upskill myself and to be more demanded in job market. So is it worth learning Apache Spark (PySpark) in 2023?
The awesome-spark repo has a list of Spark OSS libraries, but a lot of them are quite old.
I am thinking about curating another list that's a bit more focused and updated. For example, I'm interested in knowing all the good data validation libraries that are currently being maintained for PySpark right now. It's hard to figure out the best options.
Feel free to add the libraries you like a lot in the comments and I'll try to collate a list.