Data Science

r/datascience

Posted 23 hours ago

I wanted to contribute yet another post which is more on the cynical side regarding data science as an industry. I know that many people lurking here are trying to draw up pros and cons lists for going into the industry. This is a contribution to the cons column.

My current gripe with DS is that I have lost faith that the industry will ever be able to absorb data-driven decision making as a culture. For a long time, I thought it was a matter of improving my communication skills, creating explainers on how the models work, or just waiting for the world to 'catch up' to data science. These techniques were new and complex, after all - it would take some time for the industry to adjust, as a Gartner article might tell you. But those businesses which did adjust would do better over time, and the market would force others to compete.

This line of thinking completely falls apart once you look into the history of 'quantitative methods' in business decision making. DS is really just the latest in a long line of attempts at doing this stuff, including:

  • Quantitative Methods

  • Operations Research

  • Management Science (Rebranded Operations Research)

  • Business Intelligence

  • Data Mining

  • Business Analytics

All these fields are still around, of course. But they tend to occupy a particular niche, and their claims to radically transform the business world are gone. They aren't the "sexiest job of the 21st century". People have been trying to do this whole "Business, but with Models!" thing for years. But it never really caught on. Why?

DS is just hype, and the hype cycle for DS will implode and not recover. Or it will recover to the same level that these other techniques did.

Data Science isn't better than any of those other disciplines. Here is my response to some objections:

  • Maybe they weren't adding real business value? Crack open the average Operations Research / Management Science textbook and I guarantee you'll find problems which are more business-focused than anything you'll find on Towards Data Science or in a DS textbook. They developed remarkable models to deal with inventory problems, demand estimation, resource planning, scheduling problems, forecasting and insights gathering - and most of their models were even prescriptive and automated using optimization solvers.
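
For readers who have not seen one of these prescriptive models, here is a minimal sketch of a classic inventory example from the Operations Research literature, the economic order quantity (EOQ). It is an illustration only, not something from the post: the function name and all the numbers are made up, and the formula assumes constant demand and fixed costs.

```python
import math


def economic_order_quantity(annual_demand, order_cost, holding_cost):
    """Order size that minimises yearly ordering plus holding cost (classic EOQ).

    Assumes constant demand, a fixed cost per order placed, and a fixed
    cost to hold one unit in stock for a year.
    """
    return math.sqrt(2 * annual_demand * order_cost / holding_cost)


# Hypothetical inputs: 1200 units sold per year, 50 per order placed,
# 3 per unit per year to keep in stock.
batch_size = economic_order_quantity(1200, 50, 3)
orders_per_year = 1200 / batch_size
print(batch_size, orders_per_year)
```

The model is prescriptive in the sense the post describes: it outputs a decision (how much to order and how often) rather than only a forecast.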

Posted 1 day ago

I work for a mid-size company as a manager and generally conduct a couple of interviews each week. I am frankly exasperated by how shockingly little knowledge I see, even from folks who claim to have worked in the area for years and years.


  1. People write stuff like LSTM, NN, XGBoost etc. on their resumes but have zero idea of what a linear regression is or what p-values represent. In the last 10-20 interviews I conducted, not a single candidate could answer why we use the value of 0.05 as a cut-off (Spoiler - I would accept literally any answer, ranging from defending the 0.05 value to just saying that it's random.)

  2. Shocking logical skills. I tend to assume that people in this field would be at least somewhat competent in maths/logic. Apparently not - close to half the folks I interviewed can't tell me how many cubes of side 1 cm I need to build one of side 5 cm.

  3. Communication is exhausting - the words "explain/describe briefly" apparently don't mean shit - I must hear a story from their birth to the end of the universe if I accidentally ask an open-ended question.

  4. PowerPoint creation / creating synergy between teams doing data work is not data science - please don't waste people's time if that's what you have worked on, unless you are trying to switch career paths and are willing to start at the bottom.

  5. Everyone claims that they know "advanced Excel". Knowing how to open an Excel sheet and apply =SUM(?:?) is not advanced Excel - you had better be aware of stuff like offset / lookups / array formulas / user-defined functions / named ranges etc. if you claim to be advanced.

  6. There's a massive problem of not understanding the "why?" about anything - why did you replace your missing values with the median and not the mean? Why do you use the elbow method for choosing the number of clusters? What does a scatter plot tell you (hint - in any real-world data it doesn't tell you shit - I will fight anyone who claims otherwise.) - they know how to write the code for it, but have absolutely zero idea what's going on under the hood.
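
Two of the questions above have short, checkable answers. The sketch below is an illustration under stated assumptions, not the poster's own answer key: the function name and the income values are made up. It counts the unit cubes from item 2 and shows one common reason for the median choice in item 6, namely that a single extreme value pulls the mean far more than the median.

```python
import statistics


def unit_cubes_needed(side_cm: int) -> int:
    """Number of 1 cm cubes that fill a cube with the given side length."""
    # side_cm unit cubes fit along each of the three edges.
    return side_cm ** 3


print(unit_cubes_needed(5))

# Hypothetical skewed column (say, incomes in thousands) with one outlier.
incomes = [30, 32, 35, 38, 40, 1000]
mean_fill = statistics.mean(incomes)      # dragged upward by the 1000
median_fill = statistics.median(incomes)  # stays near the typical values
print(mean_fill, median_fill)
```

Filling a missing income with the mean here would impute a value larger than five of the six observed ones, which is the usual argument for the median on skewed data.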

There are many other frustrating things out there, but I just had to get this out quickly, having done 5 interviews in the last 5 days and wasted 5 hours of my life that I will never get back.


About Community

A place for data science practitioners and professionals to discuss and debate data science career questions.
Created Aug 6, 2011


