此去经年应是良辰...吧 · Followers: 27 · Posts: 848

Re: Gonna go through data scientist interview problems today


Q13. What is the difference between "long" ("tall") and "wide" format data?
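
Not an official answer, just a minimal sketch of how I'd illustrate this one in plain Python. The `height`/`weight` columns and the helper names `wide_to_long` and `long_to_wide` are made up for the example. Wide format keeps one row per subject with one column per variable; long format keeps one row per (subject, variable) pair.

```python
# Hypothetical data in wide format: one row per subject, one column per variable.
wide = [
    {"id": 1, "height": 170, "weight": 65},
    {"id": 2, "height": 182, "weight": 80},
]


def wide_to_long(rows, id_key):
    """Melt: emit one (id, variable, value) record per measurement."""
    return [
        {id_key: row[id_key], "variable": name, "value": value}
        for row in rows
        for name, value in row.items()
        if name != id_key
    ]


def long_to_wide(records, id_key):
    """Pivot: gather each id's records back into a single row."""
    by_id = {}
    for rec in records:
        row = by_id.setdefault(rec[id_key], {id_key: rec[id_key]})
        row[rec["variable"]] = rec["value"]
    return list(by_id.values())


long_rows = wide_to_long(wide, "id")
for rec in long_rows:
    print(rec)
```

Two subjects with two variables each give four long rows, and `long_to_wide(long_rows, "id")` rebuilds the original table.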


Floor 16, 2016-07-21 03:37
    Q14. What method do you use to determine whether the statistics published in an article (or that appeared in a newspaper or other media) are either wrong or presented to support the author's point of view, rather than being correct, comprehensive factual information on a specific subject?


    Floor 17, 2016-07-21 03:39
      Q15. Explain Edward Tufte's concept of "chart junk."


      Floor 18, 2016-07-21 03:39
        Q16. How would you screen for outliers and what should you do if you find one?
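
One possible way to do the screening part, as a rough sketch rather than the expected answer: Tukey's IQR fences using only the standard library. The function name `iqr_outliers`, the multiplier `k=1.5`, and the sample numbers are my own assumptions. Flagging a point does not mean deleting it; what to do next depends on whether it is a data-entry error or a real observation.

```python
import statistics


def iqr_outliers(values, k=1.5):
    """Return the values lying outside [Q1 - k*IQR, Q3 + k*IQR]."""
    # "inclusive" treats the data as the whole population of interest.
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


# Hypothetical sample with one suspicious reading.
sample = [10, 12, 11, 13, 12, 11, 95]
flagged = iqr_outliers(sample)
print(flagged)
```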


        Floor 19, 2016-07-21 03:39
          Q17. How would you use extreme value theory, Monte Carlo simulations, or mathematical statistics (or anything else) to correctly estimate the chance of a very rare event?


          Floor 20, 2016-07-21 03:40
            Q18. What is a recommendation engine? How does it work?


            Floor 21, 2016-07-21 03:40
              Q19. Explain what a false positive and a false negative are. Why is it important to differentiate these from each other?
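
To make the two terms concrete, here is a small sketch with made-up labels (the lists `actual` and `predicted` and the helper `confusion_counts` are hypothetical). A false positive is a predicted "yes" where the truth is "no"; a false negative is a predicted "no" where the truth is "yes". They usually carry different costs, which is why they are counted separately.

```python
def confusion_counts(actual, predicted):
    """Count true/false positives and negatives for boolean labels."""
    counts = {"tp": 0, "fp": 0, "tn": 0, "fn": 0}
    for a, p in zip(actual, predicted):
        if p and a:
            counts["tp"] += 1  # predicted positive, really positive
        elif p and not a:
            counts["fp"] += 1  # false alarm
        elif not p and a:
            counts["fn"] += 1  # missed case
        else:
            counts["tn"] += 1  # predicted negative, really negative
    return counts


# Hypothetical ground truth and classifier output.
actual = [True, True, False, False, True, False]
predicted = [True, False, True, False, True, False]

counts = confusion_counts(actual, predicted)
precision = counts["tp"] / (counts["tp"] + counts["fp"])
recall = counts["tp"] / (counts["tp"] + counts["fn"])
print(counts, precision, recall)
```

Precision drops as false positives grow and recall drops as false negatives grow, so the two error types pull on different metrics.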


              Floor 22, 2016-07-21 03:40
                Q20. Which tools do you use for visualization? What do you think of Tableau, R, or SAS (for graphs)? How would you efficiently represent five dimensions in a chart (or in a video)?


                Floor 23, 2016-07-21 03:41
                  t3 get the answers.


                  Floor 24, 2016-07-21 04:41