Python question list

  1. if we have probability for each value, how to see the value’s distribution? We can use histgram to see the possible pdf (probability density function) overall. Then using KDE (Kernel Density Estimation) to fit pdf. Reference as follows. https://jakevdp.github.io/blog/2013/12/01/kernel-density-estimation/
  2. Do you use random forest? If so, what’s entropy?
    To measure the quality of a split:
    Gini impurity.
    Entropy for the information gain. Need to explain it in detail.
  3. If there is a .csv file, and I want the results to be a table like this (example given), what kind of data structure you would like to use to store the data?

    NA

Leave a Comment