top | item 40004882

(no title)

bagrow | 1 year ago

The best way to compute the empirical CDF (ECDF) is by sorting the data:

    N = len(data)
    X = sorted(data)
    Y = np.arange(N)/N
    plt.plot(X,Y)
Technically, you should plot this with `plt.step`.

discuss

order

andrewla|1 year ago

scipy even has a built-in method (scipy.stats.ecdf) for doing exactly this.

vvanirudh|1 year ago

Neat! That is so simple and in hindsight, makes a lot of sense. Thanks!