Distribution of a list of numbers

asked 2012-02-03 13:03:20 +0200

updated 2014-10-28 21:15:16 +0200

kcrisman

Hello! I'd like to know if there is a Sage function to retrieve a distribution of some experimental data. I have a 1D list of measured values and I need to obtain a second list containing the distribution of that data, for example, to fit it with some Gaussian or Poisson curve.

Thanks.

3 Answers

answered 2012-02-04 09:20:42 +0200

Jason Grout

The numpy histogram function will bin values together: http://docs.scipy.org/doc/numpy/refer...

The Sage Timeseries histogram function also bins data: http://www.sagemath.org/doc/reference...
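As a rough illustration of the NumPy route, here is a minimal sketch. The data is randomly generated as a stand-in for real measurements, and the names `data_list`, `centers` and `distribution` are only illustrative, not anything from the question.

```python
import numpy as np

# Hypothetical sample data standing in for the measured values
rng = np.random.default_rng(0)
data_list = rng.normal(loc=200.0, scale=25.0, size=1000)

# counts[i] is the number of values in [edges[i], edges[i+1]);
# the last bin also includes its right edge
counts, edges = np.histogram(data_list, bins=20)

# Midpoint of each bin, convenient as x-values for plotting or fitting
centers = 0.5 * (edges[:-1] + edges[1:])

# A plain list of (bin center, count) pairs
distribution = list(zip(centers.tolist(), counts.tolist()))
```

Note that `numpy.histogram` returns one more edge than it returns counts, which is why the centers are computed from neighbouring edges.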

answered 2012-02-03 17:37:23 +0200

Shashank

updated 2012-02-04 14:02:19 +0200

I use matplotlib for that purpose. Have a look at the first example on the page

http://matplotlib.sourceforge.net/exa...

Edit: To get a list use

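import numpy as np
import pylab as P
from scipy.stats import norm

# Sample data: 10000 draws from a normal distribution
mu, sigma = 200, 25
x = mu + sigma*P.randn(10000)

# hist() draws the plot and also returns the bin heights (n) and bin edges (bins).
# density=True replaces normed=1, which recent matplotlib no longer accepts.
n, bins, patches = P.hist(x, 50, density=True, histtype='stepfilled')
P.setp(patches, 'facecolor', 'g', 'alpha', 0.75)

# Overlay the normal density; norm.pdf stands in for P.normpdf,
# which is not available in recent matplotlib.
y = norm.pdf(bins, mu, sigma)
l = P.plot(bins, y, 'k--', linewidth=1.5)
print(bins)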

Comments

Oh, I see. Indeed, it may be a good solution. But what do you import Numpy for in this case? Thanks!

v_2e ( 2012-02-05 07:42:59 +0200 )edit

Sorry, I edited an example file from the net and forgot to delete that line.

Shashank ( 2012-02-05 13:44:03 +0200 )edit

answered 2012-02-04 08:03:23 +0200

v_2e

I also used the hist() function from Matplotlib, but does it allow storing the distribution in a list? The problem is that I need to fit this distribution, not only visualize it.

I have found a way to achieve approximately what I need using the histogram() function from the scipy.stats module.

Something like this:

from scipy.stats import histogram
distribution_list = histogram(data_list, numbins=10, defaultlimits=(90,164))

It gives the values for the bins, the first bin start and the bin width. To obtain the distribution list with the actual values for further work (like fitting, plotting, adding to something, etc.) one can do something like this:

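from scipy.stats import histogram
distribution_list = histogram(data_list, numbins=10, defaultlimits=(90,164))

# distribution_list[0]: counts per bin, [1]: start of the first bin, [2]: bin width
counts, first_bin_start, bin_width = distribution_list[:3]

# Pair each bin's starting value with its count
actual_distribution = []
for i, count in enumerate(counts):
    actual_distribution.append((first_bin_start + i*bin_width, count))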
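Since `scipy.stats.histogram` is no longer available in recent SciPy releases, the same binning can be sketched with `numpy.histogram`. This is only an approximate equivalent: `range=(90, 164)` is assumed to play the role of `defaultlimits` and `bins=10` the role of `numbins`, and the sample values below are made up for illustration.

```python
import numpy as np

# Hypothetical measurements; replace with the real data_list
data_list = [95.2, 101.7, 110.3, 118.0, 121.4, 127.9, 133.5, 140.1, 152.6, 160.8]

# Ten equal-width bins spanning the interval from 90 to 164
counts, edges = np.histogram(data_list, bins=10, range=(90, 164))

# Pair each bin's starting value with its count, as the loop above does
actual_distribution = list(zip(edges[:-1].tolist(), counts.tolist()))
```

Here the bin starts come directly from the returned edges, so no manual counter or bin-width arithmetic is needed.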

But maybe there are other (simpler, built-in or smarter) ways to get a distribution out of a list of numbers?

Thanks.
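For the fitting step mentioned in the question, one possible approach is to fit a Gaussian to the binned counts with `scipy.optimize.curve_fit`. The following is a minimal sketch under assumptions: the data is synthetic, the model function `gaussian` and the choice of 30 bins are illustrative, and an unweighted least-squares fit on histogram counts is only one of several ways to do this.

```python
import numpy as np
from scipy.optimize import curve_fit

def gaussian(x, amplitude, mean, sigma):
    """Unnormalized Gaussian curve used as the fit model."""
    return amplitude * np.exp(-0.5 * ((x - mean) / sigma) ** 2)

# Hypothetical data standing in for the measured values
rng = np.random.default_rng(1)
data_list = rng.normal(loc=127.0, scale=10.0, size=5000)

# Bin the data and use the bin centers as x-values for the fit
counts, edges = np.histogram(data_list, bins=30)
centers = 0.5 * (edges[:-1] + edges[1:])

# Starting guesses taken from the data itself
p0 = [counts.max(), data_list.mean(), data_list.std()]
params, covariance = curve_fit(gaussian, centers, counts, p0=p0)
amplitude, mean, sigma = params
```

The fitted `mean` and `sigma` can then be compared with the sample mean and standard deviation as a sanity check.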
