central limit theorem

ahmedhamdy2010 · ‎04-08-2015

hi all,

i have made a small VI to apply the central limit theorem, it works very good except that it is slow, i know i can use a more powerful computer and it will be faster, but i want to optimize this vi for minimum time.

anyone can help?

what i do in this vi: 1-generate N (usualy 30) unigue random positive integer numbers

2-i use this 30 integers as indices to select 30 random elements from an array X (which has at least 40 samples)

3- i get the average of this 30 randomly selected elements from array X

4- i repeat the previous 3 steps at least 10^6 times.

5- i draw a histogram.

is there a way to make this vi faster (as possible) , even if i have to change the algorithm, as long as it does the same.

thank you

Oligarlicky · ‎04-08-2015

I couldn't quite figure out what your while loop was doing. Here's a stab at it using the sort function on an array of clusters which sorts on the first element.

ahmedhamdy2010 · ‎04-08-2015

thanks for your help, but still something is missing, the output is not gaussian as expected

i generate 30 random numbers that must be unique, so, i generated just 30 random numbers, then i removed the duplicates, therefore the output is less than 30 random numbors, that is why i used a while loop to fill the array with extral elements to be exactly 30 random uniqe integers.

because i want to extract 30 different but not repeated elements from array X and average them

thanks 🙂

altenbach · ‎04-08-2015

Here's what I would do.

For 1M samples, it takes abour 2s on my computer.

If you parallelize the outer loop, it will be proportionally faster (~0.15s total on my 16 core dual Xeon :D)

Code assumes that the input array is >30 and that you want to pick 30 random elements. Change to diagram constant wired to N of the inner loop to change the pick size (or change to a control on the toplevel diagram).

LabVIEW Champion.

ahmedhamdy2010 · ‎04-08-2015

@altenbach wrote:

Here's what I would do.

For 1M samples, it takes abour 2s on my computer.

If you parallelize the outer loop, it will be proportionally faster (~0.15s total on my 16 core dual Xeon :D)

Code assumes that the input array is >30 and that you want to pick 30 random elements. Change to diagram constant wired to N of the inner loop to change the pick size (or change to a control on the toplevel diagram).

thank you very much altenbach, this is just perfect 🙂

altenbach · ‎04-08-2015

Alternatvely, you could just shuffle the orange array and use the "riffled index" output. Seems to be about the same speed (even though we riffle a larger datatype). Less code. 😄

(There are a couple of other little tweaks that can gain you another 15% or so. Divide the raw array by 30 and just sum in the loop (instead of taking the mean). inline riffle (make a copy first!), etc.)

LabVIEW Champion.

JÞB · ‎04-09-2015

@altenbach wrote:

Alternatvely, you could just shuffle the orange array and use the "riffled index" output. Seems to be about the same speed (even though we riffle a larger datatype). Less code. 😄

(There are a couple of other little tweaks that can gain you another 15% or so. Divide the raw array by 30 and just sum in the loop (instead of taking the mean). inline riffle (make a copy first!), etc.)

Christian, I think you are hitting a optomization with that.... The mean is the mean regardless of order of the array elements. I would suspect the inner loop is absent in the compiled object and the array is simply sliced into sections.

(Didn't I learn a sorted riffled array is insane a while back?) That whole pallete is suspicious in benchmarks and scares me a little with a smart compiler.

"Should be" isn't "Is" -Jay

Oligarlicky · ‎04-09-2015

Of course, if we're riffling the double, we might as well get rid of the for loop. It's no faster but it's kinda nice and compact.

altenbach · ‎04-09-2015

Right. Of course I still feel it's slightly more expensive to riffle DBL instead of I32. 😉

LabVIEW Champion.

altenbach · ‎04-09-2015

Just tested again, and riffling the I32 array followed by an inner FOR loop is about twice as fast when parallelized (0.15s vs 0.3s) compared to truncating the riffled DBL array. YMMV. 😉

LabVIEW Champion.

LabVIEW

central limit theorem

central limit theorem

Re: central limit theorem

Re: central limit theorem

Re: central limit theorem

Re: central limit theorem

Re: central limit theorem

Re: central limit theorem

Re: central limit theorem

Re: central limit theorem

Re: central limit theorem