06-13-2009 09:11 PM
Hi all,
I want to process acquired data on the fly, and here is the question. I have a 1D array and I want to get the INDICES of the 40 largest values. I can sort the array to get the 40 largest values and then search for their corresponding indices, but the array is very large and I don't think that is an efficient method. So, is there a better idea? Thank you.
Best wishes,
Bo
06-13-2009 10:03 PM
Take a look at the attached code....
06-13-2009 10:19 PM - edited 06-13-2009 10:23 PM
Here's a solution that gives the indices of the 40 largest elements.
It can still be improved a little bit, but is certainly faster than your search approach. How big is your array?
(the "solution" in the post above gives the values of the largest 39 elements (not indices!), something completely different)
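The VIs in this thread aren't reproduced here, but the sort-based approach above can be sketched in text. This is a rough Python/NumPy illustration of the same idea (the function name and use of NumPy are mine, for illustration only): sort the indices by value, then take the last 40.

```python
import numpy as np

def top_k_indices_sort(data, k=40):
    """Return the indices of the k largest elements via a full sort.
    Simple, but it sorts the whole array: O(n log n)."""
    order = np.argsort(data)   # indices that sort the array ascending
    return order[-k:][::-1]    # indices of the k largest, biggest first

rng = np.random.default_rng(0)
arr = rng.random(1_000_000)
idx = top_k_indices_sort(arr, 40)   # 40 indices; arr[idx] are the top values
```

Note this is a sketch of the concept, not the posted VI; in LabVIEW the equivalent is sorting an array of (value, index) pairs.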
06-13-2009 11:19 PM - last edited on 06-15-2009 11:35 AM by Support
altenbach wrote:It can still be improved a little bit, ....
OK, here are the improvements I made on a quick hunch.
For a 10M array of random data it is about 60x faster (100ms vs 7000ms) than my simple solution above. (it's about 30x faster for a 1M array).
Basically, here we only keep the top 40 elements in an array and keep the currently smallest of those elements in a shift register. If a new element is larger than the currently smallest element, we replace the smallest element with the new element, sort the top-40 array, and tag the new smallest element. If the new element is smaller, we don't do anything.
You have to be careful, though, because the algorithm is now data dependent and can be slow for certain pathological data.
For example, if the input is a perfectly sorted array the penalty is very big, while for a reverse-sorted array the algorithm is extremely fast. For random input arrays, the chance of finding a new top-40 element shrinks very quickly, and we win because we mostly execute the empty case.
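Since the VI itself isn't shown inline, here is a rough Python sketch of the shift-register idea described above (names and data layout are my own assumptions, not the actual VI): keep ascending top-40 value and index arrays, and only re-sort when a new element beats the current smallest.

```python
import numpy as np

def top_k_indices_stream(data, k=40):
    """One pass over the data, tracking the k largest (value, index) pairs.
    Only re-sorts the small top-k buffer when a new element beats the
    currently smallest kept value -- the empty 'else' branch is the
    common case for random data."""
    top_vals = np.full(k, -np.inf)  # kept values, ascending; start at -inf
    top_idx = np.full(k, -1)        # matching indices; -1 = not yet filled
    smallest = -np.inf              # currently smallest kept value
    for i, x in enumerate(data):
        if x > smallest:
            top_vals[0] = x         # overwrite the smallest entry
            top_idx[0] = i
            order = np.argsort(top_vals, kind="stable")  # re-sort top k
            top_vals = top_vals[order]
            top_idx = top_idx[order]
            smallest = top_vals[0]  # tag the new smallest element
        # else: do nothing
    return top_idx[::-1]            # indices, largest value first
```

As in the VI, a perfectly ascending input hits the sort branch on every element (worst case), while a descending input almost never does.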
Modify as needed. You need to test on your typical data to make sure it is suitable.
I am sure there are even more optimized possibilities, anyone want to try? 😄
06-14-2009 11:05 AM
Hi Jaime,
Thank you for your reply, but the result was the 40 largest values, not the indices.
Regards,
Bo
06-14-2009 11:09 AM
Hi altenbach,
Thank you for your replies. Actually, I got your previous solution from someone else, but I'm sure your later one is much better, and I'll try that. Thank you for your help!!
Best wishes,
Bo
06-14-2009 11:13 AM - edited 06-14-2009 11:20 AM
If you are looking for trivially simple code, here's yet another alternative. For a 10M array it's about 30x slower than the optimized solution but about twice as fast as my first solution, so it is certainly not competitive. It is probably sufficient for small datasets.
It just shows that there are quite a few ways to do all this. 😉
(slow, but simple!)
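The "trivially simple" VI isn't shown, but one plausible reading of a slow-but-simple LabVIEW approach (this is my guess at the technique, not the actual attachment) is to find the maximum k times, blanking out each winner:

```python
import numpy as np

def top_k_indices_simple(data, k=40):
    """Find the maximum k times, removing each winner from contention.
    O(n*k): trivially simple, fine for small data, slow for 10M elements."""
    work = np.asarray(data, dtype=float).copy()  # don't clobber the input
    out = []
    for _ in range(k):
        i = int(np.argmax(work))  # index of the current maximum
        out.append(i)
        work[i] = -np.inf         # blank it out so the next max is found
    return out
```

For ties, `argmax` returns the first occurrence, so duplicates each get their own index.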
Maybe we should do a challenge competition. Who can make the fastest solution? 😄
06-14-2009 11:36 AM
06-14-2009 11:56 AM
Coq rouge wrote:
I was just wondering: how will your largest40IndicesFaster.vi cope with several equal numbers in the array? Will it not fail in such cases? I think your last suggestion will cope better with this situation.
Well, you can always try, but I don't think it will fail. The array is initialized with all -inf, and the smallest element will remain -inf until the top-40 array is filled with real data. For example, if you create duplicates by appending the input array to itself, all solutions will still give the same result.
The results differ slightly if the input has fewer than 40 elements, because some solutions will be padded to 40 elements with -1 values.
The results may also differ for bad data, e.g. if the input contains NaN.
06-14-2009 12:08 PM
Well, perhaps I am wrong ;)