Stripping Zeros using all cores of the processor

bmann2000 · ‎03-20-2017

I have a machining task, the designs arrive as 2D arrays but I don't want to waste machine time by machining elements of zero depth. Stripping the zeros is slow, the operator may have to wait up to 30 minutes for LabVIEW to finish stripping the zeros. Can someone please show me how to use all 4 cores of the processor to speed up the process. VIs attached. I've included an example plot that shows desired input and output of the VI.

Ben · ‎03-20-2017

Start by making "IndexOfZeros.vi" re-entrant and try it again.

Ben

Retired Senior Automation Systems Architect with Data Science Automation LabVIEW Champion Knight of NI and Prepper LinkedIn Profile YouTube Channel

Hooovahh · ‎03-20-2017

Not having terminals on the root of the diagram can cause inefficiencies.

Having the VI not be reentrant can be limiting.

Disabling automatic error handling and debugging can help performance.

I also made an improvement where if the line is all zeros then there is no need to perform search on the array that is relatively slow. Attached is an example that does a speed test on the original and my method which is about 5-8 times faster with these changes. What is considered slow? Even the original method is usually less than 1ms on my machine.

Unofficial Forum Rules and Guidelines
Get going with G! - LabVIEW Wiki.

16 Part Blog on Automotive CAN bus. - Hooovahh - LabVIEW Overlord

altenbach · ‎03-20-2017

What are typical sizes of the 2D array? (30 minutes sound way excessive for the current 60x200 input!)
Why is the array double if it is only black and white? U8 would be sufficient! (8x less memory!)
What is the CPU use (by LabVIEW) during these 30 minutes?
Parallelization will only give you probably less than a factor of four, Maybe tweak the algorithm instead?
What is your processor?
How much RAM do you have?
Is this LabVIEW 32 or 64 bit?
Did you do some benchmarking analysis, e.g. what part is the slow step?
Why do you have debugging enabled? You should inline the subVI.
Make sure that the front panel of the subVI is closed when testing.
Your example 2D output array has exactly the same size as the input array, so you should be able to operate "in place" No resizing needed. All it does is shift rows to the left, which can be done in-place.

LabVIEW Champion.

bmann2000 · ‎03-21-2017

thanks for the replies and speed pointers, I used Hoovaah's code with a large circular array to benchmark zero-stripping of a large file. I had incorrectly thought the zero stripping was the time consuming VI, it was actually the file write and indicators on the front panel.

I replaced my file write scheme using the basic blocks of the WriteToSpreadsheetFile.vi as an example.

In the end I read in a 2D array of 221,893KB from file, stripped and saved back to file in 55 seconds, which would be fine for the very largest of files.

altenbach · ‎03-21-2017

@bmann2000 wrote:

thanks for the replies and speed pointers, I used Hoovaah's code with a large circular array to benchmark zero-stripping of a large file. I had incorrectly thought the zero stripping was the time consuming VI, it was actually the file write and indicators on the front panel.

I replaced my file write scheme using the basic blocks of the WriteToSpreadsheetFile.vi as an example.

If speed matters, use binary files instead. Orders of magnitude faster than scanning and formatting numbers as text.
How many values are possible as array element. Is DBL really needed? If writing formatted values, make sure to adjust the format code to limit the number of characters. (For integers, you can even omit the decimal point).
It is completely useless to use array subset inside the FOR loop, because all short arrays get padded again at the output. LabVIEW does not support ragged 2D arrays! All you are doing is create many steps where memory allocations need to occur. Ultimately, the array being built in the shift register needs to be resized whenever a new, longer row is found. Completely silly and inefficient! (you are very lucky that the first row is full size!)
All you need to do for each row is find the first nonzero element and rotate the array accordingly. This can be done in-place.
Ideally the machining code should be smart enough to start at the first nonzero element so that no pre-processing is even needed.
Please rethink this from scratch. Currently, you are just digging yourself into a deeper hole.

LabVIEW Champion.

altenbach · ‎03-21-2017

@altenbach wrote:

All you need to do for each row is find the first nonzero element and rotate the array accordingly. This can be done in-place.

For example, the following code gives the same result (on the sample array) and uses significantly less code (where bugs can hide!) than Brian's. (I am sure it can be improved further!). Since the first nonzero element occurs relatively early, it is not necessary to allocate that large boolean array and do many more comparisons with zero. (yes, the result will be slightly different if all rows are shorter).

LabVIEW Champion.

bmann2000 · ‎03-22-2017

The attached VI shows more clearly the problem, which I think we've solved. 20MB version of file attached due to upload limit. The last time I stripped zeros in a project it was minutes worth of 2GHz multi-channel sampling, so this time I incorrectly though the zero strip was the issue. The attached code demonstrates that file write text versus binary is the factor that slows the code down, in this section anyway, I have lots of other 2D array processing going on elsewhere in the application that may become the topic of another post.

Altenbach has touched on the bigger question of should I be doing this in the first place. I'm currently rewriting an application that hangs up on an out-of-memory error when processing a batch of large files typical of those in the example code. A large input file is typically 200MB of text which gets converted to DBL when read. Each line of the 2D array represents a line to be machined but it must be combined with transformation matrices, motion parameters, calibrations, timing triggers etc to produce machining data that can be fed to the hardware that performs the machining. In the old application that runs out of memory, the process of converting the 200MB of depth information to machining data causes the array to be copied in memory many times leading to the out of memory error. It also wastes time by always machining a rectangle even if the part is circular.

To get round these two limitations, I'm rewriting the application. Instead of manipulating a huge 2D array, I plan to process a line-at-a-time, buffering each line from file during the machining process. As each line is a different length, I figure that processing a line-at-a-time from file is preferable to using a cluster of arrays.

I also figure that the resulting text sequencer file will be easy to debug prior to run-time, then when the application is fully debugged, I can switch the file type to binary to speed up the pre-processing step.

altenbach · ‎03-22-2017

You don't need to set the file position over and over, the default is to start writing at the current position.
Writing binary will not work, because there are no delimiters and there is no way to tell when one row ends and the next starts (the have unequal length after the subsetting).
After he initial read, taking the subset is probably simpler than deleting twice from the array

LabVIEW Champion.

bmann2000 · ‎03-23-2017

Okay thanks, so when I switch over to binary after debug, I should zero pad to make each line the same length. e.g. Integer_LineLength,deptht0,depth1...depthN,pad0,pad0,pad0.

The design should stick with 2D arrays but ignore the zeros on the end. It still makes sense to get rid of the zeros at the start of the line and move them all to the end, otherwise it gets complicated as I'm calculating the start of line coordinates, run-in-distance, acceleration etc based on the coordinates of the first non-zero depth in the array. I don't want to deal with the XY coordinates of the zero-depth locations.

LabVIEW

Stripping Zeros using all cores of the processor

Stripping Zeros using all cores of the processor

Re: Stripping Zeros using all cores of the processor

Re: Stripping Zeros using all cores of the processor

Re: Stripping Zeros using all cores of the processor

Re: Stripping Zeros using all cores of the processor

Re: Stripping Zeros using all cores of the processor

Re: Stripping Zeros using all cores of the processor

Re: Stripping Zeros using all cores of the processor

Re: Stripping Zeros using all cores of the processor

Re: Stripping Zeros using all cores of the processor