07-31-2007 03:55 PM
07-31-2007 04:10 PM - edited 07-31-2007 04:10 PM
Message Edited by Ravens Fan on 07-31-2007 05:10 PM
07-31-2007 04:17 PM
07-31-2007 08:10 PM
@Frank Rizzo wrote:
Well I'm sitting in Cleveland Ohio right now so I am going to have to disregard your message.....hahhahaha...\
I'll be curious to see if Jamal Lewis's yardage improves for you guys this year after having some definite dropoffs with us the last couple.
Though now that he's not playing against you, your defense's numbers against the run should improve.
I'd be surprised if you could get that data to the researchers at all using Excel. It has a limit of 256 columns and 65536 rows. If you had a column per channel and a row per data point, you'd be talking 315 columns and 312,000 rows for 4 minutes of data. I guess you could always break it up into several files being sure to leave some rows and columns to give them a chance to do some data calculation.
Out of curiosity, I created a spread sheet where every cell was filled with a 1. It took a good 30 seconds to save and was over 100 MB. That was probably about 1/10 of the amount of data your dealing with. And going back to my earlier calculations, I would guess that as a text file, that much data would need about 10 bytes per value thus getting you to about 1 GB in text files.
I would ask them what kind of software they will use to analyze these files and what format they would prefer it in. There are really only 2 ways to get it to them, either ASCII text file which could be very large, but would be the most flexible to manipulate. Or a binary file, which would be smaller, but there could be a conflict if they don't interpret the file the same way you write it out.
I haven't used a TDM file add-in for Excel before. So I don't know how powerful it is, or if there is a lot of overhead involved that would make it take 3 hours. What if you create multiple TDM files? Let's say you break it down by bunches of channels and only 20-30 seconds of data at a time. Something that would keep the size of the data array within the row and column limits of Excel. (I am guessing about 10 files). Would each file go into Excel so much faster with that add-in that even if you need to do it 10 times, it would still be far quicker than one large file? I am wondering if the add-on is spending a lot of time figuring out how to break down the large dataset on its own.
The only other question. Have you tried the TDMS files as opposed to the TDM files? I know they are an upgrade and are supposed to work better with streaming data. I wonder if they have any improvements for larger datasets.
07-31-2007 08:44 PM
07-31-2007 08:48 PM - edited 07-31-2007 08:48 PM
Message Edited by Underflow on 07-31-2007 07:52 PM
07-31-2007 09:08 PM
@Underflow wrote:
Hi Ravens Fan,
To the horror of programmers everywhere, Excel 2007 now supports something like 64k columns and 1M rows.
We've actually kept that to ourselves where I work. We already get spreadsheets with 20 tabs, 255 columns, and 50k rows. Extending that further is... painful to contemplate.
Joe Z.
Message Edited by Underflow on 07-31-2007 07:52 PM
I did not realize that. Honestly, I didn't even realize there was an Excel 2007 out. I think it is long overdue in terms of expanding the worksheet space, but you're right, that can cause a lot of headaches. Like they say, trash will expand to fill the amount of available space. Our company has just about finished upgrading all but the oldest, lowest denominator PC's from Win2000 to XP. And the office package I'm running has Excel 2002. It will probably be another year or two before they start installing Vista on the newest of PC's. Office packages will probably be upgrading about the same time.
I realize I made an error in my earlier statement. I said that double precision would be 4 bytes per value, but it is actually 8 bytes, so all my calculations would be twice as large. They would be accurate for single precision numbers. But I was just trying to put a sense of scale to the massive amount of data that was being generated.
08-01-2007 02:04 AM
08-01-2007 08:55 AM
Thanks to everyone for all the good information. I'll let you know what I end up doing. If there are any other ideas or past experiences anyone has to share I would welcome them. Thanks!
08-01-2007 09:36 AM