JiGLE is an open source java client available on the
Downloads Page, which with little effort you can get CSV data from. Not sure where the "can't get historical data" idea comes from, you will likely get that first in a dense area. There are some (rather high) limits to prevent continuous beating on the WiGLE servers, which a researcher is unlikely to hit.
If JiGLE can do this, great; can you point out where it can, I'm happy to RTFM? Feedback from a user trying to do just what you say - there is no apparent documentation that I can find for JiGLE other than the README.txt file, which doesn't mention anything about exporting or saving data to any formats, CSV or otherwise. The Menu options within the program don't appear to offer anything - file authenticates or connects, view... well, views, filter changes the display, preferences is blank and stats brings a popop with an overview. Help brings up a version number. Searching google for "JiGLE documentation", export, CSV and the like appears to show a lot about how to use it to import data. Is there a page or something that describes any of this?
Can't get historical data. OK - I live in seattle, WA. I type in WA into the search engine, and the 3rd entry that comes up is 06:24:36:AC:DC:15. The search tool tells me that the first time it's seen this MAC/ was 2010-06-13 22:12:01, and the last time was 2011-02-20 18:12:29. It then proceeds to list various bits of data about the network. But without a timestamp on the data, how am I supposed to know when the fields were populated, or if they've changed? For instance, take the channel - according to this, it's 8. Was this from 2010/6/13, 2011/2/20, or something inbetween? Has the channel changed during that time? Was it seen in every import of data in this area? What HW/SW tools were used to generate the data - different scanners have different behaviors.
The data is potentially incomplete, too - for instance on an iphone is has a "Confidence" field - is this in there anywhere, or has it been stripped and discarded?
It appears that an enormous amount of effort has been put into displaying the uploaded data to a certain type of user, and I applaud that. But I read that > 30m networks have been uploaded to your database, and I'd like to see the rest of it. Database size-wise that's a very small amount of data, and I could easily consume the entire amount - your comment about being unlikely to hit the limits is pretty laughable, I want more than what you have, not less.
I can appreciate concerns for hammering your systems being taxed or possible bandwidth issues or anything else. But you appear to have some useful data somewhere, and there are people out here who would like to use it in ways that you might not anticipate or think are useful. If you don't care to share it, that's your prerogative, but if not it'd be nice to have a policy or document that states such a thing. You might consider asking if others would like to mirror the data as well, that might relieve some of the stress on your own systems.
p.s. Your method of grabbing consolidated.db will not work on all macs. I'll write something about this in a separate post. You might consider having some additional forum categories, as it's not really a project suggestion bit... maybe have a tech forum place?