Andy Cotgreave built an amazing Tableau workbook about seasonality in US road fatalities, and it is located here:
It is a wonderful visualization, which is no surprise since he’s a Zen Master. In addition, he went into detail about his design choices. But how did he get all that data (or rather, how can you get it)? Well, the US has several great government resources for various pieces of data. The National Highway Traffic Safety Administration (NHTSA) has a reporting system called FARS, which contains a wealth of data. FARS is short for exactly what it is: Fatality Analysis Reporting System. I would like to personally call out this administration for having an extraordinarily clear name for something. I mean, even if you Googled something approximating those words, you would probably find it.
KNIME came out with some new features, and several of them relate to grabbing files. You no longer have to program this in Java or Python; KNIME now offers an easy way to do it. What I like most about KNIME is that it is essentially self-documenting. The data flow clearly shows me what is happening, and it leaves room for some annotation.
The settings of the nodes used:
Node 63: Change directory to /FARS/
Node 64: This extracts the Uniform Resource Identifier (URI) information. Just make sure “Path” is checked.
Node 65: This String Manipulation node uses the expression string($Path$+"/DBF/"), which simply appends a second subdirectory that is thankfully consistent throughout all files.
Node 67: No changes.
Node 61: Download. Show the variable ports, then feed the directory you constructed with the nodes above into the source file or folder setting. Change your target folder to wherever you want the data to be downloaded. THIS IS VERY IMPORTANT: do not download all files, as it is a waste to grab things you don’t need. Make sure you check “Only download files that match”, enter F*.zip as the pattern, and choose the wildcard option. This will limit the number of downloaded files. Also, if you are only interested in a subset of the years, add a second String Manipulation node with substr($Path$, 6, 4) to pull the year out of the path, followed by a Row Filter node to remove unneeded folders from the list.
Node 68: No changes.
“1 Row Test Case”: This node can replace node 67 if you don’t want a loop and just want to download one at a time.
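If it helps to see the string handling outside of KNIME, here is a rough Python sketch of what the String Manipulation, Row Filter, and wildcard settings do. It does not download anything and is not how KNIME implements it. The function names and the sample folder and file names are made up for illustration, and it assumes the listed paths look like /FARS/ followed by a four-digit year, which is what the substr offsets above suggest.

```python
from fnmatch import fnmatchcase


def year_of(path: str) -> str:
    # Same idea as substr($Path$, 6, 4): skip the 6 characters of "/FARS/"
    # and keep the next 4 (assumed to be the year).
    return path[6:10]


def dbf_dir(path: str) -> str:
    # Same idea as string($Path$ + "/DBF/"): append the second subdirectory.
    return path + "/DBF/"


def select_dbf_dirs(paths, wanted_years):
    # Stand-in for the Row Filter: keep only folders for the years you want,
    # then build the directory that would be fed to the Download node.
    return [dbf_dir(p) for p in paths if year_of(p) in wanted_years]


def should_download(filename: str, pattern: str = "F*.zip") -> bool:
    # Stand-in for "Only download files that match" with a wildcard pattern.
    return fnmatchcase(filename, pattern)


# Hypothetical folder listing, not taken from the real server.
folders = ["/FARS/1975", "/FARS/1976", "/FARS/2010"]
print(select_dbf_dirs(folders, {"1975", "2010"}))

# Hypothetical file names, only to show the wildcard behaviour.
for name in ["FARS1975.zip", "readme.txt"]:
    print(name, should_download(name))
```

Filtering the folder list before the download step is the point here: the fewer directories and files that survive these checks, the less the Download node has to fetch.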