In the last step, we downloaded all of our data, deposited it into directories that store this source data, backed it up, and write-protected the files. Now that we have done all of that, it is time to start working with the data! There is only one problem: almost inevitably, the data do not come neat, tidy, and ready to use. Often, the data contain major problems and need to be constructed before they are usable. In this installment, I will write about managing files for cleaning, constructing, and storing datasets.
Structuring Work: Data Cleaning and Construction, Laying the Foundation
Saturday, April 16th, 2011 11:37a.m.
Miscellany
- The views presented here are solely and entirely my own; they do not represent those of my colleagues, employer, or any funding agencies that may support me.
- The writing on this blog is covered by a Creative Commons License (described here). Feel free to distribute or re-post with a link to the original content provided that it is freely available to others.
