=========================================
Data Wrangling & Record Linkage Resources
=========================================

Resources
---------

-- Home Mortgage Disclosure Act example from class

   * `Code `_
   * `Sample data `_

-- `John Canny's slides on Data Cleaning and Integration `_

-- `A Theory For Record Linkage by Ivan Fellegi and Alan Sunter `_

-- `Detailed discussion of Levenshtein Distance `_

-- `Notes on Record Linkage presented in class <./record_linkage_notes.html>`_

-- Useful book: *Big Data and Social Science: A Practical Guide to Methods and Tools (Chapman & Hall/CRC Statistics in the Social and Behavioral Sciences)* by Foster, Ghani et al. On reserve in Regenstein for the quarter. See Chapter 3 on Record Linkage.

-- `Additional scripts presented in class on Piazza `_