Events and Activities
Also see our list of workshops.
Flipped classrom about access to restricted access data. Students will be introduced to the research proposal mechanism of the Federal Statistical Research Data Center, including data from the Census Bureau, NCHS, and BLS.
Discussion will focus on how to access various restricted access data sets. Guest presenters may be present live in the videoconference classroom.
Part 3 switches gears, and discusses the need for and the requirements of replicable science (in general, and in restricted-access environments). This part is a live lecture by Lars Vilhuber.
- Restricted Access Data: INFO7470-S8-Proposals, Kristen Monaco on BLS proposal review, Matthias Umkehrer on IAB access
- Replicable Science: INFO7470-S9-Replicable Science
- Updates and Flipped Class questions: INFO7470-S8-Updates and flipped class questions
The class is flipped classroom.
John M. Abowd, U.S. Census Bureau and Cornell University, will lead the discussion.
Introduction to record linking
- What is record linking, what is it not, what is the theory?
- Record linking: applications and examples – How do you do it, what do you need, what are the possible complications?
- Examples of record linking
Total quality evaluation – errors from coverage, sampling, edit, and imputation.
- Formal models of edits and imputations
- Missing data overview
- Missing records – Frame or census – Survey
- Missing items
- Overview of different products
- Overview of methods
- Formal multiple imputation methods
- INFO7470 S10 -Statistical Tools Edit and Imputation
- INFO7470 S11 -Statistical Tools Edit and Imputation Examples
The lab (an edit and imputation exercise) will be posted on the INFO7470x edX site. You will need to create a program, and upload the program (language of your choice) to edX.
- Why must users of restricted-access data learn about confidentiality protection?
- What is statistical disclosure limitation?
- What are privacy-preserving data mining and differential privacy?
- Basic methods for disclosure avoidance (SDL)
- Rules and methods for model-based SDL
- SDL-based noise methods
- Synthetic data
- Differential privacy methods
- INFO7470 S12 -Updates
- INFO7470-S12-Statistical Disclosure Limitation
- INFO7470-S14-Synthetic Data
- INFO7470 S14 SDS
- Part A: Spatial Analysis (Nicholas Nagle of University of Tennessee – Knoxville)
- Part B: Network Analysis (John Abowd, Cornell University)
Part A: Spatial Analysis
- Basic Geocoding
- Tools for Geocoding
- Analysis Methods
- Tools for Geographic Analysis
About the Guest Lecturer
Nicholas Nagle, University of Tennessee – Knoxville
Part B: Network Analysis
This part of the lecture is a live class.