Commit e9fe0ee

build
1 parent 60e50fc commit e9fe0ee

9 files changed

Lines changed: 104 additions & 10 deletions

Makefile

Lines changed: 13 additions & 1 deletion
@@ -1,4 +1,4 @@
-.PHONY: all format test install download upload docker documentation data clean build
+.PHONY: all format test install download upload docker documentation data clean build paper clean-paper

 all: data test

@@ -49,3 +49,15 @@ build:

 publish:
 	twine upload dist/*
+
+paper: paper/main.pdf
+
+paper/main.pdf: $(wildcard paper/sections/*.tex) $(wildcard paper/sections/*/*.tex) $(wildcard paper/bibliography/*.bib) paper/main.tex paper/macros.tex
+	cd paper && \
+	TEXINPUTS=".:sections/methodology/:" pdflatex main && \
+	BSTINPUTS=".:bibliography/:" BIBINPUTS=".:bibliography/:" bibtex main && \
+	pdflatex main && \
+	pdflatex main
+
+clean-paper:
+	rm -f paper/*.aux paper/*.bbl paper/*.blg paper/*.log paper/*.out paper/*.toc paper/main.pdf paper/sections/**/*.aux
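
A note on the `**` glob components used in this Makefile: GNU Make's `wildcard` function and a plain POSIX shell both use ordinary, non-recursive globbing, so a `**` path component behaves like a single `*`. A pattern such as `paper/sections/**/*.tex` therefore matches files exactly one directory below `paper/sections/`, not the files directly inside it. The sketch below illustrates this with Python's `glob` in its default non-recursive mode, which treats `**` the same way. It is a stand-in for Make, not Make itself. The file layout is taken from the paths in this commit, and the helper names `matches` and `demo` are hypothetical.

```python
import glob
import os
import tempfile

# Section files named in this commit's diff.
FILES = [
    "paper/sections/data.tex",
    "paper/sections/methodology.tex",
    "paper/sections/methodology/overview.tex",
    "paper/sections/methodology/quantile_forests.tex",
    "paper/sections/methodology/reweighting.tex",
]


def matches(root, pattern):
    """Return sorted paths (relative to root) matched by a non-recursive glob."""
    # recursive defaults to False, so `**` acts like a single `*`.
    hits = glob.glob(os.path.join(root, pattern))
    return sorted(os.path.relpath(h, root).replace(os.sep, "/") for h in hits)


def demo():
    """Create the layout in a temporary directory and compare two patterns."""
    with tempfile.TemporaryDirectory() as root:
        for rel in FILES:
            path = os.path.join(root, rel)
            os.makedirs(os.path.dirname(path), exist_ok=True)
            open(path, "w").close()
        return {
            "paper/sections/**/*.tex": matches(root, "paper/sections/**/*.tex"),
            "paper/sections/*.tex": matches(root, "paper/sections/*.tex"),
        }


if __name__ == "__main__":
    for pattern, found in demo().items():
        print(pattern, "->", found)
```

Listing both the top-level pattern and the one-level-down pattern as prerequisites covers every section file in this layout. Deeper nesting would need an additional pattern per level.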

README.md

Lines changed: 49 additions & 0 deletions
@@ -1 +1,50 @@
 # PolicyEngine US Data
+
+## Installation
+
+```bash
+pip install policyengine-us-data
+```
+
+## Building the Paper
+
+### Prerequisites
+
+The paper requires a LaTeX distribution (e.g., TeX Live or MiKTeX) with the following packages:
+
+- graphicx (for figures)
+- amsmath (for mathematical notation)
+- natbib (for bibliography management)
+- hyperref (for PDF links)
+- booktabs (for tables)
+- geometry (for page layout)
+- microtype (for typography)
+- xcolor (for colored links)
+
+On Ubuntu/Debian, you can install these with:
+
+```bash
+sudo apt-get install texlive-latex-base texlive-latex-recommended texlive-latex-extra texlive-fonts-recommended
+```
+
+On macOS with Homebrew:
+
+```bash
+brew install --cask mactex
+```
+
+### Building
+
+To build the paper:
+
+```bash
+make paper
+```
+
+To clean LaTeX build files:
+
+```bash
+make clean-paper
+```
+
+The output PDF will be at `paper/main.pdf`.

paper/bibliography/references.bib

Lines changed: 10 additions & 0 deletions
@@ -121,4 +121,14 @@ @article{auerbach2018
 number = {3},
 pages = {541--576},
 year = {2018}
+}
+
+@article{saez2012,
+title = {The Elasticity of Taxable Income with Respect to Marginal Tax Rates: A Critical Review},
+author = {Saez, Emmanuel and Slemrod, Joel and Giertz, Seth H},
+journal = {Journal of Economic Literature},
+volume = {50},
+number = {1},
+pages = {3--50},
+year = {2012}
 }

paper/main.pdf

23.7 KB
Binary file not shown.

paper/main.tex

Lines changed: 17 additions & 1 deletion
@@ -4,11 +4,27 @@
 \usepackage{amsmath}
 \usepackage{natbib}
 \usepackage{hyperref}
+\usepackage{booktabs}
+\usepackage{geometry}
+\usepackage{microtype}
+\usepackage{xcolor}

 \input{macros}

+\geometry{margin=1in}
+\hypersetup{
+colorlinks=true,
+linkcolor=blue,
+filecolor=magenta,
+urlcolor=blue,
+citecolor=blue,
+}
+
 \title{Enhancing Survey Microdata with Administrative Records: \\ A Novel Approach to Microsimulation Dataset Construction}
-\author{PolicyEngine Team}
+\author{
+Nikhil Woodruff\thanks{PolicyEngine} \and
+Max Ghenis\thanks{PolicyEngine}
+}
 \date{\today}

 \begin{document}

paper/sections/data.tex

Lines changed: 2 additions & 1 deletion
@@ -1,4 +1,5 @@
-\section{Data}
+\section{Data}\label{sec:data}
+

 \subsection{Current Population Survey}

paper/sections/methodology.tex

Lines changed: 6 additions & 0 deletions
@@ -0,0 +1,6 @@
+\section{Methodology}\label{sec:methodology}
+
+% Include methodology subsections
+\input{sections/methodology/overview}
+\input{sections/methodology/quantile_forests}
+\input{sections/methodology/reweighting}

paper/sections/methodology/quantile_forests.tex

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-\section{Quantile Regression Forests}
+\section{Quantile Regression Forests}\label{sec:qrf}

 We use quantile regression forests (QRF) in two distinct ways: direct imputation of missing variables, and generation of synthetic records.

paper/sections/methodology/reweighting.tex

Lines changed: 6 additions & 6 deletions
@@ -1,4 +1,4 @@
-\section{Reweighting Procedure}
+\section{Reweighting Procedure}\label{sec:reweighting}

 Our reweighting process optimizes household weights to match administrative targets while determining the relative value of original CPS records versus PUF-derived synthetic records.

@@ -62,12 +62,12 @@ \subsection{Optimization Approach}

 \subsection{Implementation Details}

-From `enhanced_cps.py`:
+From \texttt{enhanced\_cps.py}:
 \begin{itemize}
-\item Learning rate: 0.1
-\item Dropout rate: 5\%
-\item Optimizer: Adam
-\item Maximum iterations: 5,000
+\item Learning rate: 0.1
+\item Dropout rate: 5\%
+\item Optimizer: Adam
+\item Maximum iterations: 5,000
 \end{itemize}

 % TODO: Add specific convergence metrics and typical runtime statistics
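
To make the hyperparameters in the Implementation Details list concrete, here is a minimal pure-Python sketch of a reweighting loop that uses them. The diff does not show `enhanced_cps.py`, so everything beyond the four listed settings is an assumption for illustration only: the log-weight parameterization, the mean squared relative error loss, applying inverted dropout to the weights themselves, the hand-written Adam update, the toy data, and the function names `relative_sq_loss` and `reweight`.

```python
import math
import random

# Settings quoted in the Implementation Details list.
LEARNING_RATE = 0.1
DROPOUT_RATE = 0.05
MAX_ITERATIONS = 5_000


def relative_sq_loss(weights, features, targets):
    """Mean squared relative error between weighted totals and targets (assumed loss)."""
    loss = 0.0
    for j, target in enumerate(targets):
        estimate = sum(w * row[j] for w, row in zip(weights, features))
        loss += ((estimate - target) / target) ** 2
    return loss / len(targets)


def reweight(features, targets, seed=0, beta1=0.9, beta2=0.999, eps=1e-8):
    """Fit positive record weights with Adam plus dropout (toy sketch, not the project's code)."""
    rng = random.Random(seed)
    n, n_targets = len(features), len(targets)
    theta = [0.0] * n   # log-weights, so every weight starts at 1 (assumption)
    m1 = [0.0] * n      # Adam first-moment estimates
    m2 = [0.0] * n      # Adam second-moment estimates
    keep = 1.0 - DROPOUT_RATE
    for step in range(1, MAX_ITERATIONS + 1):
        # Inverted dropout: zero each weight with probability 5%, rescale the rest.
        mask = [(1.0 / keep) if rng.random() < keep else 0.0 for _ in range(n)]
        w = [math.exp(t) * s for t, s in zip(theta, mask)]
        residuals = []
        for j in range(n_targets):
            estimate = sum(w[i] * features[i][j] for i in range(n))
            residuals.append((estimate - targets[j]) / targets[j])
        for i in range(n):
            # Chain rule through w_i = mask_i * exp(theta_i).
            grad = sum(
                2.0 * residuals[j] * w[i] * features[i][j] / targets[j]
                for j in range(n_targets)
            ) / n_targets
            m1[i] = beta1 * m1[i] + (1 - beta1) * grad
            m2[i] = beta2 * m2[i] + (1 - beta2) * grad * grad
            m_hat = m1[i] / (1 - beta1 ** step)
            v_hat = m2[i] / (1 - beta2 ** step)
            theta[i] -= LEARNING_RATE * m_hat / (math.sqrt(v_hat) + eps)
    # Dropout is switched off when reporting the final weights.
    return [math.exp(t) for t in theta]


if __name__ == "__main__":
    # Hypothetical toy data: each record carries a count of 1 and an income value.
    features = [[1.0, 10.0 * k] for k in range(1, 21)]
    targets = [40.0, 5000.0]  # made-up target totals for count and income
    before = relative_sq_loss([1.0] * len(features), features, targets)
    after = relative_sq_loss(reweight(features, targets), features, targets)
    print(f"loss before: {before:.4f}, loss after: {after:.6f}")
```

Optimizing log-weights keeps every weight positive without explicit constraints, which is one common choice. The actual implementation may handle positivity, the loss, and dropout differently.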
