You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
title = {PyTorch: An Imperative Style, High-Performance Deep Learning Library},
172
+
author = {Paszke, Adam and Gross, Sam and Massa, Francisco and Lerer, Adam and Bradbury, James and Chanan, Gregory and Killeen, Trevor and Lin, Zeming and Gimelshein, Natalia and Antiga, Luca and others},
173
+
journal = {Advances in Neural Information Processing Systems},
The U.S. federal government maintains several microsimulation capabilities through its policy analysis agencies, which form the foundation for official policy analysis and revenue estimation.
18
18
19
-
The Congressional Budget Office's model emphasizes behavioral responses and their macroeconomic effects \cite{cbo2018}. Their approach uses a two-stage estimation process:
19
+
The Congressional Budget Office's model emphasizes behavioral responses and their macroeconomic effects \citep{cbo2018}. Their approach uses a two-stage estimation process:
\item Dynamic scoring: incorporating behavioral responses calibrated to empirical literature
24
24
\end{enumerate}
25
25
26
-
CBO's elasticity assumptions have evolved over time in response to new research, particularly regarding the elasticity of taxable income (ETI). Their current approach varies ETI by income level and type of tax change, broadly consistent with the academic consensus surveyed in \cite{saez2012}. The model also incorporates detailed projections of demographic change and economic growth from CBO's other forecasting models.
26
+
CBO's elasticity assumptions have evolved over time in response to new research, particularly regarding the elasticity of taxable income (ETI). Their current approach varies ETI by income level and type of tax change, broadly consistent with the academic consensus surveyed in \citep{saez2012}. The model also incorporates detailed projections of demographic change and economic growth from CBO's other forecasting models.
27
27
28
-
The Joint Committee on Taxation employs a similar approach but with particular focus on conventional revenue estimates \cite{jct2023}. Their model maintains detailed imputations for:
28
+
The Joint Committee on Taxation employs a similar approach but with particular focus on conventional revenue estimates \citep{jct2023}. Their model maintains detailed imputations for:
29
29
30
30
\begin{itemize}
31
31
\item Business income allocation between tax forms
A distinguishing feature is their treatment of tax expenditure interactions - addressing both mechanical overlap (e.g., between itemized deductions) and behavioral responses (e.g., between savings incentives).
38
38
39
-
The Treasury's Office of Tax Analysis model features additional detail on corporate tax incidence and international provisions \cite{ota2012}. Their approach emphasizes the relationship between different types of tax instruments through a series of linked models:
39
+
The Treasury's Office of Tax Analysis model features additional detail on corporate tax incidence and international provisions \citep{ota2012}. Their approach emphasizes the relationship between different types of tax instruments through a series of linked models:
40
40
41
41
\begin{itemize}
42
42
\item Individual income tax model using matched administrative data
@@ -53,7 +53,7 @@ \subsubsection{Urban Institute Family of Models}
53
53
54
54
The Urban Institute maintains several complementary microsimulation models, each emphasizing different aspects of tax and transfer policy analysis.
55
55
56
-
The Urban-Brookings Tax Policy Center model \cite{tpc2022} combines the IRS Public Use File with Current Population Survey data through predictive mean matching, an approach similar to what we employ in Section~\ref{sec:methodology}. Their imputation strategy aims to preserve joint distributions across variables using regression-based techniques for:
56
+
The Urban-Brookings Tax Policy Center model \citep{tpc2022} combines the IRS Public Use File with Current Population Survey data through predictive mean matching, an approach similar to what we employ in Section~\ref{sec:methodology}. Their imputation strategy aims to preserve joint distributions across variables using regression-based techniques for:
57
57
58
58
\begin{itemize}
59
59
\item Wealth holdings (18 asset and debt categories)
@@ -63,7 +63,7 @@ \subsubsection{Urban Institute Family of Models}
63
63
\item Retirement accounts (DB/DC split and contribution levels)
64
64
\end{itemize}
65
65
66
-
TRIM3 emphasizes the time dimension of policy analysis, with sophisticated procedures for converting annual survey data into monthly variables \cite{trim2024}. Key innovations include:
66
+
TRIM3 emphasizes the time dimension of policy analysis, with sophisticated procedures for converting annual survey data into monthly variables \citep{trim2024}. Key innovations include:
67
67
68
68
\begin{itemize}
69
69
\item Allocation of employment spells to specific weeks using BLS benchmarks
@@ -74,11 +74,11 @@ \subsubsection{Urban Institute Family of Models}
74
74
75
75
This monthly allocation approach informs our treatment of time variation in Section~\ref{sec:data}.
76
76
77
-
The newer ATTIS model \cite{attis2024} focuses on interactions between tax and transfer programs. Building on the American Community Survey rather than the CPS provides better geographic detail at the cost of requiring additional tax variable imputations. Their approach to correcting for benefit underreporting in survey data parallels our methods in Section~\ref{sec:methodology}.
77
+
The newer ATTIS model \citep{attis2024} focuses on interactions between tax and transfer programs. Building on the American Community Survey rather than the CPS provides better geographic detail at the cost of requiring additional tax variable imputations. Their approach to correcting for benefit underreporting in survey data parallels our methods in Section~\ref{sec:methodology}.
78
78
79
79
\subsubsection{Other Research Institution Models}
80
80
81
-
The Institute on Taxation and Economic Policy model \cite{itep2024} is unique in its comprehensive treatment of federal, state and local taxes. Key features include:
81
+
The Institute on Taxation and Economic Policy model \citep{itep2024} is unique in its comprehensive treatment of federal, state and local taxes. Key features include:
82
82
83
83
\begin{itemize}
84
84
\item Integration of income, sales, and property tax microsimulation
@@ -87,7 +87,7 @@ \subsubsection{Other Research Institution Models}
87
87
\item Race/ethnicity analysis through statistical matching
88
88
\end{itemize}
89
89
90
-
The Tax Foundation's Taxes and Growth model \cite{tf2024} emphasizes macroeconomic feedback effects through a neoclassical growth framework. Their approach includes:
90
+
The Tax Foundation's Taxes and Growth model \citep{tf2024} emphasizes macroeconomic feedback effects through a neoclassical growth framework. Their approach includes:
Copy file name to clipboardExpand all lines: paper/sections/data.tex
+1-1Lines changed: 1 addition & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -22,7 +22,7 @@ \subsection{Current Population Survey}
22
22
23
23
\subsection{IRS Public Use File}
24
24
25
-
The Internal Revenue Service Public Use File (PUF) is a national sample of individual income tax returns, representing the 151.2 million Form 1040, Form 1040A, and Form 1040EZ Federal Individual Income Tax Returns filed for Tax Year 2015. The file contains 119,675 records sampled at varying rates across strata, with 0.07 percent sampling for strata 7 through 13 \cite{bryant2023b}. The data are extensively transformed to protect taxpayer privacy while preserving statistical properties.
25
+
The Internal Revenue Service Public Use File (PUF) is a national sample of individual income tax returns, representing the 151.2 million Form 1040, Form 1040A, and Form 1040EZ Federal Individual Income Tax Returns filed for Tax Year 2015. The file contains 119,675 records sampled at varying rates across strata, with 0.07 percent sampling for strata 7 through 13 \citep{bryant2023b}. The data are extensively transformed to protect taxpayer privacy while preserving statistical properties.
26
26
27
27
The Public Use Tax Demographic File supplements the PUF with:
The process of projecting historical microdata involves both demographic aging and economic indexing based on US government forecasts. Our aging process occurs in two stages: first to reach our baseline year (2024), and then to project the calibrated dataset forward.
4
+
5
+
\subsubsection{Growth Factor Construction}
6
+
7
+
For each variable in the tax-benefit system with a specified growth parameter, we compute change factors from the base year through 2034:
where f(t) represents the index factor for time t.
40
+
41
+
\subsubsection{Forward Projection}
42
+
43
+
After constructing and calibrating the enhanced 2024 dataset, we project it to future years using the same indexing framework. This maintains the dataset's enhanced distributional properties while reflecting:
44
+
45
+
\begin{itemize}
46
+
\item Economic growth forecasts for monetary variables
47
+
\item Statutory adjustments to program parameters
48
+
\item Population projections applied to household weights
Following the IRS specifications for the Public Use File, we construct three key demographic variables: dependent ages, primary taxpayer age ranges, and earnings splits between spouses.
4
+
5
+
\subsection{Dependent Ages}
6
+
7
+
For each dependent, we construct age categories following IRS constraints:
8
+
\begin{itemize}
9
+
\item Under 5
10
+
\item 5 under 13
11
+
\item 13 under 17
12
+
\item 17 under 19
13
+
\item 19 under 24
14
+
\item 24 or older
15
+
\end{itemize}
16
+
17
+
The number of dependents is limited by filing status:
18
+
\begin{itemize}
19
+
\item Up to 3 dependents for joint returns and head of household returns
20
+
\item Up to 2 dependents for single returns
21
+
\item Up to 1 dependent for married filing separately returns
22
+
\end{itemize}
23
+
24
+
Dependents are ordered sequentially by type:
25
+
\begin{enumerate}
26
+
\item Children living at home
27
+
\item Children living away from home
28
+
\item Other dependents
29
+
\item Parents
30
+
\end{enumerate}
31
+
32
+
\subsubsection{Primary Taxpayer Age}
33
+
34
+
Age ranges are constructed differently for dependent and non-dependent returns:
35
+
36
+
For non-dependent returns:
37
+
\begin{itemize}
38
+
\item Under 26
39
+
\item 26 under 35
40
+
\item 35 under 45
41
+
\item 45 under 55
42
+
\item 55 under 65
43
+
\item 65 or older
44
+
\end{itemize}
45
+
46
+
For dependent returns:
47
+
\begin{itemize}
48
+
\item Under 18
49
+
\item 18 under 26
50
+
\item 26 or older
51
+
\end{itemize}
52
+
53
+
\subsubsection{Earnings Splits}
54
+
55
+
For joint returns, we calculate the primary earner's share of total earnings:
56
+
57
+
\[\text{Primary Share} = \frac{\text{Primary Wages} + \text{Primary SE Income}}{\text{Total Wages} + \text{Total SE Income}} \]
58
+
59
+
Where:
60
+
\begin{itemize}
61
+
\item Primary wages and SE income = E30400 - E30500
62
+
\item Secondary wages and SE income = E30500
63
+
\end{itemize}
64
+
65
+
This share is categorized into:
66
+
\begin{itemize}
67
+
\item 75 percent or more earned by primary
68
+
\item Less than 75 percent but more than 25 percent earned by primary
69
+
\item Less than 25 percent earned by primary
70
+
\end{itemize}
71
+
72
+
\subsubsection{Implementation Details}
73
+
74
+
When decoding age ranges into specific ages, we use random assignment within the range to avoid unrealistic bunching. For example, when the PUF indicates age 80, we randomly assign an age between 80 and 84.
75
+
76
+
The ordering of dependents is preserved when constructing synthetic tax units to maintain consistency with the original data structure.
0 commit comments