Jeffrey Clark

Economist · Open-Source Contributor

PhD student, Stockholm University

My research is in development, labor, and environmental economics. I also lead RegiStream, open-source register-data infrastructure used by researchers and government agencies.

  • Visit Summer 2026 · in days In Beijing · day of Summer 2026 · completed
    Visiting Researcher at the World Resources Institute, Beijing
  • R&R May 2026
    autolabel: Automating variable and value labeling in Stata
    Revise and Resubmit at the Stata Journal
  • Press May 2026
    Stockholm University Department of Economics

Selected work

  1. 2026 R&R Stata Journal

    autolabel: Automating variable and value labeling in Stata

    Jeffrey Clark · Jie Wen

    Register-based datasets from national statistical agencies arrive with cryptic variable names, unlabeled coded values, and documentation that lives separately from the data, often only in the national language. The autolabel command automates variable and value labeling by matching dataset variables to centralized, multilingual CSV metadata repositories and applying labels in a single call. A deferred execution pattern writes labeling commands to a temporary do-file during dataset inspection and applies them after the original state is restored, letting a single user-level command both inspect and apply. The domain-agnostic CSV schema lets any institution author its own metadata bundle.

  2. 2022 MSc thesis Stockholm School of Economics

    Rural Road Improvements and Local Agricultural Intensification: A Remote Sensing Evaluation in Mozambique

    Jeffrey Clark · Tamina Matti

    Transaction costs serve as an obstacle to competitive market exchanges in rural and remote areas around the world. Improvements to transportation infrastructure are hypothesized to lower these costs and help alleviate poverty among smallholder farmers. Yet, few empirical studies estimate the effect of improved rural infrastructure on agricultural output, especially in the sub-Saharan context. This thesis investigates whether rural road upgrades in northern Mozambique have any short-term effects on agricultural output; specifically, we evaluate the early effects of an ongoing World Bank project. By employing remote sensing and machine learning methods, we identify rural road upgrades that took place between 2018 and 2021. Using a differences-in-differences approach, we find that areas in immediate proximity to roads that received an upgrade did not experience changes in agricultural output, compared to areas that did not receive an upgrade. We restrict the sample and find a significant increase in agricultural output, although not robust. Future research should consider the medium- and long-term impact of rural road upgrades for the complete picture to emerge. While sole dependence on remote sensing data remains a challenge in economics, it is a promising avenue for future research, particularly in contexts where comprehensive survey data is lacking.

    Keywords: remote sensing · rural road improvements · agricultural output · Mozambique · economic development

    Main data contribution: a remote-sensing pipeline that detects rural road upgrades from Sentinel-2 imagery, useful for studying upgrade impacts where survey data is sparse. Below, six road segments our pipeline flagged as upgraded between 2020 and 2021.

    Pre · 2020
    Post · 2021
    Segment:

Research in progress

Active projects and measurement work at different stages of development.

Remote sensing

Mapping environmental conditions from satellite imagery

Conditions like heavy metals, water-quality indicators, and mining residues are difficult to measure across space and time. The project trains models on hyperspectral satellite imagery to infer such conditions from Sentinel-2 observations at population scale. Hyperspectral imagery is spectrally rich but spatially limited; Sentinel-2 is the opposite. Transferring the spectral signal between them enables environmental measurement where direct sampling is impractical.

The methodology is portable across pollutants, biophysical indicators, and regions.

Kenya

Customary tenure and investment under community land reform

Kenya's 2016 Community Land Act sought to formalize collective ownership over communally-held land (about a tenth of the country's territory) by issuing title to community groups rather than to individual households. The project asks whether community titling translates into agricultural investment, conservation outcomes, and changes in intra-community welfare.

Because household surveys do not exist for the 92 communities gazetted under the Act, the analysis is built from scratch: scraped Environment and Land Court records, NLP-extracted property identifiers, and Sentinel-2 satellite imagery, fused into a community-level panel. The setting sits at the institutions × property-rights frontier; community-titled regimes are common across sub-Saharan Africa but remain understudied.

Climate × health

Floods, mental health, and the mitigating role of dams

In Sweden, sickness benefits and near-universal property insurance absorb the earnings cost of river floods, leaving little detectable income disruption. The project asks what residual welfare cost remains, and whether flood exposure shows up as persistent mental-health effects (depression, stress disorders, sleeping-pill use) in clinical and prescription records.

A second question is whether engineered flood-attenuation infrastructure reduces this welfare cost during extreme events. If it does, standard cost-benefit analyses of water-management infrastructure may miss an important non-market benefit.

In progress.

Map of Sweden showing 117 river basins colored by dam attenuation, with hydropower plants marked as triangles.
Setting: Sweden's 117 river basins coloured by dam attenuation. Black triangles mark hydropower plants, scaled by capacity.

Projects

Open-source infrastructure I build and share.

RegiStream

Active

Open-source infrastructure layer between register data and research. A multilingual metadata catalog with two companion tools (autolabel and datamirror), used by researchers and government agencies across the Nordics.

registream.org →

Stitching Services

Predoc, 2023–24

Distributed orchestration for the Aerial History Project stitching pipeline at 1M+ image scale. Docker and Apptainer containers running across Berkeley's Savio HPC cluster, automated via encrypted SSH/TOTP.

  • 1M+ images processed
  • Docker · Apptainer
  • Savio HPC
stitching-services →