Web
Analytics
top of page
  • Writer's pictureDigital Life Initiative

Liberating Data with FOIA (Part I)

Updated: Feb 13



By Corin Faife (Agence France-Presse)


After leaving a previous role as The Verge’s senior privacy and cybersecurity reporter, I was looking for a chance to pause, catch my breath, and explore new areas. A fellowship at the DLI was a great opportunity to do this, and having now moved on again towards the end of 2023, I’m reflecting on the various topics I was able to dive into in a year at Cornell Tech.

 

The three main areas I covered in this time were FOIA law as it applies to data; applications of AI for investigative journalism; and improving my grasp of JavaScript with a view to building news applications. Here I’m going to dive into the first of the three, in what will be a two part series: starting with general principles in one post then explaining the details of a specific request in another.

 

Applying FOIA law to data

 

Bringing previously unreleased documents to light through FOIA is an exciting parts of journalism, but the process of actually obtaining them can be fairly dry. One of the things it takes is an understanding of the different agencies and departments that make up the government; along with a rough mental model of the different types of records these agencies produce, and enough curiosity, motivation, and persistence to request them.

 

At about the time I decided to sharpen my skills in the above areas, I heard about a new initiative from data journalist/editor Jeremy Singer-Vine, called the Data Liberation Project. 

 

Singer-Vine is widely known in data journalism thanks to his Data Is Plural newsletter, a weekly collection of “useful/curious datasets” running to more than 350 installments at time of writing. The Data Liberation Project — DLP for short — is an extension of this work, but with a shift from curation to investigation. Rather than catalog what's already public, the goal of the DLP is to "identify, obtain, reformat, clean, document, publish, and disseminate government datasets of public interest." That means filing public records requests with governmental agencies and publishing any data received along with the text of the correspondence that helped obtain it: an open-sourcing of the process and results that can serve as a useful point of reference for others hoping to undertake similar work.

 

I reached out to Singer-Vine about the project and he suggested we could jointly file a request. In theory there was an upside for both of us: I could learn from a deeply experienced collaborator, and in answering my questions he could keep refining his approach to FOIA education — feeding back into the DLP’s knowledge-sharing mission. 

 

We ended up filing a joint request in March of 2023 which eventually liberated some previously undisclosed data a full six months later, in October of the same year. Here are some of the lessons I took from the process, drawn from notes I made along the way.

 

Where to find government databases

 

The government collects a lot of data. Over time, thanks to campaigning, legislation, and the internet, there’s been a trend of more data being made available through open data initiatives like Data.gov, but that by no means implies everything of public interest will be made public by default.

 

If we’re talking about data in a general sense – individual records, reports, and so on – then sites like FOIA Wiki do a great job of explaining how to find and request them. But I was interested in following the approach of the DLP, which focuses on finding and requesting datasets from the government that can then be analyzed with the tools and methods of data journalism.

 

So, how does one identify datasets and databases maintained by government agencies? Firstly, by knowing what information agencies must disclose about their data collection practices, and where these notices will be posted. From what I learned, there are three key sources:

 

1. Privacy Impact Assessments

 

In the US, the E-Government Act of 2002, Section 208, created a requirement for agencies to conduct privacy impact assessments (known as PIAs) when they collect personal data to input into electronic information systems. Many agencies make these available in a distinct section of their governmental website — for example, here’s a nicely formatted list of PIAs uploaded by DHS’s Cybersecurity and Infrastructure Security Agency (CISA). 





So one place to start is by visiting the website of a department or agency with jurisdiction over your area of interest, and browsing or searching for PIAs published there to see if any of them hint at the existence of a compelling dataset that is not yet public. (The question of how to know what makes a dataset compelling and/or newsworthy is a key intuition to develop as a data journalist, but is outside the scope of this blog post…)


2. System of Record Notices

 

PIAs are useful but they’re not the only way that agencies signal an intent to collect data. Thanks to the Privacy Act of 1974, government agencies are required to post notices about “systems of records” – broadly a synonym for databases, though encompassing offline filing systems – in the Federal Register, which is the journal of the government of the United States.

 

In fact, there’s a whole category of entries in the Register known as System of Records Notices (SORNs), and searching for “system of records” plus an agency name usually brings up many references to government databases. (There’s also a dedicated page just for Privacy Act notices.)

 

Finding these notices can be very helpful for framing a FOIA request because the notifying agency will identify the system of records by name and explain its purpose. For example, in this March 2023 notice, the Fish and Wildlife Service details its intent to modify 11 of its current systems of records to add a procedure for responding to data breaches. After the summary of the notice, all 11 of these systems are identified by their official designation and usage:

 



With these details published, SORNs in the Federal Register are a great way to find out about government databases. They can be found via search engine too without visiting the Register: a basic search of “[Agency name] SORNs” is often enough to bring up a page on the relevant agency’s website, e.g. the top result for “Department of the Interior SORNs” is this page:



One thing to take into account though: not all government databases are covered by the Privacy Act, as some are deemed not to contain any personal information that is subject to privacy law.

 

3. Information collection requests

 

SORNs are good for getting a high-level overview of government information systems, but for a more granular way to see what data agencies are gathering we can turn to information recorded by OIRA, the Office of Information and Regulatory Affairs.

 

OIRA was created by the 1980 Paperwork Reduction Act (PRA), a law governing how federal agencies collect information from the public. The aims of the PRA are to avoid burdening the public with unnecessary information requests, and to make sure that data collected really is a good fit for its proposed use. What this means in practice is that when agencies wish to collect information from the public, they must first get approval from OIRA by submitting a request.

 

OIRA has a website, reginfo.gov, that holds information about all of the different information collection programs that have been submitted to OIRA. It can be a little difficult to navigate, but there’s a page that lets you access an inventory of all currently active information collections, and a search tool with many different options to search by agency, sub-agency, date of request, and other details that collecting agencies must supply such as estimations for the number of respondents and time/cost burden for those who will respond to the survey.

 

With the OIRA search function you can select an agency and sub-agency of interest – a great help if you have a general area of interest in mind – and filter by requests that are currently active.



As an example, running a search on ICRs coming from the Federal Railroad Administration turns up a list of responses that point to data that could, if obtained, potentially form the basis of a news story: from the crashworthiness of locomotives, to noise exposure for railroad employees, to railway bridge safety standards, and more.





So: having found a reference to a relevant database, data collection project, or system of records through one of those three sources – now what?


The short answer is that it’s time to craft a FOIA request to ask for records contained in the system. The longer answer is that specific wording might be needed to receive these records in a format suitable for data analysis, which we’ll get into in Part II of this blog series.

 

In the meantime, all of the Data Liberation Project’s previous records requests are online if you’re looking for inspiration, and more information about the sleuthing process is available in the DLP’s Fathoming Federal Data guide.



Corin Faife

(AFP) Agence France-Presse

DLI Alum

Cornell Tech



Cornell Tech | 2024



17 Comments


Memozi Liza
Memozi Liza
Aug 15

You can have hours of tasty fun with redactle game, whether you're an experienced puzzle fan or just looking for some light entertainment.

Like

Ismail Yusibov
Ismail Yusibov
Aug 14

https://drive.google.com/file/d/1YXrkB-NG_QevKnDeJrnPW3LeK-CnKgEj/view?usp=sharing

https://acrobat.adobe.com/id/urn:aaid:sc:AP:40e37e2d-3ad6-4aeb-bd3c-fb106124f83c

https://issuu.com/adanaweb/docs/i_zmir_i_avukat_-_i_zmir_tazminat_avukat_

https://www.dropbox.com/scl/fi/janiea8ww5gl79t9aq8sg/zmir-Avukat-zmir-Tazminat-Avukat.pdf?rlkey=saz3jm0or6xntt5l8jq7o4ury&st=csondv4s&dl=0

https://onedrive.live.com/?authkey=%21AJtFI7YCQcME2tg&id=14CCB181B8F90950%2112800&cid=14CCB181B8F90950&parId=root&parQt=sharedby&o=OneUp

https://www.slideshare.net/slideshow/izmir-is-avukati-izmir-tazminat-avukati/270280916

https://www.scribd.com/document/750992136/%C4%B0zmir-%C4%B0%C5%9F-Avukat%C4%B1-%C4%B0zmir-Tazminat-Avukat%C4%B1

https://www.4shared.com/s/f-3gChbB_ge

https://www.canva.com/design/DAGLJxb-RUU/2WDXpDcJNoail3mz-iu3rA/edit?utm_content=DAGLJxb-RUU&utm_campaign=designshare&utm_medium=link2&utm_source=sharebutton

https://www.academia.edu/122105563/%C4%B0%C5%9F_Davalar%C4%B1nda_Avukatl%C4%B1k_S%C3%BCre%C3%A7leri_ve_Stratejileri?source=swp_share

https://www.calameo.com/read/005981559a10982158c45

https://app.box.com/s/dnuybw9kbkw6c03xa9voz22bftfmnlif

https://www.yumpu.com/tr/document/read/68754247/izmir-is-avukat-izmir-tazminat-avukat

https://www.pearltrees.com/s/file/preview/330475173/zmir%20%20Avukat%20-%20zmir%20Tazminat%20Avukat.pdf?pearlId=622001833

https://www.emaze.com/@ALIOQTWFC/blank

https://jumpshare.com/v/t6UjIH1Pdx2lFjhdELa7

https://drive.proton.me/urls/EWWC19W13W#C2gt9dflspFU

https://www.edocr.com/v/dxeqlwj2/yusufeseryesil/izmir-is-avukati-izmir-tazminat-avukati

https://pdf.ac/1U1kPD

https://smallpdf.com/file#s=62861781-2fdb-4274-8e16-fac334930fe5

https://www.deviantart.com/stash/01khg6yev1mh

https://anyflip.com/mowkq/osda

https://www.opendrive.com/file/NDBfMTAzNjIwOTMzX0Y4a1ZG

https://e.pcloud.link/publink/show?code=XZag8gZD6xMtsAi0TyitMlflNczbYQL5b8X

https://online.pubhtml5.com/zjuob/zuer/

https://docdro.id/0KaWcdu

https://filetools7.pdf24.org/client.php?mode=inline&file=joinPdf_32cd029df41049b81cba48e04386bd7d_17002047999462811835.pdf&action=getFile

https://adanaweb.dropmark.com/1740546/34594504

https://pdfhost.io/v/cSMM565J._izmirisavukatiizmirtazminatavukatipdf

https://workdrive.zohopublic.eu/file/02xs8e0c45f811fc949bb9c374dfeab2647ad

https://drive.google.com/file/d/1yIL0XuHbdT4Se4j66uanxlztm99oUIWJ/view?usp=sharing

https://acrobat.adobe.com/id/urn:aaid:sc:AP:16e8841d-9eba-4c5d-8b88-798f4cd1db9f

https://issuu.com/adanaweb/docs/pol_health_care_-_implant_dentystyczny

https://www.dropbox.com/scl/fi/glgfu1y1llzfjadp5sogf/POL-Health-Care-implant-dentystyczny.pdf?rlkey=8bkscy97668f9670kferdy00p&st=f4zv79wy&dl=0

https://onedrive.live.com/?authkey=%21AMEfswwtdMlWDc4&id=14CCB181B8F90950%2112799&cid=14CCB181B8F90950&parId=root&parQt=sharedby&o=OneUp

https://www.slideshare.net/slideshow/pol-health-care-implant-dentystyczny/270280917

https://www.scribd.com/document/750992156/POL-Health-Care-Implant-Dentystyczny

https://www.4shared.com/s/fha0DACl0ge

https://www.canva.com/design/DAGLJ24IQOY/Uo9LauqolXcqyuxcvzcLoA/edit?utm_content=DAGLJ24IQOY&utm_campaign=designshare&utm_medium=link2&utm_source=sharebutton

https://www.academia.edu/122105608/POL_Health_Care_implant_dentystyczny?source=swp_share

https://www.calameo.com/read/005981559b76d7a29a2c6

https://app.box.com/s/oqajvqwaqo8zz9fkhnh7q4z948fu12be

https://www.yumpu.com/en/document/read/68754253/pol-health-care-implant-dentystyczny

https://www.pearltrees.com/s/file/preview/330475172/POL%20Health%20Care%20-%20implant%20dentystyczny.pdf?pearlId=622001834

https://www.emaze.com/@ALIOQTWCC/blank

https://jumpshare.com/v/5YGG33xiLihPtWiVP2tH

https://drive.proton.me/urls/4323M172GM#UXx6Jdl1zCtg

https://www.edocr.com/v/69mvqjm6/yusufeseryesil/pol-health-care-implant-dentystyczny

https://pdf.ac/sdNy2

https://smallpdf.com/file#s=667d93d7-735b-4768-82c6-63adf55a7ca0

https://www.deviantart.com/stash/0sx5wr3lbzt

https://anyflip.com/mowkq/ppdr/

https://www.opendrive.com/file/NDBfMTAzNjIwOTM0X3d4YUJz

https://e.pcloud.link/publink/show?code=XZ3g8gZPzHXLnnyGv7bokDME3ImPJYbsszk

https://online.pubhtml5.com/zjuob/jedm/

https://docdro.id/sWrvVUd

https://filetools24.pdf24.org/client.php?mode=inline&file=joinPdf_814d59b54ef615bf42e05abf20320dd0_13777635519465239466.pdf&action=getFile

https://adanaweb.dropmark.com/1740546/34594505

https://pdfhost.io/v/tz.5kNNem_polhealthcareimplantdentystycznypdf

https://workdrive.zohopublic.eu/file/02xs873ace18709b7402cbff1cea897af9bcd

https://drive.google.com/file/d/11SBZnkkLu9su3Yu5SJb-vlQARZZVOef_/view?usp=sharing

https://acrobat.adobe.com/id/urn:aaid:sc:AP:c73f969d-3fb2-4034-a873-55df54f9c101

https://issuu.com/adanaweb/docs/sanal_sunucu_-_wordpress_hosting

https://www.dropbox.com/scl/fi/4hbdlcp9wkztqo8xmo3rf/sanal-sunucu-wordpress-hosting.pdf?rlkey=9452htvzwhalqpuise9q2mbk0&st=y1tjtzd8&dl=0

https://onedrive.live.com/?id=14CCB181B8F90950!12798&resid=14CCB181B8F90950!12798&ithint=file%2cpdf&authkey=!AN_lEmMgAyD0F3o&cid=14ccb181b8f90950

https://www.slideshare.net/slideshow/sanal-sunucu-wordpress-hosting/270280918

https://www.scribd.com/document/750992154/Sanal-Sunucu-Wordpress-Hosting

https://www.4shared.com/s/fZ0rN33DRjq

https://www.canva.com/design/DAGLJ1oGncU/LkAvFwS6tDrfrHGTBZPi5Q/edit?utm_content=DAGLJ1oGncU&utm_campaign=designshare&utm_medium=link2&utm_source=sharebutton

https://www.academia.edu/122105625/Sanal_sunucu_wordpress_hosting?source=swp_share

https://www.calameo.com/read/005981559cffba3d56d01

https://app.box.com/s/x1670ouj4jiafk2c3whfgui1bpn0eihr

https://www.yumpu.com/tr/document/read/68754254/sanal-sunucu-wordpress-hosting

https://www.pearltrees.com/s/file/preview/330475171/sanal%20sunucu%20-%20wordpress%20hosting.pdf?pearlId=622001831

https://www.emaze.com/@ALIOQTWCT/blank

https://jumpshare.com/v/sykXfebsGMjwbHFQV1Z4

https://drive.proton.me/urls/M5BJ3MQGZ0#7Nq5SiwP0CDK

https://www.edocr.com/v/lprnprke/yusufeseryesil/sanal-sunucu-wordpress-hosting

https://pdf.ac/1umClv

https://smallpdf.com/file#s=d1746cd8-6440-401b-9b33-465fcb8cd6b6

https://www.deviantart.com/stash/01zm2tvtp01n

https://anyflip.com/mowkq/vpct/

https://www.opendrive.com/file/NDBfMTAzNjIwOTM2X1RMa1RO

https://e.pcloud.link/publink/show?code=XZGg8gZb54ALMDBDQLYlUiK3KS75pU53lxk

https://online.pubhtml5.com/zjuob/hhqx/

https://docdro.id/VwQQh4L

https://filetools13.pdf24.org/client.php?mode=inline&file=joinPdf_dff7ffc9d105d4f2b60cb881a755419b_12686048834983960351.pdf&action=getFile

https://adanaweb.dropmark.com/1740546/34594506

https://pdfhost.io/v/IfPlZjeIC_sanal_sunucu_wordpress_hosting

https://workdrive.zohopublic.eu/file/02xs87c88ca84fa1542d6a334770bf9bb4062

https://drive.google.com/file/d/1rSG8B5DgvOCb9w6lDKQ6k_0LB2ttZQpp/view?usp=sharing

https://acrobat.adobe.com/id/urn:aaid:sc:AP:bca52396-3194-4320-94b7-622aa47b1b44

https://issuu.com/adanaweb/docs/_nsped_m_avirlik_-_ugm_ihracat_g_mr_kleme

https://www.dropbox.com/scl/fi/3rjes3sinr76otxuccve7/nsped-m-avirlik-ugm-ihracat-g-mr-kleme.pdf?rlkey=6nyb5r1ncokxiv1pntjvmtrb2&st=lmbp0bd9&dl=0

https://1drv.ms/b/s!AlAJ-biBscwU5AGnkDc00_n1P2L7?e=KTGt6f

https://www.slideshare.net/slideshow/unsped-musavirlik-ugm-ihracat-gumrukleme/270280919

https://www.scribd.com/document/750992157/Unsped-Mu%C5%9Favirlik-Ugm-Ihracat-Gumrukleme

https://www.4shared.com/s/faZ2jgDg_ku

https://www.canva.com/design/DAGLJ9gMqFI/DDFzNOYr5bh4BkD-t7ARpg/edit?utm_content=DAGLJ9gMqFI&utm_campaign=designshare&utm_medium=link2&utm_source=sharebutton

https://www.academia.edu/122105665/%C3%9Cnsped_m%C3%BC%C5%9Favirlik_ugm_ihracat_g%C3%BCmr%C3%BCkleme?source=swp_share

https://www.calameo.com/read/0059815592a6e3ae59c97

https://app.box.com/s/j7axh2kv9deh244prccwwegtql2f00rw

https://www.yumpu.com/tr/document/read/68754258/unsped-musavirlik-ugm-ihracat-gumrukleme

https://www.pearltrees.com/s/file/preview/330475170/unsped%20muavirlik%20-%20ugm%20ihracat%20gumrukleme.pdf?pearlId=622001832

https://www.emaze.com/@ALIOQTWZO/blank

https://jumpshare.com/v/ncHg5qxyu7b7muABFIFY

https://drive.proton.me/urls/6FBYCWGANW#nweG9flw3zCg

https://www.edocr.com/v/vlex0qwg/yusufeseryesil/unsped-musavirlik-ugm-ihracat-gumrukleme

https://pdf.ac/3gGJX3

https://smallpdf.com/file#s=eb6c6886-2655-4618-96d6-955f5a4afcfd

https://www.deviantart.com/stash/01bra8f64wuf

https://anyflip.com/mowkq/hplo/

https://www.opendrive.com/file/NDBfMTAzNjIwOTM4X3BhZDNn

https://e.pcloud.link/publink/show?code=XZvg8gZKO76fvSU6Fz5GawfKborJVqCbWvV

https://online.pubhtml5.com/zjuob/yfhp/

https://docdro.id/zd6Przy

https://filetools28.pdf24.org/client.php?mode=inline&file=joinPdf_dedde9d5ec6d68d834e24f6d3f99ec83_10270886620757754220.pdf&action=getFile

https://adanaweb.dropmark.com/1740546/34594507

https://pdfhost.io/v/yrpCaJFrC_nsped_mavirlik_ugm_ihracat_gmrkleme

https://workdrive.zohopublic.eu/file/02xs8fd91fa05ec604efaad319632901b1219

Like

Guest
Jun 13

thanks for the info

Like

mikeakerson1321
May 24

Promotional codes should be entered at checkout. Only one promotional code may be entered in keeping with order. Discounts on eligible iHerb promo products could be applied in-cart. Certain excluded brands do no longer qualify for promotional reductions.

Like

edwinjubal1231
May 18

In the phase of deep meditation, sleep, or hypnosis, the dominant wave is the Theta one and scientists have concluded ozone therapy bali that this frequency has the ability to lower stress and anxiety, lead to deep relaxation, enhance the mental clarity and creativity, minimize ache, and increase euphoria.

Like
bottom of page