Data Pipelines, Data Fluidity: Colin Allen on the “Useful Fiction” of Curated Data
Colin Allen, distinguished professor in the Department of History and Philosophy of Science at the University of Pittsburgh, is both an invited speaker and an ongoing participant in our Seminar. On February 28th, Dr. Allen talked with his fellow participants about his work on what he (and others) call "data pipelines." Broadly speaking, a data pipeline means that data are collected and recorded in one of many particular ways but are eventually used for purposes other than those for which they were originally collected. This means, Dr. Allen pointed out, that data are highly fluid, flexible, and even self-perpetuating. An especially potent example in Allen's own work is his current role as Associate Editor of the Stanford Encyclopedia of Philosophy. While this project has one discrete start date, back in 1995, it has been anything but static since then; as of March 2018, the site has approximately 1,600 entries, each of which is routinely reviewed and updated. Each new entry adds to what is now a highly dynamic reference work containing data culled from all over the web: a pipeline, indeed. Dr. Allen thoughtfully pointed out that as our relationship to data changes over our collective futures, it is important to remember that data do not enter our world on their own; rather, they are collected and curated.

Allen co-authored an article, "Exploration and Exploitation of Victorian Science in Darwin's Reading Notebooks," with Jaimie Murdock and Simon DeDeo in 2017. Charles Darwin left careful records of the books he read from 1837 to 1860, making this …