Studying the Reader/Researcher without the Artifact: Digital Problems in the Future History of Books
Warner, Dorothy, Buschman, John, Library Philosophy and Practice
It is salient to begin this article with some examples of fertile and groundbreaking study emanating from the history of the book, reading, and publishing:
* Robert Darnton brilliantly re-constructed the world-view of 18th-century French society from the ground up in his book The Great Cat Massacre. He did so by reinterpreting odd and rare documents such as a printing society's wage book, a semi-fictional autobiography of a printshop worker, and an odd, obsessively complete "inventory" of the city of Montpellier.
* Justin Kaplan's notes in his Library of America edition of Whitman's Leaves of Grass list eight different editions Whitman produced and edited, the first consisting of twelve poems and a preface, others expanding to four times the length, and then contracting again. Like all of Whitman's later compilers and editors, Kaplan had to weigh the injunctions the author declared at various times about the various editions in order to arrive at a complete or definitive edition.
* Wayne Wiegand has studied odd documents of library history like Library Bureau accession/de-accessioning books used in most small American public libraries to record the acquisition of books. Wiegand productively studied the censorship of controversial materials in some of those libraries over a 66-year period using these records. 
* Jonathan Katz and Martin Duberman are scholars who have researched and documented the history of the gay experience in America. Over the course of 25 years, they have examined previously unpublished and overlooked documents discovered through various means: by communication with gay people; by following up on rumor and vaguely remembered diaries and papers; by following obscure trails left in footnotes, much of which was located in privately-owned and only-recently gathered library archival collections.
What do these examples have in common? They represent important and interesting work that could be accomplished because the documents and the publications exist, and they exist primarily because they were printed and reprinted, simply kept somewhere, preserved and archived. The study of reading, books, book production, editing, and the research process posits a very simple assumption: that which has been read, edited, absorbed, used and studied will still exist as an artifact. As Ronald Schuchard wrote, "what interests the scholar ... in the archive [is] the preservation and accessibility of the materials of the creative imagination, the physical materials, including all the detritus, debris, and ephemera of art, biography, history. And the archival preservation of these materials is crucial for the minor as for the major figures of a literary generation" --the very authors, as Michael Winship points out, that most people read the most, after all.
However, the trend toward digitization, promoted by those who want information available instantly and in a "more accessible" format, poses a very fundamental challenge to the essential assumption that those items will exist in future. The dramatic move to exclusive web-distribution of federal and state government information and data in the United States is a good case study of this problem. Essentially, this project has been undertaken without planning or budgeting for archived, permanent and secure (that is, unaltered) access. A front page story in the New York Times detailed the digitization project in the US Patent Office of 18th and 19th century patents--and the discarding of the original documents. One person did some dumpster diving outside the Office and came up with four original application copies of some of Thomas Edison's patents. Much of the newly-digitized data is the raw material for scholars in such far-flung subjects as law, the environment, education, demography, and of course economics and business. Data and documents are in danger not only in governmental sources, but in private databases as well. …