Author Topic: Truncated Public Domain  (Read 253 times)

0 Members and 1 Guest are viewing this topic.

Offline mac123

  • Newbie
  • *
  • Posts: 2
  • I love Boardnation!
    • View Profile
Truncated Public Domain
« on: June 19, 2009, 10:24:28 AM »
These problems are exacerbated by Google’s rather peculiar views on copyright. While taking an expansive view of copyright for recent works, it has taken a very narrow view about books that actually are in the public domain. As I have always understood it (and the U.S. Copyright Office confirms), “works by the U.S. government are not eligible for U.S. copyright protection.” But Google locks all government documents published after 1923 1922* behind the same wall as any other copyrighted work. Among other things, that locks up works that should be in the public domain, such as the AHA’s Annual Report (published by the Government Printing Office from 1890 to 1993) and circulars from the U.S. Bureau of Education. This problem is exacerbated by the often errant data about when these materials were published—which places these works even further beyond reach.

For more than a year now, Siva Vaidhyanathan, a cultural historian and media scholar at New York University, has been objecting that the rush to digitize is moving far in advance of considered thought. His concerns seemed rather abstract when I first heard them last year, but working with Google Books over the past few months made his objections seem much more tangible and worrying.

What particularly troubles me is the likelihood that these problems will just be compounded over time. From my own modest experience here at the AHA, I know how hard it is to go back and correct mistakes online when the imperative is always to move forward, to add content and inevitably pile more mistakes on top of the ones already buried one or two layers down. With Google adding in more than 3,000 new books each day, the growth in the number of mistakes seems that much higher.

The problem of quality control only exacerbates my most basic worry about the larger rush to digitize every scrap of information—that we are adding to the pile much faster than the technology can advance to extract the information in a useful or meaningful way. When I have asked people who know a lot more about the technology than me about this problem, they tend to wave their hand and mumble about “brilliant scientists” and “technological progress.” Forgive me if I remain unconvinced. Even for someone fairly proficient in Boolean search terms I find a lot of the results from Google Books (and Google more generally) just page after page of useless and irrelevant information. I find it increasingly hard to believe that Google can add tens of thousands of additional books each month to the information pile—many containing basic mistakes in content and metadata—and the information results will actually grow better over time.

So I have to ask, what’s the rush? In Google’s case the answer seems clear enough. Like any large corporation with a lot of excess cash the company seems bent on scooping up as much market share as possible, driving competition off the board and increasing the number of people seeing (and clicking on) its highly lucrative ads. But I am not sure why the rest of us should share the company’s sense of haste. Surely the libraries providing the content, and anyone else who cares about a rich digital environment, needs to worry about the potential costs of creating a “universal library” that is filled with mistakes and an impenetrable smog of information. Shouldn’t we ponder the costs to history if the real libraries take error-filled digital versions of particular books and bury the originals in a dark archive (or the dumpster)? And what is the cost to historical thinking if the only substantive information one can glean out of Google is precisely the kind of narrow facts and dates that make history classes such a bore? The future will be here soon enough. Shouldn’t we make sure we will be happy when we get there?



Offline kangyu

  • Newbie
  • *
  • Posts: 15
  • I love Boardnation!
    • View Profile
Re: Truncated Public Domain
« Reply #1 on: December 17, 2009, 08:19:18 PM »
These problems are exacerbated by Google’s rather peculiar views on copyright. While taking an expansive view of copyright for recent works, it has taken a very narrow view about books that actually are in the public domain. As I have always understood it (and the U.S. Copyright Office confirms), “works by the U.S. government are not eligible for U.S. copyright protection.” But Google locks all government documents published after 1923 1922* behind the same wall as any other copyrighted work. Among other things, that locks up works that should be in the public domain, such as the AHA’s Annual Report (published by the Government Printing Office from 1890 to 1993) and circulars from the U.S. Bureau of Education. This problem is exacerbated by the often errant data about when these materials were published—which places these works even further beyond reach.

For more than a year now, Siva Vaidhyanathan, a cultural historian and media scholar at New York University, has been objecting that the rush to digitize is moving far in advance of considered thought. His concerns seemed rather abstract when I first heard them last year, but working with Google Books over the past few months made his objections seem much more tangible and worrying.

What particularly troubles me is the likelihood that these problems will just be compounded over time. From my own modest experience here at the AHA, I know how hard it is to go back and correct mistakes online when the imperative is always to move forward, to add content and inevitably pile more mistakes on top of the ones already buried one or two layers down. With Google adding in more than 3,000 new books each day, the growth in the number of mistakes seems that much higher.

The problem of quality control only exacerbates my most basic worry about the larger rush to digitize every scrap of information—that we are adding to the pile much faster than the technology can advance to extract the information in a useful or meaningful way. When I have asked people who know a lot more about the technology than me about this problem, they tend to wave their hand and mumble about “brilliant scientists” and “technological progress.” Forgive me if I remain unconvinced. Even for someone fairly proficient in Boolean search terms I find a lot of the results from Google Books (and Google more generally) just page after page of useless and irrelevant information. I find it increasingly hard to believe that Google can add tens of thousands of additional books each month to the information pile—many containing basic mistakes in content and metadata—and the information results will actually grow better over time.

So I have to ask, what’s the rush? In Google’s case the answer seems clear enough. Like any large corporation with a lot of excess cash the company seems bent on new york asian escort scooping up as much market share as possible, driving new york escort competition off the board and increasing the number of people seeing (and clicking on) its highly lucrative ads. But I am not sure why the rest of us should share the company’s sense of haste. Surely the libraries providing the content, and anyone else who cares about a rich new york asian escorts digital environment, needs to worry about the potential costs of creating a “universal library” that is filled with mistakes and an impenetrable smog of information. Shouldn’t we ponder the costs to history if the real libraries take error-filled digital versions of particular books and new york escorts bury the originals in a dark archive (or the dumpster)? And what is the cost to historical thinking if the only substantive information one can glean out of Google is precisely the kind of narrow facts and dates that make history classes such a bore? The future will be here soon enough. Shouldn’t we make sure we will be happy when we get there?