The Semantic Web, circa 1934

2 minute read

The Times has a great story today by Alex Wright on Paul Otlet’s early efforts to create a network of the information akin to today’s Web. In spite of bloviating along the lines of “The hyperlink is one of the most underappreciated inventions of the last century” (Kelvin Kelly, quoted for the article, apparently both asleep during the technology boom and never having read his own magazine, Wired), Wright’s piece treats Otlet’s work surprisingly fairly and is sensitive to the promise and limits of his analog approach. On the delivery side, Otlet imagined amalgamating the cutting-edge media technology of the day: telephone, radio, television. The glue for all this data would be the laborious human-directed cataloging and organization of information.

Of course there is a much longer history to the attempt to forge universal networks of information. To a historian of France, Diderot and D’Alembert’s Encyclopédie springs to mind. Spanning 28 volumes of text and plates, published over the course of two decades, and including nearly 80,000 entries, the Encyclopédie introduced readers to the cross-reference (the most underappreciated invention of the eighteenth century?) and also explicitly and implicitly connected them to the relevant texts of the day, either through cited references or outright plagiarism.

The success of the Encyclopédie stemmed as much from the print technology it exploited as from the extraordinary individuals who participated in the project. Over 140 individuals contributed articles. Some were experts in their fields, while others were generalists attempting to synthesize a wide range of knowledge. A single contributor, the chevalier de Jaucourt, produced over 17,000 articles, averaging over eight per day. Yet even in the eighteenth century, this massive endeavor could not keep pace with knowledge production. Wikipedia of course today brings a far larger population of contributors to bear, but it effectively frames the problem no differently, simply applying twentieth-century technology to an eighteenth-century problem.

With Diderot and D’Alembert’s Encyclopédie and Otlet’s Mundaneum, we get the sense of historical actors confronting a coming tsunami in human knowledge. Both the eighteenth century’s explosion in printing and literacy and the early twentieth century’s new media challenged existing taxonomies of knowledge. What’s missing from today’s efforts, hinted at by the Times piece, is the human element. The old Stanford-era Yahoo was limited but extremely useful because human beings created and populated the taxonomy by hand. Google is today almighty, but it’s essentially a dumb interface, and as the corpus of digital media continues to mushroom we’re as likely to be rickrolled or googlewhacked as find the information we seek. It remains to be seen to what extent machine learning and data mining can identify and weave together semantic meaning in digital media.

Share on

Twitter Facebook LinkedIn

More Hiring!

1 minute read

Over the past few months my new team has been modeling digital research methods into modular “workflows” that can be mixed and recombined in an open-source software workbench we have been prototyping at C²DH. We have made good progress with our small team of five and are now ready moving to the next stage of the project, where we will hire additional humanities researchers and software developers to continue prototyping and begin implementation. Today we’re announcing searches for two new developers to join us this fall. The new developer positions are roughly divided between front-end and back-end responsibilities. For the front-end developer, we’re looking for someone with extensive experience in JavaScript for software development (ideally including React). Because we’re designing...

Now Hiring!

1 minute read

In a few weeks I’ll begin a project – the Digital History Advanced Research Projects Accelerator – as part of my new position at the Luxembourg Centre for Contemporary and Digital History (C²DH). The Accelerator aims to create and promote software that capitalizes on digitalized historical research practices. More important than this software – and the key to its success – is the team I am currently recruiting to participate in its development and design. With the generous support of the Luxembourg National Research Fund and the University of Luxembourg, we’re running searches for a dozen new positions over the next year, including three postdoctoral researchers, three software developers, four fully-funded PhD positions, and administrative staff, and we also have...

Tropy

2 minute read

In late November 2010, on the last day of a research visit to Aix-en-Provence, I set down some thoughts on how historians’ practices had seemed to change overnight: what is the future of the archive? archival work now focused more narrowly, more intensely. but also potentially doesn’t provide enough time to get the feel for the archives and change direction. digital photography as a major change. archives now about raw collection, little or no feedback loop between what’s being observed and what comes next. requires a fundamentally different rhythm, one that i’m not yet comfortable with My reflections — which went on to include such uninspired predictions as “finding aids will soon all be online” — drew on my own...

Long-term Sustainability of PressForward

2 minute read

As this year’s Open Access Week winds down, I’m really pleased to share that the Alfred P. Sloan Foundation will generously fund a new, three-year phase of PressForward to ensure its long-term sustainability. The key deliverables of the grant are the launch of a dozen high-quality science publications and the continued refinement of the PressForward software to lower the barrier to entry for new research groups who want to create a collaborative publication. While “sustainability” is probably the most dreaded criterion of any grant application, it’s an area that I actually enjoy working on in its own right. At RRCHNM we’ve been fortunate to shepherd Zotero and Omeka through phases of grant funding that were explicitly intended to lay the...

Sean Takats

The Semantic Web, circa 1934

Share on

Leave a comment

You may also enjoy

More Hiring!

Now Hiring!

Tropy

Long-term Sustainability of PressForward