Category Archives: Resources

10 Voyant Tools Corpora of 9 MJP Magazines

I spent the day preparing Voyant Tools corpora for an in-class lab tomorrow. The following links lead to a chronological corpus of all 9 magazines currently offering TEI XML files in the MJPLab Sourceforge site. I also broke them down and offered individualized corpora by magazine to facilitate comparative analysis.

To make the datasets, I used a regular expression in TextWrangler to strip all the tags out of the XML files, and then used a command line script to batch rename them. The first attempt at the comprehensive corpus produced odd results because Voyant orders files alphabetically, so I manually renamed all 508 of them to place the publication date (yyyy-mm-dd) at the beginning of each file name, which keeps the materials in chronological order. The individual magazine corpora are already chronological because the volume and issue numbers were part of the naming convention first used by Mark Gaipa.
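For anyone who would rather script both steps, here is a minimal Python sketch of the same idea. It is not the TextWrangler and command line workflow described above, only a stand-in for it: the function names, the sample markup, and the file names are hypothetical, and a single regular expression is a crude way to remove tags (it does not handle comments or CDATA sections with any care).

```python
import re
from pathlib import Path

# Matches anything that looks like an XML tag, e.g. <p> or </hi>.
TAG_PATTERN = re.compile(r"<[^>]+>")


def strip_tags(xml_text: str) -> str:
    """Replace every tag with a space, then collapse runs of whitespace."""
    text = TAG_PATTERN.sub(" ", xml_text)
    return re.sub(r"\s+", " ", text).strip()


def dated_name(pub_date: str, original_name: str) -> str:
    """Prefix a file name with its yyyy-mm-dd publication date so that
    alphabetical order is also chronological order."""
    if not re.fullmatch(r"\d{4}-\d{2}-\d{2}", pub_date):
        raise ValueError(f"expected yyyy-mm-dd, got {pub_date!r}")
    return f"{pub_date}_{Path(original_name).stem}.txt"


# Hypothetical sample input, not taken from the MJP files.
sample = "<TEI><text><p>Blast <hi>first</hi> issue</p></text></TEI>"
print(strip_tags(sample))

# Hypothetical file names and dates; sorting now follows the dates.
names = [
    dated_name("1918-09-01", "LittleReview_05_05.xml"),
    dated_name("1914-03-01", "LittleReview_01_01.xml"),
]
print(sorted(names))
```

Because the date uses a fixed-width yyyy-mm-dd form, a plain alphabetical sort of the new names puts the issues in publication order.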

MJP Corpora at Voyant Tools

Reproductions of Magazines

As much as I love the Modernist Journals Project, I do like to assign hard copy reproductions of magazine issues (with advertisements!) to my students whenever possible. This gives them access to some magazines that are not available on the MJP, and it mimics more closely the experience of the original readers of the magazines.

Below are links to three reproductions of magazine issues that are easily available. I would love to hear of others, so if you know of any, please note them in the comments.

Blast 1


Survey Graphic, March 1925, the Harlem Number

Network Analysis, Text Mining, and Emergence in the September 1918 Little Review

(This piece is cross-posted at my course site:

This post is a shorter version of what will eventually become a longer piece about digital methods for the analysis and teaching of modernist magazines. By way of background, the occasion for this post was a workshop I did this week with two joint sessions of my graduate course in modernism and digital humanities and Sean Latham’s graduate course in modernism and new media. Sean’s course is interested in the ways in which modernist literature (and this week, magazines) embodies functionalities of 21st-century digital media (i.e. the “new” new media). They have been discussing N. Katherine Hayles’ concept of emergence, through an unpublished essay of Sean’s, “Unpacking My Digital Library: Programming Modernist Magazines,” forthcoming in Editing Modernisms in Canada, eds. Colin Hill and Dean Irvine.

Emergence describes “a particular kind of complexity that arises not from the individual elements of a system, but only from their interaction” (15). It emphasizes the interactive system of meaning that derives from the connections among various content items of a magazine, and only from those connections. In arriving at this sense of a dynamic readerly coherence, Sean uses Espen Aarseth’s concepts (from Cybertext) of “texton,” a string of information that exists in the text (a poem, an advertisement, a headline, etc.), and “scripton,” the idiosyncratic assemblage of textons by the reader.

Bound up in these concepts is the nature of magazine reading itself. Magazines are not codices. They are not books to be read serially from cover to cover (although a few of us 21st-century denizens who study magazines might admit with blushing cheeks that we do).

So, our workshops looked at social network analysis and text mining as ways of potentially identifying, or at the very least of recording and analyzing, scriptons that might emerge in the magazines.

We picked the September 1918 issue of The Little Review at the Modernist Journals Project (MJP) because it features one of the few direct references to the First World War, W.B. Yeats’ poem “In Memory of Robert Gregory,” an elegy for an Irish airman. The issue contains other items on the theme of death, including James Joyce’s “ULYSSES Episode VI,” later known as “Hades,” depicting the funeral of Paddy Dignam, and two short stories by Sherwood Anderson and Ben Hecht, respectively titled “Senility” and “Decay,” which immediately follow Joyce’s installment. Aside from these editorially juxtaposed pieces, there are numerous items of criticism or correspondence that rail against literary obsolescence as a kind of death, if not in so many words. For instance, it emerged in our discussion that Edgar Jepson’s essay “The Western School” talks about the deterioration of contemporary literary production in a way that is evocative of death and shares valences with the ways in which other pieces deal with death more explicitly. Most importantly, it was only through comparison with the other pieces dealing with death (which appear later in the issue) that we were able to read it in concert with the emergence of death at all.

Timeline of Student-Curated Data from MJP

For our workshop, I prepared a dataset from information my students have culled from the MJP. These data derive from an interactive timeline project that my students in periodical studies have done in four different courses at three different universities. The students curate content in the MJP by entering items and their bibliographic data into a shared Google Docs spreadsheet. The bibliographic data include author, genre, page numbers, and publication date, the latter of which places the item on the timeline. More importantly, students assign topic tags to each item in order to provide a sense of its meaning. In the timeline, readers can click on an item to view a description and link to its location in the MJP. The timeline is additionally surrounded by filters from the data types (author, genre, magazine, topic tag) that allow the reader to refine further her exploration of the data. The idea is that codifying the metadata into the timeline will allow for discoveries and provoke questions as more and more content is entered. This is one possible technology for uncovering and articulating emergence within and across magazines.

Using some (carefully massaged) data from the timeline, I made some CSV files to feed to Gephi for the generation of network graphs. While the timeline interface separates connections in time, and therefore also in space, Gephi presents them in a 2D, timeless space so that all are apparent. In the interest of transparency, I also retroactively added the Death tag and some other ones that reflect our discussion from the first day. These include Greatness and Mediocrity (among others), since we noticed that Yeats’ poem takes pains to point out how much Gregory had in fact not accomplished relative to his more prolific peers.
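As a rough illustration of what such a CSV might look like, the following Python sketch turns timeline-style rows into an edge list, linking each item title to its author, genre, and topic tags. The linking scheme, the field names, and the sample rows are my assumptions for the sake of the example, not the actual spreadsheet; the only thing I am relying on from Gephi is that its spreadsheet import reads an edge table with Source and Target columns.

```python
import csv
import io

# Illustrative rows only; the real spreadsheet has more fields and items.
SAMPLE_ITEMS = [
    {
        "title": "In Memory of Robert Gregory",
        "author": "W.B. Yeats",
        "genre": "Poem",
        "tags": ["Death", "Greatness", "Mediocrity"],
    },
    {
        "title": "Senility",
        "author": "Sherwood Anderson",
        "genre": "Short Story",
        "tags": ["Death"],
    },
]


def item_edges(item):
    """Link an item title to its author, its genre, and each topic tag."""
    title = item["title"]
    edges = [(title, item["author"]), (title, item["genre"])]
    edges += [(title, tag) for tag in item["tags"]]
    return edges


def edges_to_csv(items):
    """Return the edge list as CSV text with Source and Target columns."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["Source", "Target"])
    for item in items:
        writer.writerows(item_edges(item))
    return buffer.getvalue()


print(edges_to_csv(SAMPLE_ITEMS))
```

In this scheme two items end up close together in the graph whenever they share an author, a genre, or a tag, because both are linked to the same node.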

Network graph for the September 1918 Little Review, showing nodes for item title, author, genre, and topic tag.

So, with these connections in mind, we get this bibliographic and thematic overview of the September 1918 Little Review. The image uses the Fruchterman-Reingold layout algorithm (see here for information about Gephi layouts) to place the more highly connected nodes in the center, grouped by edge weight. That means the nodes that have more connections with each other will sit closer together on the page. One pattern that emerges is the relative distance of the genres Poem and Short Story. They appear on opposite sides of Death and, aside from that, have nothing in common thematically or in terms of contributors. In Gephi, one can mouse over a node in order to view its nearest neighbors (a degree of separation of 1) in the context of the larger graph (see here).

Ego network of Death, mousing over Short Story.

That effect becomes even more pronounced if we generate an ego network of Death, with the node sizes re-set for context, and mouse over different terms within it so as to highlight their immediate connections. The disparity between Poem and Short Story becomes even clearer. In this picture, which shows the micronetwork that emerges when mousing over the Short Story node, we see that the Short Story genre has virtually nothing in common with any other content.
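For readers who have not used Gephi, the two operations in play here (an ego network, and the nearest neighbors highlighted on mouse-over) can be sketched in a few lines of Python. This is a toy version under stated assumptions: the edges below are a handful of illustrative links rather than the workshop data, the function names are mine, and the depth setting used for the graphs in this post is not something I am reconstructing.

```python
from collections import defaultdict

# Toy undirected edges (item, attribute); illustrative, not the full dataset.
EDGES = [
    ("In Memory of Robert Gregory", "Death"),
    ("In Memory of Robert Gregory", "Poem"),
    ("In Memory of Robert Gregory", "Greatness"),
    ("Senility", "Death"),
    ("Senility", "Short Story"),
    ("Hammond advertisement", "Advertisement"),
    ("Hammond advertisement", "Greatness"),
]


def build_adjacency(edges):
    """Map each node to the set of nodes it shares an edge with."""
    adjacency = defaultdict(set)
    for a, b in edges:
        adjacency[a].add(b)
        adjacency[b].add(a)
    return adjacency


def ego_network(adjacency, ego, depth=1):
    """Return the ego node plus every node within `depth` hops of it."""
    seen = {ego}
    frontier = {ego}
    for _ in range(depth):
        reached = set()
        for node in frontier:
            reached |= adjacency.get(node, set())
        frontier = reached - seen
        seen |= frontier
    return seen


adjacency = build_adjacency(EDGES)
# Mousing over a node corresponds to looking up its direct neighbors.
print(sorted(adjacency["Short Story"]))
# The ego network of Death at one and two hops.
print(sorted(ego_network(adjacency, "Death", depth=1)))
print(sorted(ego_network(adjacency, "Death", depth=2)))
```

With these toy edges, the advertisement only enters the picture at three hops from Death, by way of the shared Greatness tag.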

Ego network of Death, mousing over Poem.

On the other hand, mousing over the Poem node reveals a much wider micronetwork that connects Yeats’ poem with several works by Eliot, Jepson’s essay, and the triad of Greatness, Mediocrity, and Irony. Why would Margaret Anderson and Jane Heap apparently use the short stories to carve out a separate space for the lowly and dissolute? Is it part of a strategy to explore different aspects of death and dying, using generic properties to present different facets?

Ego network of Death, mousing over Novel.

Interestingly, the Novel and Essay genres have nothing in common with Short Story but have several connections with Poem, particularly in the topics of Greatness, Poetry, and Art, and in featuring Irony as a device. This graph shows what happens when we mouse over Novel. It would seem that the short stories in this issue are more straightforwardly realistic about death and dissolution than their longer-form and poetic peers.

Overview of September 1918 Little Review with Yifan Hu Proportional layout.

The isolation of the Short Story group is even more pronounced if we change to the Yifan Hu Proportional layout algorithm, which calculates centrality and repulsion in such a way that clusters become apparent while emphasizing the differences as outlying branches (see here for more information about Gephi layouts). The bottom portion of this image shows the two clusters of the Hecht and Anderson stories as they attach to Death, while the rest of the field bears no relationship to them. Likewise, Ford Madox Hueffer’s installment of Women and Men bears little relationship to the rest of the issue, constituting an outlying branch of the Novel node not connected with Death. Conversely, Joyce’s installment of Ulysses is quite well integrated with other memes in the issue, while Hueffer’s is the only literary piece not connected with Death.

Overview of September 1918 Little Review, Yifan Hu Proportional layout, mousing over the Advertisement node.

It is interesting to note what else is not connected with Death: the advertisements at the back of the issue. One of them, for a Hammond typewriter, emphasizes the Greatness of Literature, as if the tool could somehow make the buyer a great writer (“No Other Typewriter Can Do This”), but obviously without the Ironic representation of Mediocrity that characterizes much of the actual literary content. The ad for Mason & Hamlin, “The Stradivarius of Pianos,” also emphasizes Greatness as a selling point. In thinking about these relationships, the ads seem to represent the lack of the Ironic insight into worthiness and Mediocrity that takes front and center in Yeats’ poetic argument. While we can’t know what the editorial intent was in placing these objects together (or if there even was one), what we can be sure of is that a system which enables readers to tag content semantically can help to provoke new questions that might be worth going back to the magazines to investigate.

I raise the latter issue about advertisements because they constitute a part of the emergence of Death that was not discussed in class. It occurred to me as I was massaging the spreadsheet to feed to Gephi and thinking about the pieces we had read. This would be an example of how collaborative markup, say of a small working group of scholars or even one comprising all the members of our field, can aid in the discovery of emergences utilizing artifacts we might not individually have noticed or thought of as relevant. This would be a reactive and not so much a predictive method, one that utilizes both the stable bibliographic data as well as the idiosyncratic scriptons assembled by readers.

I would like to suggest, though, that a predictive method might be found in text mining. Using the Voyeur Tools for analyzing corpora, we can see the chronological surges in word frequency over the entire Little Review corpus in the MJP. A spike in word frequency in a given issue might mean that we are more likely to generate scriptons related to that word. As a brief example, see this Voyeur corpus of The Little Review from its beginning in 1914 through the Winter 1922 number (use the gear cogs to apply the Taporware stop word list, and be sure to make the change globally).

Voyeur Tools Word Trends Visualization of The Little Review between March 1914 and Winter 1922.

After applying the stop word list (and making a few other manual removals), this picture shows the raw frequency trends of the top five recurring words in the currently available segment of The Little Review. The word life has the top overall frequency in the corpus, but its usage declines precipitously with the start of the First World War. What does the trend look like for the word death, and is there a significant pattern around the September 1918 issue, as above, or perhaps at different key moments in the War?
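To make "raw frequency trend" concrete, here is a minimal Python sketch that counts one word across a chronologically ordered set of texts after dropping stop words. It is not how Voyeur computes its graphs, only the simplest version of the idea: the tiny stop word set stands in for a real list, the tokenizer is deliberately naive, and the two sample "issues" are invented sentences, not text from The Little Review.

```python
import re
from collections import Counter

# A tiny stand-in for a real stop word list.
STOP_WORDS = {"the", "of", "and", "a", "to", "in", "is", "it", "that", "on"}


def tokenize(text):
    """Lowercase the text and split it into runs of letters and apostrophes."""
    return re.findall(r"[a-z']+", text.lower())


def raw_frequencies(text, stop_words=STOP_WORDS):
    """Count every token that is not a stop word."""
    return Counter(t for t in tokenize(text) if t not in stop_words)


def word_trend(word, issues):
    """Raw count of `word` per issue; `issues` is a chronologically
    ordered list of (label, text) pairs."""
    return [(label, raw_frequencies(text)[word]) for label, text in issues]


# Invented sample texts, labeled by hypothetical issue date.
ISSUES = [
    ("1914-03", "Life and art. Life is the art of life."),
    ("1918-09", "Death and the memory of death; life goes on."),
]

print(word_trend("life", ISSUES))
print(word_trend("death", ISSUES))
```

Raw counts like these favor longer issues, so a comparison across a whole run would usually also want a relative frequency (count divided by the number of tokens in the issue).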

Larger version of Little Review word trends.

A much bigger, live version of this graph allows us to gain more information by mousing over and clicking. The word art has a massive and unique surge in Volume 3, number 8 (January 1917). The reason for it is that Jane Heap took over as content editor for that issue with a round of essays on art and aesthetics. Although a 3- or 4-issue arc of the run bears a higher focus upon art, the word (and the subject?) drops almost completely for a bit and then returns to a normal pattern for the rest of the run, with some higher spikes later on. I will address this sort of method in more depth, looking specifically at how we can locate emergences in the issues to which the graph directs us.

BBC Announces Digital Archive of The Listener Magazine

It was just announced that the BBC is launching a digital archive of The Listener, its radio magazine that ran from 1929 to 1991.

The Listener not only published the BBC’s programming schedule and promoted upcoming radio content, but featured many writers from the Bloomsbury Set such as Virginia Woolf, E.M. Forster, and T.S. Eliot. It was also at the forefront of the popular science industry, explaining and promoting theories such as Einstein’s relativity, quantum mechanics, and wave-particle duality to a generally educated audience. The Listener, along with BBC pamphlets and other related ephemera, not to mention the BBC’s signal itself, had a wide reach into the European continent and had a broad impact on discourses in all areas of culture, both mainstream and avant-garde.

This resource will make it much easier to study the history and culture of the 1920s and 1930s, and beyond.

Caribbean Newspaper Digital Library

The Caribbean Newspaper Digital Library looks like an amazing resource for digitized periodicals from the Caribbean, including Cuba’s El Diario de la Marina (with issues from 1899) and Haiti’s literary journals La ronde and La nouvelle ronde (with issues dating from 1901). (Caveat: The quality of the digitization seems a bit mixed, and I’m not sure how searchable the issues are.)

Thanks to the Black Atlantic Resource Debate Blog, where I first learned of this periodicals resource. Some highlights of the digital library are listed here.

Database of Modernist Periodicals: Announcement and Question

I’m pleased to announce that I have begun planning a comprehensive database of modernist magazines to be called “The Database of Modernist Periodicals.” This database was inspired by Scholes and Wulfman’s important contribution to periodical studies, Modernism in the Magazines.

I will make a more detailed announcement this spring, but in the meantime, I can say that the database is designed to be a community undertaking. Much like TurboTax, the database will lead contributors through a series of questions in order to produce a bibliographically correct entry on any modernist magazine. As the database grows, we hope to implement network analysis tools to make it a robust teaching and research environment.

Later this spring, I will ask all of you to look over the draft document and make your own suggestions as to what YOU would like to see in the database.

Finally, I’m looking for a logo for this database. To start this project in a collaborative manner, I would like to ask you all to send me suggestions for “iconic” images of the modernist period published in magazines before 1923 (links to these images would be greatly appreciated).

I look forward to sharing more with all of you, and I wish you all the very best for this coming year.

American Periodicals from the Center for Research Libraries

Wondering if anyone else received the following announcement from ProQuest, which has partnered with the Center for Research Libraries. Comments on the periodicals offered?

Here’s the live link:


It’s here! American Periodicals from the Center for Research Libraries
Now you can expand your research capabilities with a new full-color, full-text online historical periodicals resource made possible via an innovative partnership between ProQuest and the Center for Research Libraries (CRL), a consortium of North American universities, colleges, and independent research libraries. This essential collection contains archival quality scans of journal content that can be cross-searched with leading ProQuest collections such as American Periodicals Series Online and ProQuest Historical Newspapers. Upon completion, American Periodicals from the Center for Research Libraries will contain three million pages that can illuminate your American history research. Click here to request a trial for your library.
