Category Archives: Read Hot and Digitized

Read, Hot, and Digitized: Avant-Gardes and Émigrés

Read, hot & digitized: Librarians and the digital scholarship they love — In this new series, librarians from UTL’s Arts, Humanities and Global Studies Engagement Team briefly present, explore and critique existing examples of digital scholarship.  Our hope is that these monthly reviews will inspire critical reflection of and future creative contributions to the growing fields of digital scholarship.

Avant-Gardes and Émigrés is a teaching, learning, and research initiative dedicated to the study of Russian and East European avant-gardists and émigrés in the twentieth century.
Avant-Gardes and Émigrés is a teaching, learning, and research initiative dedicated to the study of Russian and East European avant-gardists and émigrés in the twentieth century.

Avant-Gardes and Émigrés: Digital Humanities and Slavic Studies, based at Yale University, is a project that aims to develop a research initiative and prototype online environment dedicated to the study of Russian and East European avant-gardists and émigrés in the twentieth century. The project takes a number of different approaches, including topic modeling and network mapping, to explore the networks of avant-garde artists from the former Soviet Union.

One of the project’s goals is to reveal how North American academic departments in Slavic Studies have been shaped by emigration patterns of artists and intellectuals from Eastern Europe. The project uses varied approaches to explore how avant-gardists and émigrés shaped the reading practices, archival and library collections, and institutional formations of Slavic Studies as a field, and the intellectual landscape of American academia more broadly. The project also looks at how ideas from the Soviet Union influenced the dynamics of American culture during the Cold War through institutions, academic practices, theoretical approaches and methodologies, and cultural forums.

Visualization of ongoing research into the network of persons, academic institutions, publications, and archives in the Russian immigration to the United States.
Visualization of ongoing research into the network of persons, academic institutions, publications, and archives in the Russian immigration to the United States.

One of the project’s features is an interactive network map connecting émigré writers with publications and places they influenced. You can zoom in on specific regions of the map, click and drag individual elements, and click on particular nodes to see the network that the writer was connected with. The network map features universities’ departments, individual figures, and various publications, and provides an easy-to-use, visual overview of networks that would otherwise be difficult to describe.

Topic modeling algorithmically and iteratively examines the corpus of the journal Slavic Review.
Topic modeling algorithmically and iteratively examines the corpus of the journal Slavic Review.

A section of the project based around topic modeling is currently still in development, with the beta version available for viewing online. A topic model is a type of statistical model for discovering the abstract topics that occur in a collection of document, and draws from the fields of machine learning and natural language processing. The main section of this project is focused on topic modelling the Slavic Review, a major journal devoted to the study of Eastern Europe, Russia, the Caucasus, and Central Asia, but the site also features preliminary topic modeling of the Slavic & East European Journal and the Russian Review. The topic models allow users to navigate words from the publications that have been analyzed, and clicking on individual words brings up additional information about where the words show up in individual documents within their respective corpus.

The project serves as an important contribution to digital scholarship in the Slavic Studies field. Its varied approaches to visualizing and analyzing the networks it seeks to foreground provides a valuable and accessible window into these networks, making them visible in a way that is only possible through digital methodologies. I highly recommend looking through the other aspects of the project I didn’t cover, including the student contributions from the Brodsky Lab and Avant-Gardes and Emigres Digital Humanities Lab, to explore the subject matter more in-depth. I would also recommend looking through related materials in the UT Austin Libraries’ collections, including our holdings of the Slavic Review both online and in print.

 

Read, Hot, and Digitized: KITAB Project Brings Distant Reading to Middle Eastern Studies  

Read, hot & digitized: Librarians and the digital scholarship they love — In this new series, librarians from UTL’s Arts, Humanities and Global Studies Engagement Team briefly present, explore and critique existing examples of digital scholarship.  Our hope is that these monthly reviews will inspire critical reflection of and future creative contributions to the growing fields of digital scholarship.

The KITAB Project, headed by Sarah Bowen Savant of the Aga Khan University, seeks to develop tools and techniques for producing scholarship on text reuse and intellectual networks in the premodern Arabic textual tradition. The project is based on a digital corpus of published texts that represent all genres of writing in Arabic from the earliest works to the beginning of the 20th century CE. Although the corpus draws in part from digital databases of texts, it also relies heavily on digital surrogates of printed volumes which require Optical Character Recognition (OCR) for computational analysis. The KITAB project has partnered with the Open Islamicate Text Initiative to develop an OCR software that has proven more successful than commercially-available products. The collaboration’s published results of this OCR development—called Kraken—can be found here.

A snapshot of initial results using the Kraken OCR software
A snapshot of initial results using the Kraken OCR software

The KITAB project is noteworthy not only for bringing the concepts of text reuse and distant reading to Middle Eastern Studies from a digital humanities perspective, but also for its development of tools designed for Arabic script languages. The needs of right-to-left and non-Roman script languages such as Arabic, Persian, Ottoman Turkish, and Hebrew—namely bidirectionality and non-Roman script recognition capabilities—unfortunately have been neglected to date in key tools utilized by highly successful digital humanities projects. The KITAB project brings the necessity of right-to-left and non-Roman capabilities to the fore by centering the Arabic textual tradition and committing to the development of tools that best meet the needs of the questions asked.

In addition to Dr. Savant, the team behind the KITAB project includes scholars from the U.S. and Europe, notably David Smith (Northeastern University) who developed the passim software upon which the text reuse project is based, and Maxim Romanov (University of Vienna) who heads the Open Islamicate Text Initiative. The team supports the continuing evolution of algorithms that seek to determine which Arabic texts were most quoted, most used by historians, and most commented on over several centuries (roughly 700-1500 CE). These questions might be answered simply enough within one text with a full-text search engine. However, to answer these questions across the Arabic textual tradition requires not only a massive corpus (currently over 4200 items), but also incredible computing power.

The latest KITAB visualization of text reuse across two works attributed to Ibn Qutayba (d. 889 CE).
The latest KITAB visualization of text reuse across two works attributed to Ibn Qutayba (d. 889 CE).

I encourage readers to take a look at the latest text reuse visualization from the corpus, which is based on two works by Ibn Qutayba (d. 889 CE). I also suggest reading Dr. Savant’s critically reflective post on running the passim software across the entirety of the corpus, and the questions raised by the results about intertextuality and what text reuse means in the Arabic context. Lastly, I recommend that those interested and/or involved in the field review information on the KITAB Project’s corpus, including the FAQ links to the Open Islamicate Text Initiative for suggesting new digital titles and new titles requiring OCR. UT Libraries’ collection of historic Arabic texts is one of the largest in the United States and ripe with suggestions for the KITAB corpus (check out this Islamic Empire — History subject heading search to see a sample of UT’s rich Arabic collections).

 

Read, Hot, and Digitized: New Website Maps Discriminatory Redlining Practices

Read, hot & digitized: Librarians and the digital scholarship they love — In this new series, librarians from UTL’s Arts, Humanities and Global Studies Engagement Team briefly present, explore and critique existing examples of digital scholarship.  Our hope is that these monthly reviews will inspire critical reflection of and future creative contributions to the growing fields of digital scholarship.

Mapping Inequality: Redlining in New Deal America lets users visualize the maps of the Home Owners’ Loan Corporation (HOLC) on a scale that is unprecedented. The HOLC was created in 1933 to help citizens refinance home mortgages to prevent foreclosures. Directed by the Federal Home Loan Bank Board, the HOLC surveyed 239 cities and produced “residential security maps” that color-coded neighborhoods and metropolitan areas by credit worthiness and risk. These maps and the discriminatory practice they exemplified and enabled later came to be known as redlining.

Los Angeles redline map

If you zoom to Los Angeles, CA in Mapping Inequality (I recommend taking a moment to read the short introduction and how to) you will see the historic redline maps overlaid on a web-based map, a color-coded legend that describes areas from Best to Hazardous, and an information panel where you can immediately explore an overview and download raw data. Zoom in further, click a red section of the map, and the “area description” will load in the information panel. The initial view is curated and gives you an immediate impression of how these maps and accompanying documents perpetuated and institutionalized discrimination. You can also view the full demographic data and a scan of the original paperwork.

I encourage you to look at cities you are familiar with, it’s startling how the effects of these maps are apparent today. This is a work in progress so not every city surveyed by the HOLC is represented or complete.  Unfortunately, the accompanying documents for Austin are not available, but you can view the entire 1935 Austin map on the PCL Map Collection website. (You can also find a digitized reprint of the notorious Austin city plan from the 1920s at Texas ScholarWorks.)

1935 map of Austin, Texas, with redline demarcations.
1935 map of Austin, Texas, with redline demarcations.

I chose to highlight this mapping project because redlining maps are a critical example of the power of maps and this interface was beautifully constructed to illustrate their impact.

Mapping Inequality is part of American Panorama: An Atlas of United States History. While American Panorama is a project by the Digital Scholarship Lab at the University of Richmond, Mapping Inequality is a product of many collaborations. Participants from universities across the country worked on many aspects of the data collection and transcription and the Panorama toolkit, open source software used to create these maps, was developed by Stamen Design. I also recommend exploring the latest map added to American Panorama, Renewing Inequity: Urban Renewal, Family Displacements, and Race 1955-1966.