For almost three hundred years, the Spanish monarchs ruled over an expansive empire stretching from the Caribbean to the southernmost tip of South America. World history narratives situate Spain within a centuries-long clash between major powers over territory, resources, and authority in the Americas that ended with the wars of independence. However, these histories tend to devote less attention to the day-to-day processes that sustained imperial rule. My dissertation explores this question through an analysis of the underlying mechanisms that bound the people to their faraway king. A LLILAS Benson Digital Humanities Summer Fellowship helped me to create an online exhibition that demonstrates what the bureaucracy of empire looked like on the ground. (Visit the Spanish version of the exhibition.)
This interactive website serves as an interface with a section of the vast holdings of the Benson Latin American Collection: the Genaro García Collection. Through the exhibition, teachers, students, and community members can explore the events that unfolded when the king ordered a visita—or royal inspection—for New Spain (roughly, modern Mexico) in 1765. The inspection allowed the monarch to keep up to date on local happenings while also identifying areas that could be reorganized. This visita involved approximately seven years of examinations and reforms carried out through a cooperation between the monarch’s appointed visitador—or inspector—and local government workers.
The website offers high-resolution images of the thirty documents from the Genaro García Collection that pertain to this procedure, in addition to brief content descriptions, full transcriptions, information on the individuals involved, and maps of prominent regions mentioned in the sources. All of this information appears in an interactive timeline so that users can experience the process of bureaucracy at work.
This project benefited from the use of several digital humanities tools, including TimelineJS, FromthePage, and Transkribus. TimelineJS allowed for the creation of an interactive chronology containing the step-by-step process that the visitador followed as he inspected and reorganized the government of New Spain. For users looking to examine the documents beyond the site’s overviews, FromthePage and Transkribus generated full transcriptions of the sources.
These texts provide opportunities for further exploration, such as data analysis. For example, by feeding the transcriptions into the Voyant Tools website, I was able to generate a word cloud of the most commonly appearing words and phrases in the documents.
The Benson Latin American Collection holds documents covering many regions of the Spanish world across the sixteenth through the twenty-first centuries. During this time, Spain’s hold over its American territories required the constant interaction between royal officials and local populations, and that crossover was often messy. The 1765 visita of New Spain sheds light on the complexities of this process. My hope is that this online exhibition will expand the ways in which people can interact with these sources without having to visit the University of Texas campus in person, and learn from them about the day-to-day experience of imperial management.
Brittany Erwin is a PhD candidate in history. She was a LLILAS Benson Digital Humanities Summer Fellow in 2020.
Game-changing innovations that use artificial intelligence (AI) tools will improve access to Indigenous and Spanish colonial archives. “Unlocking the Colonial Archive: Harnessing Artificial Intelligence for Indigenous and Spanish American Historical Collections” is a collaborative project led by LLILAS Benson Latin American Studies and Collections at The University of Texas at Austin, the Digital Humanities Hub at Lancaster University, and Liverpool John Moores University. The project will transform “unreadable” digitized Indigenous and Spanish colonial archives into data that will be accessible to a broad spectrum of researchers and the public.
The project will be funded by a $150,000 collaborative grant from the National Endowment for the Humanities (NEH) as well as €250,000 (approx. US$304,000) from the UK’s Arts and Humanities Research Council (AHRC) through the joint New Directions for Digital Scholarship in Cultural Institutions program. Kelly McDonough, associate professor in the Department of Spanish and Portuguese, and Albert A. Palacios, digital scholarship coordinator at LLILAS Benson, will manage the project at UT Austin.
The Benson Latin American Collection at The University of Texas at Austin possesses one of the world’s foremost collections of colonial documents in Spanish and Indigenous languages of Latin America. Yet even when digitized, such documents are often neither searchable nor readable because of calligraphy, orthography, and the written language of the document itself. In tackling this problem, the collaborators propose to employ and develop interdisciplinary data science methods with three goals in mind: to expedite the transcription of documents using cutting-edge Handwritten Text Recognition technology; to automate the identification and linking of information through standardized vocabulary ontologies using Linked Open Data and Natural Language Processing techniques; and to facilitate the automated search and analysis of pictorial elements through Image Processing approaches.
The research will be based on three digital collections under the aegis of LLILAS Benson and one from the National Archive of Mexico. The LLILAS Benson collections are digitized Benson Collection colonial holdings, including the Relaciones Geográficas, 16th-century painted written and pictorial documents describing the geography and peoples of New Spain; the Royal Archive of Cholula at the Archivo Judicial del Estado de Puebla (Mexico), which was digitized through a Mellon-funded post-custodial grant; and the Primeros Libros de las Américas, a digitized collection of books published in the Americas before 1601.
McDonough and Palacios say that the project will further colonial Latin American studies not only at UT, but beyond, significantly facilitating the discoverability and interpretation of these materials. “While the work will begin with collections at the Benson and its Latin American partners, the technology developed will be accessible to libraries and archives worldwide, who can use it to automatically transcribe their digitized manuscripts,” Palacios said. In addition, “through the public workshops that are part of this project, we will train humanists on new innovative approaches that leverage the potential of machine learning to facilitate research,” McDonough added.
The geographical diversity among the project’s leadership and collaborators reenforce its global reach. The PIs are McDonough and Palacios of UT Austin, Patricia Murrieta-Flores of Lancaster University (UK), and Javier Pereda Campillo of Liverpool John Moores University (UK). Other collaborators hail from Germany, Mexico, Poland, Portugal, Spain, and Switzerland. Among the numerous participants from Mexico is Lidia García Gómez, history professor at the Benemérita Universidad Autónoma de Puebla, who was involved with the digitization of the Royal Archive of Cholula.
For more information: Susanna Sharpe, Communications Coordinator, LLILAS Benson, The University of Texas at Austin
Read, hot & digitized: Librarians and the digital scholarship they love — In this new series, librarians from UTL’s Arts, Humanities and Global Studies Engagement Team briefly present, explore and critique existing examples of digital scholarship. Our hope is that these monthly reviews will inspire critical reflection of and future creative contributions to the growing fields of digital scholarship.
A twenty-two-year program that began during World War II and is still relevant nearly sixty years after its conclusion in 1964, the Bracero Program was an agreement between the U.S. and Mexican governments to permit short-term Mexican laborers to work in the United States.
In an effort to stem labor shortages during and after the war years, an estimated 4.6 million workers came to the USA with the promise of thirty cents per hour and “humane treatment.” Of course, we know that loosely defined terms like “humane treatment” present a slippery slope that can erase and omit stories. Fortunately, through the collaborative efforts of the Roy Rosenzweig Center for History and New Media, George Mason University, the Smithsonian National Museum of American History, Brown University, and the University of Texas at El Paso’s Institute of Oral History, many of those once-hidden stories have been preserved and made accessible through the Bracero History Archive (BHA).
The BHA offers a variety of materials, most notably over 700 oral histories recorded in English and Spanish. While the metadata fields for each oral history could be more robust, the ability to hear first-hand accounts and inter-generational stories is a dream come true for primary source-seekers. All audio is available to download in mp3 format for future use.
Apart from oral histories, other resources are also available. Images, such as photographs and postcards, provide visuals of the varied environments that hosted the Braceros as well as portraits of the Braceros themselves.
Again, further detail on these resources would benefit the archive. For example, the photograph above, titled “Two Men,” demonstrates a lack of context needed for a more profound understanding while also acknowledging the potentially constant transient nature of Bracero work. In fact, the very word bracero, derived from the Spanish word for “arm,” is indicative of the commodification and dehumanization of the human body for labor. Workers lived in subpar work camps, received threats of deportation, and lacked proper nourishment, especially given the arduous work conditions.
Additional BHA resources include a “documents” section in which offspring share anecdotes about the Bracero Program and track down information about loved ones. Finally, the site offers resources for middle school and high school teachers to use in their curriculum. Here again is an opportunity to further build out the site for university-level instruction.
The digital objects in the BHA are worthwhile for those looking to recover an often-overlooked subject in American history that still resonates with themes relating to immigration today. Indeed, farmworkers continue to be exploited and underappreciated despite their contributions to society. This has led to a number of movements, marches, and boycotts in efforts to improve living conditions and wages.
If there’s a single lesson to take away from this year, it’s that libraries are a lot more malleable than their long history may have given them credit for.
We’ve previously covered the Herculean effort by University of Texas Libraries’ staff to pivot from their natural in-person work environs to a distance service, then a subsequent limited return to the former, but a lot of that agility was due in no small measure to underlying efforts that were already underway when the health crisis washed over campus and the country.
Strategically, this institution has been focusing on the idea of the library as a platform: not just a storehouse for books or website of searchable journals, but an active ecosystem where resources, tools, services, spaces, expertise and community intermingle with a constantly variable presence of users to spin off scholarship and innovation back into the world. This idea factory of ever-evolving components works at its best when it creates opportunities for discovery through constant interaction of the various parts.
With the pandemic creating greater physical distance between the parts, though, it’s become essential that we focus on those tools that could best allow us to reach our users where they are, be that in an apartment in West Campus, or on the other side of the globe.
Last year, we announced the launch of a pair of systems designed to organize, preserve and create accessibility for digital iterations of physical materials that otherwise would only be available to people who could visit the Forty Acres. Our Digital Asset Management System (DAMS) was deployed in September, 2019, and in November, we published the Collections Portal on the Libraries’ website. The culmination of these two projects proved to be far more fortuitous than we could’ve imagined.
A couple months later as leadership at the Libraries was fleshing out a new strategic plan that placed special emphasis on the concept of Libraries as platform, the first case of coronavirus was discovered in the Pacific Northwest. Then, in March as the spread of the pandemic began to accelerate, The University of Texas at Austin announced first the delay of spring classes, followed quickly by a directive to move all but the most critical staff to remote work away from campus, and to shift to online learning for the remainder of the semester.
More than ever, the adaptability of the Libraries to changes in user behaviors was the institutional characteristic that needed to be positioned in response to the extraordinary situation that fell so quickly upon us all. And refocusing our collective energies on tools with the greatest potential to serve the largest number of people while considering the long-term goals of the Libraries made these new systems a natural priority for applying institutional resources.
At its most basic, a Digital Asset Management System is a locally-developed digital repository designed to store, describe and manage digital assets of the Libraries. Digital assets are comprised of a primary digital files like scanned images, book pages, audio or video recordings, with varying component parts: metadata, or data about the data that includes information about the origin of the file, specifications and descriptive data used for locating the asset; additional secondary files that can be machine-readable and/or provide additional technical information; and derivatives, such as thumbnail images, other file versions, and PDFs.
The DAMS serves as the central preservation and management hub for Libraries’ digital assets, built by the Libraries Information Technology Support (LITS) team in coordination with staff library professionals, who also manage the operations of the system. The DAMS project began in 2016, and in an effort to prioritize two of our most notable collections, staff at the Benson Latin American Collection and the Alexander Architectural Archive began preparing digital collections for the system.
“The digital asset management system was many years in the making,” says Jennifer Lee, Director of Discovery and Access. “And for many, many years before that it was just an idea, like an item on a collective wish list. Now, it’s become a reality. And over the past seven months in particular, we’ve made excellent progress on adding content.”
The Collections Portal
The Collections Portal serves as an access point on the Libraries’ website allowing users to undertake remote research and study utilizing rich resources that have previously only been available in person or through more time-intensive digitization on demand processes.
Developed in 2018-19 by LITS in close coordination with other Libraries professional staff as a logical progression from the DAMS, the Portal provides students, faculty, researchers and the broader public access to collections that have not been directly available in the past, and the project’s infrastructure creates a framework for a more consistent stream of new digital content in the future. Each item in the portal also contains contextual data – drawn from the DAMS – in order that users may learn underlying information about the material, locate physical counterparts and determine reuse rights for digital files.
The relationship between the DAMS and the Portal can create confusion since both systems deal with the same assets, but it’s useful to think about the interrelationship between the parts. The DAMS is the back-end storage and management environment, where preservation, description and accessibility of the resources are controlled. The Collections Portal draws on the information contained within the DAMS to make some of the content that exists there discoverable and accessible for remote use through a public web interface. The dual structure allows for our staff to determine what is suitable for partial or full public access based on issues like copyright or embargo status.
“These two are separate but closely connected software systems,” explains Mirko Hanke, Digital Asset Management System Coordinator, who has been one of the driving forces behind efforts to refine and build out the systems. “This overall architecture of having two separate systems allows the curators to choose which of the content they’re managing in the DAMS they want to make publicly available.”
Both systems were implemented by LITS staff using open source software components and they built software to bridge the two systems from scratch.
The basic workflow for getting items from the shelves into the systems involves digitization, file management, metadata creation and ingestion.
The Libraries has been digitizing physical materials for decades, including thousands of items that were digitized previous to the development of the DAMS, and those files can be retrieved and processed for inclusion in the new systems. Accessing the digital forms of materials can extend the life of fragile special collections and makes near-immediate global access possible. Physical materials are often reformatted as digital files in their entirety to minimize handling and ensure future access to unrequested sections at a later date. Additional processes in digitization allow for the enhancement of usability of the digital iterations, as well, including optical character recognition, making scanned documents searchable and information contained within more easily findable. The automation of many digitization processes makes pagination and file structuring more manageable and speeds up ingestion and thus accessibility of content.
Requests for digitization are made either through a formal submission or directly to Libraries’ Digitization Services, with special priority given to our two notable special collections – the Benson Latin American Collection and the Alexander Architectural Archive – both of which are heavily used by the public and thus have significant back catalogs of digitized materials, making them fertile resources for populating the DAMS and Collections Portal. Special consideration has also been extended to time-sensitive projects, such as those slated for exhibition loan or items that are being or have been retired from other access points.
Once files have been digitized, they are passed through specialized workflows based on the type of content and its historical origin that add and/or enhance metadata, secondary files and derivatives to create singular digital assets that can then be ingested into the DAMS and potentially projected out to the Collections Portal.
Staff professionals working with LITS professionals have developed scripts and processes that can help to speed up the packaging of digital assets both for newly digitized items, but also from previously digitized materials that exist from earlier Libraries efforts. There is ongoing work to track digitization, management and ingestion processes to create ongoing improvements to the workflows.
Hitting the Gas
Realizing the important potential of the two systems for remote users in response to the health crisis, the Libraries reconfigured workflows and redirected staff to accelerate work already occurring to populate and invigorate the DAMS and by extension, the Collections Portal. The first order of business was to formalize workflows to prioritize the digitization and processing of materials.
Resources at the Benson and Alexander Archive proved to be low-hanging fruit for their outsized use in research and because of existing expertise in digital preservation, so projects originating from those collections received significant attention.
Staff at the Benson Latin American Collection have been working on a project to digitize the Genaro García Collection – the Benson’s massive foundational collection, acquired in Mexico City in 1921 by university representatives on a diplomatic visit. The Libraries will next year be celebrating the 100th anniversary of that acquisition as the establishment of Latin American collections on campus, so the effort to provide online access to this important collection made it a priority for addition to the Collections Portal.
“Because we’ve established some good local practices for collection creation and we have a set of well documented requirements on the DAMS ingest side, it becomes much easier to develop batch processing workflows to prepare scans and metadata for upload into the DAMS without manipulating each collection object, one at a time,” says David Bliss, Digital Processing Archivist at the Benson Latin American Collection.
A team-based approach was coordinated by Latin American Archivist Dylan Joy. Staff Photographer and Library Specialist Robert Esparza spent several months carefully digitizing the Genaro García Imprints and Images collections in their entirety, following a process developed locally at the Benson. Concurrently, GRA Diego Godoy compiled item level metadata based on a template developed by Metadata Librarian Itza Carbajal. Bliss then worked to develop a script for ingesting the scans and accompanying metadata from the collection into the DAMS, bypassing hours of monotonous and error-prone work in favor of a process using existing metadata in a hands-off approach that occurs in minutes instead.
“We didn’t just wake up one day and decide to make our file naming practices more consistent and systematic or suddenly realize that we should be gathering good metadata,” says Bliss. “This kind of scripting work is only possible because significant resources were dedicated to equipment and project staff.”
Benson staff, in coordination with Libraries’ Content Management and Digitization Services teams, have worked prodigiously on the Benson Rare Book Collection, including the high visibility Primeros Libros – the first books published in the Americas prior to 1600; so far, 21 full volumes are published to the Collections Portal, with more in process. Libraries Technology Coordinator Benn Chang worked with Benson Latinx Studies Archivist Carla Alvarez to make newly available several hundred previously digitally-preserved photographs in the George I. Sánchez papers, which are now part of the Collections Portal, as well.
“This work really does take a village and there is no one singular workflow or approach that suits all collections,” says Benson’s Head of Digital Initiatives Theresa Polk.
At the Alexander Architectural Archive, staff have been working to process both newly-digitized and legacy digital assets. “Architectural collections staff have worked closely with Digitization Services to adjust our workflow to include ingesting assets and metadata into the DAMS,” says Archivist for Access and Preservation Stephanie Tiedeken. So far, over 21,000 assets have been ingested into the DAMS from the Alexander Archives and Architecture & Planning Library’s Special Collections, and over 2,000 of those have been published into the Collections Portal, including 270 publications and over 1,800 digitized drawings or photographs.
Archive staff are also working to move legacy assets into the DAMS. The Alexander’s GRA, Alyssa Anderson, recently completed a project to ingest 262 legacy images of scanned drawings and photographs from ten sites, primarily missions, in Texas and Mexico images and create MODS metadata. Now that these items are in the DAMS, they are more usable and visible to researchers.
Head of Architectural Collections Katie Pierce Meyer worked with Mirko Hanke and staff from Digitization Services to develop a process for ingesting legacy digitized photographs from the David Reichard Williams collection, a regionalist and architect who documented vernacular architecture in Texas in the 1920s and 1930s. Colleagues from Libraries’ Branch and Borrow Services transferred data from finding aid, added descriptions of photographs, bringing expertise and fresh eyes to these historic images of buildings and places across the state.
Building on transformation processes and documentation work previously done by David Bliss and Benn Chang, and working closely with Mirko Hanke, Pierce Meyer was able to take the data, map it to DAMS metadata fields in the data editing tool OpenRefine, then export it and create individual metadata files for each image. The image and the metadata files could then ingested and published in large batches.
After materials were ingested from the David Reichard Williams photography collection at the Alexander Archive and became available via the Collection Portal, colleagues in Content Management conducted quality assurance on the ingested data and enhanced the metadata. Finally, Alexander Architectural Archives’ Curator Beth Dodd introduced these published assets to historic preservation professionals and donors to the Alexander Archives, who provided additional information to further describe and enhance information about the buildings in the photographs. Over the course of the project, the crowdsourced assistance of many participants have been instrumental to ingesting assets and enhance the metadata, making for a more robust and discoverable resource for future researchers.
“The Williams project has been a particular example of a collaborative, iterative process to transfer our legacy assets to the DAMS and publish them to the collections portal. It has also been a great learning opportunity and we are taking what we have done here to inform future collaborative work with our collections and metadata transformation” says Katie Pierce Meyer.
Another extremely visible digital collection has also played a significant role in the growth of DAMS and Collections Portal content. The PCL Maps Collection – which is celebrating its 25th anniversary this year – is perhaps the most heavily used of our collection, largely due to the 70,000 items that are available through the Libraries’ legacy website. Visitation to the online maps has accounted for over 50% of all Libraries’ web traffic at points, and has exceeded 5 million views with consistent frequency. The Libraries’ launched a new website in 2018, and have begun to migrate the Maps Collection into the DAMS where it will be available through the Collections Portal. The legacy website remains active largely to maintain access to the collection, so ingesting the digital content from the Maps Collections is another high priority for the overall project.
The migration of the collection into the DAMS is providing the opportunity to greatly improve upon the associated metadata and, in some cases, to provide even higher quality digital scans for use by researchers. “In the DAMS we can store and serve larger format images, which is a great improvement and there are established organization standards, where the legacy site grew organically from its early adoption roots,” says Maps Collection Coordinator Kat Strickland. “Many of the maps in the collection have made their way here without any context. So being able to show somebody the image and describe with more robust metadata is also going to improve discoverability for people.”
“The DAMS is going to benefit users because collections can be organized in a way that will help users find the context of individual maps by linking to a subcollection of related maps.”
When the university shuttered operations in March and physical access to the Maps Collection was halted, only 77 items had been migrated to the DAMS. A short seven months later, there are over 14,000 maps in the system and Libraries’ staff are currently working on metadata for another 11,600 to make those available.
That experience mirrors the shift in focus since remote work has become the prevailing mode of service at the Libraries and online content has become the primary resources for users. In March, there were approximately 2,500 digital assets available through the Collections Portal. Today, there are over 20,000 assets available through the Collections Portal, and those numbers are expanding apace as more resources are committed to the work and staff adapt innovative approaches to their processes.
“There’s been an eightfold increase in content since March, which is just amazing progress and wouldn’t have been possible without the support of many colleagues,” says Mirko Hanke.
Read, hot & digitized: Librarians and the digital scholarship they love — In this new series, librarians from UTL’s Arts, Humanities and Global Studies Engagement Team briefly present, explore and critique existing examples of digital scholarship. Our hope is that these monthly reviews will inspire critical reflection of and future creative contributions to the growing fields of digital scholarship.
Creating and publishing open access linguistic data is an invaluable way to support research in digital approaches to linguistics, and to lend support to making more scholarly research openly available to a broad audience. Bulgarian Dialectology as Living Tradition contributes to this body of open access data with its searchable and interactive database of oral speech in Bulgarian, representing a wide range of dialects recorded in 69 different Bulgarian villages. Data is presented in the oral recordings themselves and in the 184 transcriptions of those recordings, with a variety of features–such as tokens with associated tags for grammatical, lexical, and linguistic trait information–available for each text. This collection of Bulgarian linguistic materials is an important resource for studying the language, and the project will be of interest to anyone interested in computational linguistics, digital approaches to studying and analyzing languages, and, of course, in Slavic languages.
The site breaks down its texts into lines, which are themselves comprised of associated tags for grammatical, lexical, and linguistic trait information. Each text can be viewed in three ways: the Glossed View, which shows tokens with grammatical information, English glosses, and Bulgarian lemmas; the Line Display, which shows a line of text and its English translation; and the Cyrillic Line Display, with the original Bulgarian lines in Cyrillic script. In addition to these views, there are five types of search available to users; from the website: the wordform search, lexeme search, linguistic trait search, thematic content search, and phrase search.
This project succeeds at its goal “to return the focus of dialectology to its source in living, natural speech, to provide a broad, representative covering of this speech throughout the chosen region, and to make this material accessible to a wide spectrum of users.” The use of field recordings not only makes these recordings broadly accessible in a way that may be difficult absent digital technologies, but allows users, whether casual browsers of the site or researchers in an academic setting, to hear the language and its many dialects as it is actually spoken. The foregrounding of this dialectal speech in its “natural village context” forwards forms of a language that are often markedly different from standardized, more urban ways of speech.
The site’s creators took care to make the documentation of the site’s creation available publicly, so that others who might wish to create similar digital collections could draw on the work. The site was developed using the open source content management system Drupal, a framework that allows a greater ease of reproduction/repurposing of work and which furthers the goals and values of open source software development by creating a healthier, more robust ecosystem of scholarship and digital humanities work using freely accessible technologies.
The project serves as an important contribution to digital scholarship in Slavic Studies. The large volume and unique content of the recordings and texts make for a valuable corpus, and the creators’ commitment to supporting other projects by using open source software and making their documentation on the site’s creation publicly available is also very admirable. I hope to see it inspire other projects that likewise support open source within the digital humanities.
For more information on digital linguistic methods, open source projects, and the Bulgarian language, please consult the UTL resources below:
Esse foi o Verão do Zoom nos Estados Unidos. Qualquer pessoa cujo emprego tenha passado de presencial para quase totalmente virtual nesse curto espaço de tempo já sabe como é (a) o isolamento, (b) a sensação constante de que não vai dar conta das coisas, (c) a overdose de videoconferências, ou (d) pelo menos uma das opções acima, senão todas ao mesmo tempo. Mesmo assim, a possibilidade de interagir com outras pessoas em plataformas tipo Zoom acabou nos permitindo avançar em certas áreas bem importantes. Esse foi o caso da recente série de oficinas conduzidas pela equipe de Iniciativas Digitais da LLILAS Benson (LBDI) com suas entidades arquivísticas parceiras na América Latina.
As oficinas foram originalmente concebidas para acontecer presencialmente durante um retiro de uma semana para todo o grupo de arquivos latino-americanos parceiros. O local escolhido foi Antigua, na Guatemala. Como atividade essencial da grant de dois anos da Fundação Mellon, intitulada Cultivating a Latin American Post-Custodial Archiving Community (Criação de uma Comunidade Arquivística Pós-Custodial Latino-Americana), a ideia era usar essa semana para criar uma oportunidade especial para essas entidades, cujas sedes são a Guatemala, El Salvador, Colômbia e Brasil. O retiro proporcionaria várias sessões de treinamento, intercâmbio de recursos e conhecimentos, troca de ideias e discussões sobre desafios que elas enfrentam em seus trabalhos.
A grant da Mellon é para o período de janeiro de 2020 a junho de 2022 e subsidia os trabalhos arquivísticos pós-custodiais* executados em parceria com cinco arquivos selecionados, alguns dos quais já se encontram representados no repositório da Latin American Digital Initiatives. Esse repositório enfatiza coleções que documentem temas de direitos humanos e comunidades subrepresentadas.
A pandemia do Covid-19 exigiu que a equipe de iniciativas digitais começasse a se articular e tomasse decisões rápidas para manter o ritmo do projeto no âmbito do cronograma da grant. O resultado foi essa série de oficinas oferecidas via Zoom, que exigiu dos membros da equipe LBDI a produção de vídeos completos de treinamento, concepção de sessões Q&A e agendamento de sessões com especialistas convidados. Os tópicos eram a montagem e redação de grants, preparo de orçamentos, processamento arquivístico, metadados, seleção de equipamentos, preservação digital e formação em tecnologia digital, entre outros.
Durante cinco semanas desse último verão americano, os participantes da oficina se reuniram duas vezes com Theresa Polk, David Bliss, Itza Carbajal, Albert Palacios e Karla Roig, todos membros da equipe da LBDI, com a presença adicional de Megan Scarborough, administradora de grants da LLILAS Benson. Todas as sessões foram conduzidas em espanhol com tradução legendada para o português (ou vice-versa) a cargo de Susanna Sharpe, coordenadora de comunicações da LLILAS Benson. Outros apresentadores foram Carla Alvarez, arquivista U.S. Latinx da Benson Latin American Collection, e duas especialistas em preservação de fotografias, Diana Díaz (Metropolitan Museum of Art) e María Estibaliz Guzmán (Escola Nacional de Conservação, Restauração e Museografia, ou ENCRyM, no México).
Apesar da distância física, ficou claro o alto valor atribuído pelos participantes a essa oportunidade de se reunir e aprender uns com os outros, especialmente durante uma pandemia que tem tido efeitos tão profundos na vida de tantos e no trabalho diário de todos nós. A pandemia ainda veio acompanhada de um forte isolamento, de ações de repressão e de crescentes ataques a certas comunidades. Esses fatores enfatizaram mais ainda, para as entidades parceiras, a urgência de preservar as documentações de suas comunidades não só para apoiar as lutas atuais por reconhecimento e respeito a direitos humanos básicos mas, também, para impedir iniciativas futuras que visem eliminar a memória ou negar a existência de violências e injustiças que sabemos vêm sendo cometidas. Esse compromisso compartilhado trouxe um grande senso de solidariedade para os participantes e um desejo de apoio mútuo.
“Para o nosso time, foi uma experiência enriquecedora que nos permitiu refletir, como parte de um grupo multinacional, sobre as conquistas e expectativas do projeto LLILAS Benson Mellon”, relatou Carlos Henríquez Consalvi (conhecido como “Santiago”), do MUPI, que também ressaltou como positiva a oportunidade de conhecer de perto o trabalho dos arquivos parceiros “e entender os desafios que eles enfrentam com a conservação e difusão de suas respectivas coleções”.
Carolina Rendón, um dos dois participantes do Centro de la Memoria Monseñor Juan Gerardi, do ODHAG, disse que os fardos diários da pandemia ficaram mais leves com a oportunidade de interagir com outras pessoas: “Foi muito bom estar no mesmo espaço, junto com gente que trabalha em diferentes arquivos espalhados pela América Latina. A pandemia tem sido muito dura. Durante as oficinas nós passamos por vários estágios, primeiro o lockdown, depois o medo, depois o horror diante de tantas mortes… Eu valorizo muito esse travar conhecimento, mesmo que virtualmente, com gente que trabalha em arquivos de outros países”.
Para a equipe da LLILAS Benson, os comentários positivos e a sensação geral de gratidão pela solidariedade dos encontros online foram uma compensação pelo trabalho árduo que foi preparar os diversos vídeos semanais de treinamento em espanhol, cujos roteiros iam sendo rapidamente traduzidos para o português pela nossa expert colaboradora Tereza Braga. Nas palavras de David A. Bliss, arquivista de processamento digital, “o maior desafio foi destilar uma quantidade gigantesca de dados técnicos para obter apenas os elementos mais importantes e comunicar esses elementos da maneira mais clara possível em espanhol”.
David aludiu ainda ao fato de que as próprias entidades parceiras são um grupo bem diversificado, com formações, necessidades, e tipos de arquivos diferentes. “Algumas das nossas parceiras já rodam programas de digitalização há anos mas, para outras, as informações eram todas novas, então eu me dediquei muito para poder chegar a um equilíbrio entre os dois lados, usando recursos visuais e definições bem claras para os termos técnicos”, ele declarou.
Um dos aspectos mais gratificantes da série foi constatar que é possível reunir profissionais arquivísticos e líderes ativistas, todos trabalhando para preservar registros importantes de memória no campo dos direitos humanos, em um só espaço, mesmo sendo um espaço virtual, para compartilhar seu trabalho e suas perspectivas e se enriquecerem mutuamente. David explicou isso dizendo que “o normal é trabalharmos individualmente com cada organização parceira para auxiliá-la a administrar seu projeto de digitalização, com a meta de capturar todas as coleções daquela entidade e reuní-las no LADI para incentivar usuários a estabelecer conexões entre elas. Mas muitas das nossas parceiras não se restringem à guarda de coleções de documentos históricos; elas estão engajadas em tempo real na luta em prol de suas comunidades. Elas são, portanto, muito melhor equipadas para ajudar uma à outra a traçar estratégias e conseguir êxito nesse trabalho do que nós. Sendo assim, dar a elas o espaço para formar essas conexões diretas umas com as outras é realmente importante. E isso é muito validador para nós também, porque essa tem sido uma das nossas metas há anos já: queremos ser apenas um elo de uma rede de parceiras; não queremos estar no centro da rede”.
* Arquivística pós-custodial é um processo utilizado para preservar digitalmente certos arquivos, muitos deles vulneráveis, e disponibilizar essas versões digitais para o mundo inteiro aumentando, assim, o acesso aos conteúdos e assegurando, ao mesmo tempo, que eles permaneçam sob a guarda e os cuidados de suas comunidades de origem. A LLILAS Benson é uma pioneira desta prática.
The Simple Web Archiver—a straightforward, open source web archiving tool to create personal archives of websites and the files they host—has been published on GitHub under the GNU General Public License, allowing users to use and remix the tool with minimal limitations. The tool, built in Python, provides a GUI interface, and uses BeautifulSoup and wget to parse websites and download files, respectively. I created the tool as part of my work as the European Studies Librarian at the UT Libraries.
Archiving websites is an important practice for anyone interested in preserving digital history. Digital media, and media online, is particularly vulnerable to being lost, as it is often ephemeral in nature and not preserved in an archival format. Saving born-digital materials complements the archiving, curation, and preservation of physical materials, and helps to ensure that internet-based ephemera will be preserved into the future.
Why use this tool?
This tool provides an easy way to create small, personal archives that live offline. While there are many useful web archiving tools available (listed below), this program fills a gap not addressed by existing solutions. Its scope is intentionally small: it aims to create low-memory- use archives for personal use, and to be as easy to use as possible so that users with limited technical knowledge can begin using it immediately, without a complicated setup process or learning curve.
The tool uses a GUI to make the tool very easy to use. Following the directions on the GitHub site allows one to set up the tool and begin using it almost instantly. Another important aspect of this tool is the ease with which it can be modified, by those with some coding experience, to accomplish something else or adapt to the behavior of a certain site. One example of such remixing is this code to capture Omeka sites, specifically, downloading more of the site’s content than the Simple Web Archiver does by default.
Extant web archiving tools tend to accomplish different things than the Simple Web Archiver, and to encompass different scopes. Here is a brief review of other popular software:
Internet Archive – An excellent and easy-to-use tool, but the archives created are hosted online, on the Internet Archive’s servers. Also, not all files will be preserved when crawling a website (PDFs, for example, cannot be archived).
ArchiveIt: also from the Internet Archive, this is great for institutions who want online hosting. It operates on a paid model and, again, not ideal for individual researchers or archivists who want a quick, easy archive of a site and its files.
HTTrack: HTTrack is a free, offline browser utility. Per the tool’s website, it “allows you to download a World Wide Web site from the Internet to a local directory, building recursively all directories, getting HTML, images, and other files from the server to your computer. HTTrack arranges the original site’s relative link-structure.” This is good for those who want a thorough, complete archive of a site, but is not geared toward quick, low-memory-use archives to be stored for personal use.
WarcIt – This tool is entirely programmatic, and only provides WARC (Web ARChive) files. There is no GUI available.
Archivebox – This tool is self-hosted, but programmatic. More robust than the Simple Web Archiver, but its features are not necessarily needed for quick, easy-to-setup or one-off archives. It does not save PDFs, or other files.
Wget – A programmatic tool to download content from the internet. The Simple Web Archiver tool primarily uses wget on the backend to grab online materials.
This tool should work well out of the gate, but there is always the possibility that certain websites, due to their specific architectures, may not be completely archived. The tool’s code was written to be open-ended and adapt to many different types of sites, but for users with specific wants or use cases, it provides a blueprint for the creation of a variation on the tool, or even a completely new piece of software. It is also designed to run relatively quickly, and to grab the main content of a site without unnecessarily consuming CPU power.
The code is simple, and all in one file. The two main functions in the tool download either HTML/CSS/other file types or WARCs, depending on user preference.
The code is also released under the GNU General Public License v3.0. This is a strong copyleft license conditioned on making available complete source code of licensed works and modifications, which include larger works using a licensed work, under the same license. Using this license allows for a wide range of remix and reuse by users and programmers.
I would encourage anyone interested in web archiving to give the tool a try, and to contribute in any way they’d like: by remixing the tool’s code, forking the GitHub repository, or by simply using the tool and providing any feedback they’d like to share.
The Archive of the Indigenous Languages of Latin America (AILLA) is delighted to announce the launch of a free online course called Archiving for the Future: Simple Steps for Archiving Language Documentation Collections, available at https://archivingforthefuture.teachable.com/. The course material is based upon work supported by the National Science Foundation under Grant No. BCS-1653380 (Susan S. Kung and Anthony C. Woodbury, PIs; September 1, 2016, to August 31, 2020). The course is licensed under a Creative Commons Attribution-ShareAlike 4.0 International license.
The course is a resource to aid people of all backgrounds in organizing born-digital and digitized language materials and data for deposit into any digital repository (not just AILLA) for long-term preservation and accessibility. The target audience for this course is anyone who is engaged in creating materials in or about Indigenous, endangered, under-documented, or minority languages as part of language documentation efforts, including language rights, maintenance, and revitalization. It was designed particularly for individuals or groups made up of academic researchers and/or Indigenous or endangered language speakers and community members, though anyone may benefit from it.
The curriculum follows simple steps to guide participants through three phases of work to organize language documentation materials for archiving, and it explains in detail what to do before, during, and after data collection to facilitate the long-term preservation of the data. The course is designed to be informative, engaging, and accessible to anyone, especially to those with no previous experience archiving collections of language materials.
This course was developed by four members of the AILLA staff: Susan Kung, AILLA Manager and grant co-PI; Ryan Sullivant, AILLA Language Data Curator; Alicia Niwabaga, Graduate Research Assistant 2017–2018; and Elena Pojman, Undergraduate Research Assistant 2019–2020. Sullivant and Kung interviewed representatives of various DELAMAN (delaman.org) archives and other digital data repositories in the United States, the United Kingdom, the European Union, Australia, and Cameroon. Niwagaba collaborated with Kung and Sullivant to develop an early version of the course that the AILLA team taught live at the Institute on Collaborative Language Research (CoLang 2018) at the University of Florida in Gainesville during June 18–22, 2018. Niwagaba created the educational animated videos that are embedded in the course to illustrate key aspects of the curriculum. Pojman researched curriculum platforms in which to build the online course. Teachable was selected for a variety of reasons, including its simple yet attractive aesthetic that displays all course modules in the left side bar (see illustration below); its ease of use and progress tracking for enrolled students; its responsiveness to different technology; and the built-in ability to quickly and easily set up the same course in multiple languages. This last feature is especially important since AILLA staff plan to translate the curriculum into Spanish and Portuguese to make it more accessible to AILLA’s Latin American audience. Once the curriculum software was selected, Kung and Sullivant expanded the original 2018 workshop curriculum and wrote the additional content. Pojman wrote the objectives and activities for each step, built the English course in Teachable, and created all of the graphics that are used in the curriculum.
In funding and academic environments where it is becoming increasingly common for researchers to be responsible for archiving their own research data, the AILLA staff saw a need to train language researchers to do this work so that the resulting language collections would be well organized, well described, easy to navigate, and available to reuse for further research and education. While there are some language documentation programs in North America, Europe, Australia, and New Zealand that train language documenters to do these tasks, most do not, and almost no training on how to archive language documentation is available in Latin America. The AILLA team created this course to fill these gaps.
In the spring of 2019,LLILAS Benson Latin American Studies and Collections partnered with the Urban Teachers Program at the Department of Curriculum and Instruction in the College of Education to develop and provide free, online access to high school lesson plans. The goal was to bring together the historical perspectives of underrepresented groups, current scholarship, and digitized holdings of the Benson Latin American Collection and Latin American partners. Thanks to a Department of Education Title VI grant, LLILAS Benson was able to create a portal via UT Libraries’ open-access repositories to make these resources widely available to teachers.
For the past two years, College of Education graduate students have been creating World History and World Geography units for use in high school classrooms. The underlying principle for these teaching materials is that students are able to understand, and then subvert, dominant historical narratives in Latin American, U.S. Latinx, and African Diaspora history given the marginalized perspectives the lesson plans highlight. Using the Benson’s digital collections, they have focused on a variety of topics, including women in colonial Latin America, the Mexican Revolution, and the Cold War in Central and South America (publication in process).
The collaboration and site has since broadened to include other disciplines, audiences, and learning objectives. LLILAS Benson Digital Scholarship staff has been partnering with faculty and graduate students in Latin American Studies, Art and Art History, Spanish and Portuguese, Mexican American Studies, and History to design Digital Humanities–focused lesson plans and assignments for undergraduate teaching. Work is also ongoing to publish technical capacity-building teaching and learning resources for graduate students, digital humanists, and archival professionals at UT Austin and beyond.
The site also helps instructors and students find and browse through LLILAS Benson’s digital resources. It consolidates under its Primary Sources section all existing LLILAS Benson digital scholarship projects, digitized collections, and exhibitions. Visitors can filter these resources by grade level, date range, course subject, and country to find relevant primary and secondary sources on their research and teaching focus.
Explore the site through http://curriculum.llilasbenson.utexas.edu/. The interdisciplinary collaborations and site’s development were generously funded by the U.S. Department of Education’s Title VI Program and LLILAS Benson’s Excellence Fund for Technology and Development in Latin America. This resource was conceived, designed, and launched by:
Lindsey Engleman, Public Engagement Coordinator (2014–2019), LLILAS Benson Latin American Studies and Collections
Tiffany Guridy, Public Engagement Coordinator, LLILAS Benson Latin American Studies and Collections
Delandrea S. Hall, Doctoral Candidate, Curriculum and Instruction, College of Education
Rodrigo Leal, Website Designer and Student Technician(Spring 2019), LLILAS Benson Latin American Studies and Collections
Casz McCarthy, Public Engagement Graduate Research Assistant, LLILAS Benson Latin American Studies and Collections
Albert A. Palacios, Digital Scholarship Coordinator, LLILAS Benson Latin American Studies and Collections
Cinthia S. Salinas, Professor and Chair, Department of Curriculum and Instruction, College of Education
UT Libraries Digital Stewardship (Anna Lamphear and Brittany Centeno)
Read, hot & digitized: Librarians and the digital scholarship they love — In this series, librarians from UTL’s Arts, Humanities and Global Studies Engagement Team briefly present, explore and critique existing examples of digital scholarship. Our hope is that these monthly reviews will inspire critical reflection of and future creative contributions to the growing fields of digital scholarship.
Katie L. Coldiron is the Global Studies Digital Projects GRA at Perry-Castañeda Library and a current graduate student at the School of Information. She also has an M.A. in Latin American Studies.
Arab migration to the Americas is a unique phenomenon, particularly in the context of northern South America. While Arabs came to the Americas fleeing such events as the collapse of the Ottoman Empire and the Armenian Genocide, they mainly went to the countries of the Southern Cone and Brazil, which are known for welcoming more immigrants than their other Latin American counterparts. Colombia and Venezuela, which are referred to as sister countries, tend to be portrayed outside their borders in a negative light, rather than for that which makes up their rich and diverse cultures. This post is an effort to highlight a group of migrants that came to Colombia and Venezuela and remain there today, both assimilating into their respective countries while also keeping their traditions alive. Furthermore, the existence of these digital primary resources provides a necessary means of continuing academic research in the midst of a global pandemic. The ability to have first-hand accounts readily available of a particular diaspora is a privilege of the digital age, and a mechanism of democratization for the cultural record.
Instituto de Cultura Árabe de Colombia
Through my research into the Arab-Colombian diaspora and conversations with prominent scholars on the subject, I was put in touch with Odette Yidi David of Barranquilla. The city of Barranquilla became a hub for Arab migration to Colombia in the early 20th century due to its proximity to the Port Colombia. Yidi David is a fourth generation Palestinian barranquillera and a scholar on the subject of Arabs in Latin America and the Caribbean. Currently she divides her time teaching at Barranquilla’s Universidad del Norte and serving as executive director of the Colombian Institute of Arab Culture.
Yidi David founded the Colombian Institute of Arab Culture with the intention of constructing bridges between Colombia and the Arab world, as well as generating and sharing responsible knowledge about the Arab world in Colombia. The organization has offered Arabic language classes and talks on a variety of subjects pertaining to the Arab world, and even hosted an Arab culture festival in Barranquilla. With the advent of social distancing, I have noticed an uptick in content they are sharing about digital events, as well as resources and other activities to pass the time of quarantine. Many Zoom panels have also been shared through their social media outlets, including ones that feature Yidi David herself. The Institute also regularly hosts talks on Arab dance and even virtual dance classes. These posts not only provide means of diversion in a time in which not many exist, but also create larger awareness of the existence of the Arab diaspora in Colombia. While prominent families, beauty queens, and even the singer Shakira have brought notoriety to the diaspora, the Institute goes beyond a few notable faces to show that Arab culture is alive in the Colombian Caribbean. Their social media includes Instagram and Facebook.
Los libaneses en Venezuela
My research also led me to the blog “Los libaneses en Venezuela,” or “The Lebanese in Venezuela.” This blog is the product of Venezuelan-Lebanese journalist Tony Frangie Mawad, who regularly contributes to outlets like Caracas Chronicles. To create this blog, Frangie Mawad did recorded interviews with multiple Lebanese immigrants to Venezuela who arrived from the 1940s to 1960s, and chronicled their stories individually. Due to his familial ties to the Lebanese diaspora of Caracas, finding the individuals to interview was, in the words of Frangie Mawad, “only a question of picking up the phone and telling them or their kids about the project.” A photographer friend also helped Frangie Mawad with photographing the featured individuals. The accounts are very personal and are contextualized within the atmospheres of Lebanon and Venezuela at the time. The blog can be found here.