Tag Archives: Read Hot and Digitized

Read, Hot & Digitized: Visualizing Wikipedia’s Gender Gap

Read, hot & digitized: Librarians and the digital scholarship they love — In this new series, librarians from UTL’s Arts, Humanities and Global Studies Engagement Team briefly present, explore and critique existing examples of digital scholarship.  Our hope is that these monthly reviews will inspire critical reflection of and future creative contributions to the growing fields of digital scholarship.

Wikipedia is a website that many of us use every day – yes, even us librarians! Wikipedia was founded with utopian ideals, with its democratic approach to content creation and always-free, open knowledge. Therefore, it seems like the ideal platform to address structural inequalities in our information systems that reflect and reinforce racism, misogyny, homophobia, and transphobia and combinations thereof.

However, Wikipedia has a long-standing problem of gender imbalance both in terms of article content and editor demographics. Only 18% of content across Wikimedia platforms are about women. The gaps on content covering non-binary and transgender individuals are even starker: less than 1% of editors identify as trans, and less than 1% of biographies cover trans or nonbinary individuals. When gender is combined with other factors, such as race, nationality, or ethnicity, the numbers get even lower. This gender inequity has long been covered in the scholarly literature via editor surveys and analysis of article content (Hill and Shaw, 2013; Graells-Garrido, Lalmas, and Menczer, 2015; Bear and Collier, 2016; Wagner, Graells-Garrido, Garcia, and Menczer, 2016; Ford and Wajcman, 2017). To visualize these inequalities in nearly real time, the Humaniki tool was developed.

Humaniki was created in 2020 by merging two previous data visualization projects. Data scientist Maximillian Klein created the Wiki Data Human Gender Indicators project in 2016. The French project Denelezh was created by Enzel Le Mir for Wikimedia France in 2017. Both projects utilized the Wikidata API and merged because of their significant overlap and shared mission, and Klein recently received a grant from the Wikimedia Foundation to continue this work. Humaniki is also built using Python, and its backend code is available on GitHub

Humaniki has many ways to explore this data. One of the most interesting is to look at the numbers based on language. Wikipedia isn’t just available in English, and Humaniki offers users the chance to look at gender representation for biographies in 529 languages! Another interesting data point is Year of Birth, and the trends in the Humaniki data suggest the gender gap closes slightly for biographies about younger people. For example, 23% of biographies on people born in 1963 are about women. For biographies on people born in 1983, however, 29% are about women. 

Humaniki also provides numbers of biographies on people who identify as “other genders” (people whose gender identity is not cisgender). For each metric, you can review the “Other Genders Breakdown,” which lists out all the gender identities (trans women, trans men, nonbinary, genderfluid, two-spirit, etc.) included in that particular data point. The “Other Genders” metric is important because the numbers are so stark. Looking back to our examples from 1963 and 1983, only 16 biographies in the 1963 dataset and 31 from 1983 are about people who don’t identify as cisgender – that’s out of more than 50,000 biographies! This highlights the great need to create and expand articles on people who identify outside of the traditional gender binary.

Humaniki is a useful tool for building awareness of the Wikipedia gender gap, and there are many ways to act upon this knowledge and get involved. The UT Libraries sponsors multiple Wikipedia edit-a-thons focused on improving articles about women and LGBTQ+ people. Every March, we host Queering the Record, a homegrown edit-a-thon to improve queer and trans representation, and we participate in the international campaign Art + Feminism, which focuses on gender, feminism, and the arts. Additionally, we’ve hosted one-off edit-a-thons covering Latinx and Mexican women, Indigenous languages, and women and LGBTQ+ people in STEM fields. Keep an eye on the UT Libraries events page to learn about future edit-a-thons!

Scholarship and Popular Press on the Wikipedia Gender Gap

Bear, Julia B., and Benjamin Collier. “Where are the women in Wikipedia? Understanding the different psychological experiences of men and women in Wikipedia.” Sex Roles 74, no. 5-6 (2016): 254-265. 

Filipacchi, Amanda. “Wikipedia’s Sexism Toward Female Novelists.” The New York Times, April 24, 2013. 

Ford, Heather, and Judy Wajcman. “‘Anyone can edit’, not everyone does: Wikipedia’s infrastructure and the gender gap.” Social Studies of Science 47, no. 4 (2017): 511-527.

Gordon, Maggie. “Wikipedia Editing Marathons Add Women’s Voices to Online Resource.” Houston Chronicle, November 9, 2017. https://www.houstonchronicle.com/life/article/Adding-women-s-voices-to-Wikipedia-12344424.php

Graells-Garrido, Eduardo, Mounia Lalmas, and Filippo Menczer. “First women, second sex: Gender bias in Wikipedia.” In Proceedings of the 26th ACM Conference on Hypertext & Social Media, pp. 165-174. 2015.

Hill, Benjamin Mako, and Aaron Shaw. “The Wikipedia Gender Gap Revisited: Characterizing Survey Response Bias with Propensity Score Estimation.” PloS One 8, no. 6 (2013): e65782–e65782.

Paling, Emma. “The Sexism of Wikipedia.” The Atlantic, October 21, 2015. https://www.theatlantic.com/technology/archive/2015/10/how-wikipedia-is-hostile-to-women/411619/

Stephenson-Goodknight, Rosie. “Viewpoint: How I Tackle Wiki Gender Gap One Article at a Time.” BBC News, December 7, 2016. https://www.bbc.com/news/world-38238312

“The Nobel Prize Winning Scientist Who Wasn’t Famous Enough for Wikipedia.” The Irish Times, October 3, 2018. https://www.irishtimes.com/life-and-style/people/the-nobel-prize-winning-scientist-who-wasn-t-famous-enough-for-wikipedia-1.3650212

Wagner, Claudia, Eduardo Graells-Garrido, David Garcia, and Filippo Menczer. “Women through the glass ceiling: gender asymmetries in Wikipedia.” EPJ Data Science 5 (2016): 1-24.

plenty of fish in the sea: using dutch art to study historic biodiversity

Read, hot & digitized: Librarians and the digital scholarship they love — In this new series, librarians from UTL’s Arts, Humanities and Global Studies Engagement Team briefly present, explore and critique existing examples of digital scholarship.  Our hope is that these monthly reviews will inspire critical reflection of and future creative contributions to the growing fields of digital scholarship.

Fishing in the past” encourages us to explore the connections between artistic expression, scientific identification, and commercial practices. A crowdsourced metadata project, “Fishing in the past” asks volunteers to identify fish species represented in Dutch still life paintings from the early modern period to learn more about historical aquatic biodiversity and commercial uses of fish in Europe. The campaign is part of “A new history of fishes,” a project funded by the Dutch Research Council that includes researchers from Leiden University Centre for the Arts in Society and Naturalis Biodiversity Centre. The artwork included in the “Fishing in the past” campaign comes from the Rijksmuseum and the RKD – Netherlands Institute for Art History. The project was designed using Zooniverse, “the world’s largest and most popular platform for people-powered research.”[1] This crowdsourced approach to research has been termed “citizen science.”[2]

I discovered “Fishing in the past” while evaluating Zooniverse for possible use in the creation of a crowdsourced metadata campaign for photographs from the “Sajjad Zaheer Digital Archive.” I was intrigued by the project’s use of art to support scientific research. This is just one example of how digital scholarship tools and methods can facilitate interdisciplinary projects that propose creative solutions to existing research problems. “A new history of fishes” examines the relationship between ichthyology (the study of fish) and European history and culture, an area of inquiry that “has always been underexposed.”[3] Though quite different in subject matter, the “Sajjad Zaheer Photo Archive” and “Fishing in the past” share the objective of identifying beings (human and aquatic, respectively) in images, a belief in the value of opening up research projects to the general public, and a commitment to open access data and information. As such, “Fishing in the past” was a helpful model for my own project.

“Fishing in the past” asks members of the public to identify the species for every fish in an image. The research team provides tools to help, such as a list of common species that includes images and identifying features to assist classification. The species list can filtered by characteristic, such as color or pattern. After identifying the species, contributors are instructed to classify the commercial use of the fish, such as traded at a market or consumed on plate. They finally record the number of fish for a single species in the image. The process is repeated for each species pictured.

The “Fishing in the past” team has already shared some initial results and plans to publish further findings in an open access journal. Through crowdsourcing, this project has generated more data in a shorter period of time than could be achieved by the research team alone. Benefits for volunteers include engaging in their interests, interacting with artistic and scientific materials in new ways, and knowing that they are making a contribution to something bigger than themselves. For future researchers, crowdsourcing campaigns provide valuable data, including the ability to “read” materials with accessibility technologies.

All Zooniverse campaigns can be found here. Those interested in crowdsourced transcription work might also enjoy participating in FromThePage projects from University of Texas Libraries.

The Fine Arts Library holds catalogs that accompanied past Dutch and Flemish still life exhibitions.

Those interested in marine science should start with this LibGuide.

[1] https://www.zooniverse.org/about

[2] For an in-depth look at citizen science: Hecker, S., Haklay, M., Bowser, A., Makuch, Z., Vogel, J., & Bonn, A. (2018). Citizen Science: Innovation in Open Science, Society and Policy. University College London.

[3] https://www.universiteitleiden.nl/en/research/research-projects/humanities/new-history-of-fishes

Madeline Goebel is the Global Studies Digital Projects GRA at Perry-Castañeda Library and a current graduate student at the School of Information.

“IT IS DULL, SON OF ADAM, TO DRINK WITHOUT EATING:” ENGAGING A TURKISH DIGITAL TOOL FOR THE STUDY OF ISLAMIC THOUGHT


Read, hot & digitized: Librarians and the digital scholarship they love — In this new series, librarians from UTL’s Arts, Humanities and Global Studies Engagement Team briefly present, explore and critique existing examples of digital scholarship.  Our hope is that these monthly reviews will inspire critical reflection of and future creative contributions to the growing fields of digital scholarship.

Over the years of my involvement in Middle Eastern and Islamic Studies (MEIS), I have become something of an advocate for learning modern Turkish. The necessity of facility with Turkish in order to conduct research in MEIS, and more importantly, to carry on scholarly communication in MEIS, grows clearer every year. I would not hesitate to argue that non-Turkish scholars ignore Turkish scholarship at their own peril—it is that central, plentiful, and informative. An excellent example of a scholarly development out of Turkish academe that would be quite useful for MEIS pedagogy and research is İslam Düşünce Atlası, or The Atlas of Islamic Thought. It also happens to be an incredible digital Islamic Studies scholarship initiative.

İslam Düşünce Atlası (İDA) is a project of the İlim Etüdler Derneği (İLEM)/Scientific Studies Association with the support of the Konya Metropolitan Municipality Culture Office. It is coordinated by İbrahim Halil Üçer, with the support of over a hundred researchers, design experts, software developers, and GIS/map experts. The goal of the project is to make the academic study of the history of Islamic thought easily accessible to scholars and laypeople alike through new (digital) techniques and within the logic of network relations. İDA has been conceived as an open-access website with interactive programs for a range of applications. Its developers intend it to contribute a digital perspective to historical writing on Islam: a reading of the history of Islamic thought from a digitally-visualized time-spatial perspective and context.

İDA features three conceptual maps that aim to visualize complex relationships and to establish a historical backbone for the larger project of the atlas: the Timeline (literally time “map,” which is a more signifying term for the tool, Zaman Haritası), the Books Map (Kitaplar Haritası), and the Person Map (Kişiler Haritası). It also proposes a new understanding of the periodization of Islamic history based on the development of schools of thought (broadly defined) and their geographic spread. İDA endeavors to answer several questions through these tools: by whom, when, where, how, in relation to which school traditions, through what kinds of interactions, and through which textual traditions was Islamic thought produced? Many of these questions can be summed up under the umbrella of prosopography, and in that arena, İDA has a few notable peer projects: the Mamluk Prosopography Project, Prosopographical Database for Indic Texts (PANDiT), and the Jerusalem Prosopography Project (with a focus on the period of Mongol rule), among others.

One of my favorite aspects of İDA is the book map and its accompanying introduction. The researchers behind İDA do their audience the great service of explaining the development and establishment of the various genres of writing in the Islamic sciences. Importantly, they also link the development of these genres to the periodization of Islamic history that they propose. The eight stages of genre development that are identified—collation/organization, translation, structured prose, commentary, gloss, annotation, evaluative or dialogic commentary, and excerpts/summaries—share with the larger İDA project their origin in scholarly networking and relationship building. By visualizing the networks of Muslim scholars, as well as the relationships among their scholarly production and the non-linear, multi-faceted time “map” of Islamic thought, İDA weaves together the disparate facets of a complex and oft willfully misunderstood intellectual tradition

I encourage readers not only to learn some modern Turkish in order to make full use of İDA (although Google translate will work in a pinch!), but also to explore threads throughout all of the visualizations: for example, trace al-Ghazālī’s scholarly network, and then look at that of his works. What similarities and differences do you notice? Is there a pattern to the links among works and scholars? Readers who are interested in the intellectual history of Islam should check out my Islamic Studies LibGuide, as well as searches in the UT Libraries’ catalog for some of their favorite authors (see here for al-Ghazālī/Ghazzālī, Ibn Sina/Avicenna, and Ibn al-Arabi).

Read, Hot and Digitized: Braceros Tell Their Stories

Read, hot & digitized: Librarians and the digital scholarship they love — In this new series, librarians from UTL’s Arts, Humanities and Global Studies Engagement Team briefly present, explore and critique existing examples of digital scholarship.  Our hope is that these monthly reviews will inspire critical reflection of and future creative contributions to the growing fields of digital scholarship.

A twenty-two-year program that began during World War II and is still relevant nearly sixty years after its conclusion in 1964, the Bracero Program was an agreement between the U.S. and Mexican governments to permit short-term Mexican laborers to work in the United States.

In an effort to stem labor shortages during and after the war years, an estimated 4.6 million workers came to the USA with the promise of thirty cents per hour and “humane treatment.” Of course, we know that loosely defined terms like “humane treatment” present a slippery slope that can erase and omit stories. Fortunately, through the collaborative efforts of the Roy Rosenzweig Center for History and New Media, George Mason University, the Smithsonian National Museum of American History, Brown University, and the University of Texas at El Paso’s Institute of Oral History, many of those once-hidden stories have been preserved and made accessible through the Bracero History Archive (BHA).  

The BHA offers a variety of materials, most notably over 700 oral histories recorded in English and Spanish. While the metadata fields for each oral history could be more robust, the ability to hear first-hand accounts and inter-generational stories is a dream come true for primary source-seekers. All audio is available to download in mp3 format for future use.  

Apart from oral histories, other resources are also available. Images, such as photographs and postcards, provide visuals of the varied environments that hosted the Braceros as well as portraits of the Braceros themselves.  

Again, further detail on these resources would benefit the archive. For example, the photograph above, titled “Two Men,” demonstrates a lack of context needed for a more profound understanding while also acknowledging the potentially constant transient nature of Bracero work. In fact, the very word bracero, derived from the Spanish word for “arm,” is indicative of the commodification and dehumanization of the human body for labor. Workers lived in subpar work camps, received threats of deportation, and lacked proper nourishment, especially given the arduous work conditions.  

Additional BHA resources include a “documents” section in which offspring share anecdotes about the Bracero Program and track down information about loved ones. Finally, the site offers resources for middle school and high school teachers to use in their curriculum. Here again is an opportunity to further build out the site for university-level instruction.  

The digital objects in the BHA are worthwhile for those looking to recover an often-overlooked subject in American history that still resonates with themes relating to immigration today. Indeed, farmworkers continue to be exploited and underappreciated despite their contributions to society. This has led to a number of movements, marches, and boycotts in efforts to improve living conditions and wages. 

For those interested in oral history collections at the University of Texas Libraries, look no further than the Voces Oral History Project and Los del Valle Oral History Project, both housed at the Nettie Lee Benson Latin American Collection. Similarly, collections related to farmworkers, and undoubtedly influenced by the legacies of the Bracero Program, include the Texas Farm Workers Union Collection and the María G. Flores Papers.