This lesson uses word embeddings and clustering algorithms in Python to identify groups of similar documents in a corpus of approximately 9,000 academic abstracts. It will teach you the basics of dimensionality reduction for extracting structure from a large corpus and how to evaluate your results.
In this video, presented as part of the Friday Frontiers series, Bernard Pochet traces the evolution of Open Science at the University of Liège in the early 2000s, focusing on Open Access and the implementation of a Diamond Open Access journal publishing platform (PoPuPS) and an institutional repository (ORBi).
How would you as a person with deafblindness navigate the world – a world filled with navigation and mobility challenges, inaccessible information, and technologies that rely on the senses of sight and hearing? In this talk, Nasrine Olson (PhD, Associate Professor) introduces the idea behind the formation of the Centre for Inclusive Studies at University of Borås and presents a few projects that have explored ways in which technology can be leveraged to level the playing field.
This lesson is the first of a two-part lesson focusing on an indispensable set of data analysis methods, logistic and linear regression. It provides an overview of linear regression and walks through running both algorithms in Python (using Scikit-learn). The lesson also discusses interpreting the results of a regression model and some common pitfalls to avoid.
In this lesson from Programming Historian, you will learn how to display a georeferenced map from Map Warper in KnightLab's StoryMap JS, an interactive web-based map and storytelling platform.
Researchers often need to be able to search a corpus of texts for a defined list of terms and historians are often interested in certain places named in a text or texts. This lesson details how to programmatically search documents for a list of terms, including place names and then how to obtain coordinates and map historical place names with the World Historical Gazetteer.
Is ChatGPT unsettling you? Are you annoyed to always land on the same webportal when googling for a specific book? Do you hate it when just the one page you need to consult is nowhere to be found on the internet? This presentation by Anne Baillot is for you!
This training event from the TRIPLE Project was devoted specifically to FAIR Data in SSH and provided answers to the following questions, among others: How is research data defined in SSH; Why are FAIR principles important for the management of research data in SSH; How can FAIR principles be implemented in SSH.
The SSHOC-DARIAH Train-the-Trainer Research Data Management Bootcamp ('Research Data Management Bootcamp' for short) took place over two half-day workshops that gave access to experts in the field and allowed for real-time activities between the sessions. It was co-organised by the SSHOC project and the DARIAH 'Research Data Management' Working Group.
In this lecture, Victoria Van Hyning explores the possibilities of crowdsourcing as "cultural heritage co-creation" or "commons-based peer production", expanding on the need for further comparative analysis of design and engagement strategies for crowdsourcing projects, their resulting data and possible applications for these data in Machine Learning training sets.
This tutorial explains the fundamentals and usage of the DARIAH-DE Collection Registry, a tool that allows you to describe and index data collections. The manual gives an overview of the usability and functionalities of the Collection Registry and introduces best practice recommendations.
This video recording is of 'Using Digital Archives for Geographical and Archaeological Research', the second webinar in a three-part public lecture series hosted by the Digital Repository of Ireland (DRI), aimed at early career researchers. The webinar showcases the rich research resources contained in digital archival collections that can be used to advance geographical and archaeological research.