Skip to content

UZH MAS thesis "Machine indexing of institutional repositories: indexing Edoc with Annif as proof of concept" by Maximilian Hindermann, 2021

License

Notifications You must be signed in to change notification settings

MHindermann/mas

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Machine indexing of institutional repositories: indexing Edoc with Annif as proof of concept

DOI

The potential of some institutional repositories is hampered by the lack of subject indexing. I argue that this situation can be amended with the help of machine indexing. As proof of concept I show that Edoc — the University of Basel’s repository — can be successfully indexed using the Annif-client Python library. In order to do so, I assess the performance of hundreds of Annif configurations in assigning subject terms to a sample data set against a gold standard constructed from cleaned and reconciled author keywords.

  • Text and images available at /text/
  • Python codebase and data available at /files/
  • Documentation available at /docs/

About

UZH MAS thesis "Machine indexing of institutional repositories: indexing Edoc with Annif as proof of concept" by Maximilian Hindermann, 2021

Resources

License

Stars

Watchers

Forks