The potential of some institutional repositories is hampered by the lack of subject indexing. I argue that this situation can be amended with the help of machine indexing. As proof of concept I show that Edoc — the University of Basel’s repository — can be successfully indexed using the Annif-client Python library. In order to do so, I assess the performance of hundreds of Annif configurations in assigning subject terms to a sample data set against a gold standard constructed from cleaned and reconciled author keywords.
-
Notifications
You must be signed in to change notification settings - Fork 0
UZH MAS thesis "Machine indexing of institutional repositories: indexing Edoc with Annif as proof of concept" by Maximilian Hindermann, 2021
License
MHindermann/mas
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
About
UZH MAS thesis "Machine indexing of institutional repositories: indexing Edoc with Annif as proof of concept" by Maximilian Hindermann, 2021