[CODE4LIB] EDUC: Introduction to Text Encoding starts next week, 9/11 last day to enroll

2021-09-03 Thread Lacey Torge
***Apologies for cross-posting*** *Introduction to Text Encoding* Dates: September 6 - October 3 Instructor: John Russell Credits: 1.5 CEUs or 15 PDHs Price: $200 https://libraryjuiceacademy.com/shop/course/133-introduction-to-text-encoding This course will introduce students to text encoding a

[CODE4LIB] Call for Proposals: Facing the Future of Resource Sharing - an Amigos online conference

2021-09-03 Thread Emily Nimsakont
Facing the Future of Resource Sharing - an Amigos Library Services online conference: https://www.amigos.org/facing Take a closer look at the future of resource sharing with Amigos Library Services on December 8, 2021, as we explore resource sharing trends on the horizon and ways of providing p

Re: [CODE4LIB] Comparing OCR output to dictionary

2021-09-03 Thread Esmé Cowles
Kim- Yes, I have a script that Jon Stroop wrote to do something very similar: https://github.com/pulibrary/pulfa-sausage-factory/blob/master/bin/orient_image.sh The use case was a little different (orient the

[CODE4LIB] Job: Associate University Librarian at University of Georgia Libraries

2021-09-03 Thread Code4Lib Jobs
 The Associate University Librarian (AUL) is a member of the senior leadership team at the University of Georgia Libraries. The position is responsible to the  University Librarian and Associate Provost for leading and advising on matters relating to the development and implementation of the m

[CODE4LIB] Job: IT PROGRAMMER ANALYST – HEADQUARTERS at Sonoma County Library

2021-09-03 Thread Code4Lib Jobs
Sonoma County Library Announces an Employment Opportunity IT PROGRAMMER ANALYST – ROHNERT PARK HEADQUARTERS 40 HOURS PER WEEK – FULL TIME   THE POSITION: Under general supervision, the IT Programmer Analyst manages and administers Information Technology (IT) systems for the Sonoma County

[CODE4LIB] Job: Instructional Technologist at Wheaton College

2021-09-03 Thread Code4Lib Jobs
Under the direction of the Director of Research and Instruction within Library Services, the Instructional Technologist provides expert guidance in the application of instructional technologies and acts as a point of contact, collaboration, and outreach for teaching and learning with technolo

[CODE4LIB] Job: Head of Research and Access Applications at University of Kentucky

2021-09-03 Thread Code4Lib Jobs
The University of Kentucky Libraries is accepting applications for the Head of Research and Access Applications Librarian position: https://ukjobs.uky.edu/postings/349679. The position is responsible for the overall planning, coordination, and management of key applications that support user

[CODE4LIB] Job: Web Applications Librarian at The Catholic University of America

2021-09-03 Thread Code4Lib Jobs
The libraries of The Catholic University of America have as their purpose to support and enrich the instructional, research, and service programs of the university. Electronic Resources and Services is responsible for providing the technology and resources necessary for quality instruction an

[CODE4LIB] Job: Digital Library Developer at University College Cork

2021-09-03 Thread Code4Lib Jobs
UCC wishes to appoint to the role of Digital Library Developer.  Reporting to the Head of Research & Digital Services, UCC Library and working closely with colleagues in Library and IT Services, the Digital Library Developer will analyse evolving business needs, identifying and implementing a

Re: [CODE4LIB] JSON-LD API

2021-09-03 Thread Bigwood, David
Thanks for the pointers. Sometimes it helps to look at examples of what the results should look like. Dave From: Code for Libraries on behalf of Vo?, Jakob Sent: Thursday, September 2, 2021 2:58 AM To: CODE4LIB@LISTS.CLIR.ORG Subject: [CODE4LIB] AW: [CODE4LIB]

Re: [CODE4LIB] Comparing OCR output to dictionary

2021-09-03 Thread Sarah Swanz
Sorry, I’m told the links got lost in transit: Notebook:  https://data.nls.uk/tools/jupyter-notebooks/exploring-britain-and-uk-handbooks/ Fastwer article:  https://towardsdatascience.com/evaluating-ocr-output-quality-with-character-error-rate-cer-and-word-error-rate-wer-853175297510  (and links t

Re: [CODE4LIB] Comparing OCR output to dictionary

2021-09-03 Thread Sarah Swanz
This Jupyter notebook from the National Library of Scotland has a section on how to evaluate OCR accuracy under the Data Cleaning chapter. You might also check out the 'fastwer' package described in this article. I have not used myself so cannot attest to it. Sarah Swanz University of Michigan,

[CODE4LIB] Call for Papers: CCQ Special Issue on Empowering Representations

2021-09-03 Thread Leigh, Katharine
Call for Papers: Cataloging & Classification Quarterly (CCQ) Special Issue Empowering Representations: Rethinking Surrogates from the Margins The process of describing and categorizing a tangible or intangible object for inclusion in the collection of a cultural heritage institution (CHI) typica

Re: [CODE4LIB] Comparing OCR output to dictionary

2021-09-03 Thread Eric Lease Morgan
On Sep 2, 2021, at 4:07 PM, Kimberly Kennedy wrote: > I was wondering if anyone has created a script or tool to compare the words > in a text file to a dictionary? I'm looking for a way to quantify the > quality of OCR output. I've heard that counting the number of words that > are in the diction