This is a test message.
This is only a test of the Emergency Test Messaging Service.
This is only a message.
In case of a real emergency you would be instructed to
Contact your nearest Code4Lib subscriber.
This is only test message.
—
Eric “Test Message” Morgan
On Jan 17, 2014, at 2:54 PM, Emily Lynema wrote:
> And with that, registration is now closed!… Waitlist:
> https://docs.google.com/a/ncsu.edu/forms/d/1Vuo7g7xbNeGCQywwngkAt4OMdX0k3wsoiwCrxJtdx6k/viewform
Geesh! That was fast. I’m impressed. —Eric Morgan
If you were to select a set of RDF ontologies intended to be used in the linked
data of archival descriptions, then what ontologies would you select?
For simplicity's sake, RDF ontologies are akin to the fields in MARC records or
the entities in EAD/XML files. Articulated more accurately, they a
On Jan 20, 2014, at 11:12 AM, Robert Sanderson wrote:
> Have you considered the LOCAH work in mapping EAD into Linked Data?
>
> * http://archiveshub.ac.uk/locah/
> * http://data.archiveshub.ac.uk/
Robert, yes, and these are and were very good examples. ‘Ahead of their time,
they were. —Eric
On Jan 19, 2014, at 10:58 AM, Karen Coyle wrote:
>> If you were to select a set of RDF ontologies intended to be used in the
>> linked data of archival descriptions, then what ontologies would you select?
>
> There is an ontology for archival description developed by the Europeana
> project. T
A couple of days ago I wrote:
> If you were to select a set of RDF ontologies intended to be used in the
> linked data of archival descriptions, then what ontologies would you
> select?
And in response Ben Companjen wrote the following
post, which I think is absolutely wonderful. So wonderful i
> Finally, 1600 properties... good luck with that.
ROTFL!!! ―ELM
/projects/loc-recollect/
[2] ViewShare - http://viewshare.org/about/community/
—
Eric Lease Morgan
—
Eric Lease Morgan
Correct. It is all but a waste of time. --ELM
On Feb 18, 2014, at 3:01 PM, Owens, Trevor wrote:
> Sorry for not getting back on this sooner. I help project manage Viewshare at
> LC. At this point, there are a few folks I know of who have set up their own
> instances of the software to tinker with but for the most part people are
> just ha
> So, we're shutting it [Code4Lib] down?
We interrupt this program for an important statement:
Before things get out of hand and rumors start flying, there are
no plans about shutting down the mailing list. An individual
simply wanted to be unsubscribed, and that has been done.
Now back
udimentary queries I can send to
SPARQL endpoints, get results, and begin to drill down without really knowing
the ontologies? I’m stymied in this regard.
[1] announcement -
http://blogs.getty.edu/iris/art-architecture-thesaurus-now-available-as-linked-open-data/
[2] data home - http://voca
/archive/2004/200312/0012.html
—
Eric Lease Morgan
Does anybody here work in Rome (Italy). I see a “busman’s holiday” in my
future. —Eric Lease Morgan, University of Notre Dame
, something like D2RQ
could be put on top of the ArchiveSpace database to expose the underlying
content as RDF. What do you think?
[1] D2RQ - http://d2rq.org
—
Eric Lease Morgan
University of Notre Dame
On Mar 6, 2014, at 9:47 AM, Mark A. Matienzo wrote:
> ArchivesSpace has a REST backend API, and requests yield a response in
> JSON. As one option, I'd investigate to publish linked data as JSON-LD.
> Some degree of mapping would be necessary, but I imagine it would be
> significantly easier to t
Let me ask a more direct question. If participating in linked data is a “good
thing”, then how do you — anybody here — suggest archivists (or librarians or
museum curators) do that starting today? —Eric Morgan
On Mar 6, 2014, at 1:37 PM, Ethan Gruber wrote:
>> Let me ask a more direct question. If participating in linked data is a
>> “good thing”, then how do you — anybody here — suggest archivists (or
>> librarians or museum curators) do that starting today? —Eric Morgan
>
> I think that RDFa provide
I wonder whether there are enough people and enough interest to organize a
Code4Lib Italy event. Hmmm... —Eric Lease Morgan
I’ve done a tiny bit of investigation, and as alluded to previously, there
seems to be a higher concentration of Code4Lib-like activity around Bologna and
Padua. And this sort of “event” is intended to be smaller and more informal
rather than larger and more structured. It requires a time, a pla
A few years ago I did a bit of analysis against the feed —
http://infomotions.com/blog/2011/03/constant-chatter-at-code4lib/ —ELM
know of others? Am I
missing something significant?
—
Eric Lease Morgan
Can somebody here point me to an RDF file describing the VIAF ontology? Does an
authoritative list of VIAF classes and properties exist, and if so, where can I
get a copy? —Eric Lease Morgan
On Apr 7, 2014, at 10:52 PM, Richard Wallis
wrote:
> Is this what you are looking for: http://viaf.org/viaf/data/
Alas, no, not really. I’m looking for an RDF file listing the classes and
properties used by VIAF. VIAF can return RDF for specific entities, as in curl:
curl -L -H "accept: ap
On Apr 7, 2014, at 11:25 PM, Richard Wallis
wrote:
> You are correct that in the xmlns namespace definitions in the xml <
> http://viaf.org/viaf/231063554/rdf.xml> references a namespace that is no
> longer used in the output. There is a single commented out reference to '
> viaf:NameAuthorityC
> On Apr 17, 2014, at 9:13 PM, "Miles Fidelman"
> wrote:
>
> Give up and let chaos reign supreme?
Yep! That's what I would do. -- ELM
Do usability studies to demonstrate to others how things can be improved. --ELM
> On May 8, 2014, at 12:36 PM, Jodi Schneider wrote:
>
> Eric -- are you still the list owner? j...@code4lib.org already uses Job: as
> a prefix -- so I would suggest adding "Job" as a topic, setting
> Default-Topics= Job,OTHER (unless all-caps is requisite?) If this works,
> nobody should hav
All of this points to fragility of computer systems. This relies on that. That
relies on the other thing. Repeat. I can see how it is frustrating, and I’ve
experienced it too many times. After I while — IMHO — the whole things becomes
more effort than it it worth, and it is scary because more ti
Here is the tiniest bit of mailing list administratativa: the list now contains
close two 2,800 subscribers, and based on my antidotal observations, it
subscription size increase by approximately five new subscriptions per week. I
think we have a strong, vibrant, and growing community. —ELM
t the job
postings no matter what.
Special thanks go to Jodi Schneider and Joe Hourcle who pointed
me in the direction of this LISTSERV functionality. Thank you!
The LISTSERV topics feature is new to me, and I hope it works as advertised. I
think it will.
[1] blog posting - http://bit.ly
On May 16, 2014, at 3:46 PM, Andreas Orphanides wrote:
>> Library Electronic Resources Specialist
>> Raritan Valley Community College
>> Branchburg Township, New Jersey
>> ColdFusion, EZproxy, JavaScript, Personal computer hardware
>> http://jobs.code4lib.org/job/13115
I’d rather see the whole c
$full_ads++
--
http://orcid.org/-0002-9952-7800
c, most of the time.
* Apparently “administratativia” is a word of my own design because a search of
it in Google returns only postings I’ve written. No wonder my spell checker
doesn’t like it.
[1] description - http://dh.crc.nd.edu/sandbox/mailing-lists/code4lib/
—
Eric Lease Morgan
ORDID and ResearcherID and Scopus, oh my!
It is just me, or are there an increasing number of unique identifiers popping
up in Library Land? A person can now be identified with any one of a number of
URIs such as:
* ORCID - http://orcid.org/-0002-9952-7800
* ResearcherID - http://www.re
> * ORCID - http://orcid.org/-0002-9952-7800
> * ResearcherID - http://www.researcherid.com/rid/F-2062-2014
> * Scopus - http://www.scopus.com/authid/detail.url?authorId=25944695600
> * VIAF - http://viaf.org/viaf/26290254
> * LC - http://id.loc.gov/authorities/names/n94036700
> * ISNI -
On Jun 4, 2014, at 3:32 PM, Jodi Schneider wrote:
> However, I believe that ISNI is bridging between these various sources --
> certainly including LC and VIAF [1], and also ORCID [2].
>
> [1] http://www.isni.org/content/data-contributors
> [2] From http://orcid.org/content/what-relationship-bet
> C4L is not a democracy but an anarchy.
Sometimes. We vote on conference locations. We vote on keynote talks. We vote
for presentations. Everybody had multiple opportunities to voice their opinion.
I think this vote should count too. —ELM
Is ORCID an acronym, and if it is then what does it stand for? —ELM
On Jun 10, 2014, at 4:07 PM, Masamitsu, Pam wrote:
> http://orcid.org/faq-page#n110 - ORCID is an acronym, short for Open
> Researcher and Contributor ID.
Resolved. Thanks! —ELM
On Jun 16, 2014, at 6:44 AM, Johan Oomen wrote:
>> 2.805.000 Euros Funding for Future Memory Standards!
>>
>> The PREFORMA call for tender has been published on June 12th, 2014. Proposal
>> submission deadline: August 12th, 2014. Budget: 2.805.000 EUR. —
>> http://www.preforma-project.eu/tende
|X| |
+-+--+
ORCID | | X|
+-+--+
ISNI | | |
+-+------+
—
Eric Lease Morgan
On Jun 20, 2014, at 10:56 AM, Richard Wallis
wrote:
> authority control|simple identifier |Linked Data capability
> +-+--+--+
> VIAF |X|X | X |
> +-+
Do you know of a Web-based tool or piece of desktop software that would let a
professor post a text in a frame, then highlight words or phrases and link them
to a glossary? A quick-and-dirty web page (possibly attached) and link below
illustrates the idea:
http://dh.crc.nd.edu/tmp/glossary.ht
SemTechBiz for making this
possible! And feel free to give me a shout if you have any questions.
Agenda
Publishing, Sharing, and Opening
* 8:30-9:30 Sylvia Southwick and Cory Lampert, UNLV, Librarians'
adventure into LODLAM
* 9:30-10:20 Eric Lease Morgan, Notre D
On Jul 1, 2014, at 9:12 AM, Katie wrote:
> Has anyone here experience in the world of natural language programming
> (while applying information retrieval techniques)?
>
> I'm currently trying to develop a tool that will:
>
> 1. take a pdf and extract the text (paying no attention to images
I have been a card-carrying librarian since 1987, but I have been writing
computer programs since 1976. I became a librarian because I heard about the
“information explosion”, and I thought it would be a growth industry. —ELM
This is mailing list administratativia. I don’t know why, but the subscription
base seem to now increase by two or three a day. I guess this is a good day.
Maybe I ought to visualize this. Hmmm... —ELM
>From my server’s MOTD, and I thought it was fun:
Here I sit, broken-hearted,
All logged in, but work unstarted.
First net.this and net.that,
And a hot buttered bun for net.fat.
The boss comes by, and I play the game,
Then I turn back to net.flame.
Is there a cure (I need your views),
For
Does anybody here have any experience with the Elsevier API Program? [1]
Apparently, through Elsevier’s TDM (text and data mining) API a person can get
the full text of Elsevier content, after being granted an access key. As per
their instructions, I used the following curl command to try to dow
On Jul 14, 2014, at 5:29 PM, Ben Companjen wrote:
> This will not answer your question, but LIBER has a response to the TDM API
> policies:
>
> http://libereurope.eu/news/over-40-signatories-ask-elsevier-to-withdraw-tdm-policy/
>
> Peter Murray-Rust posted quite a few times about it on his blo
I’m surprised and impressed with the way Code4Lib Midwest is shaping up —
http://wiki.code4lib.org/index.php/Midwest Cool! —ELM
> http://wiki.code4lib.org/index.php/Midwest
I attended Code4Lib Midwest last week, and from my perspective it was a roaring
success. Kudos to Kyle Felker, Erin Fisher, Eric Kunnen, Kristin Meyer, Matthew
Reidsma, and anybody else I may have missed for putting on such a
well-organized and fulf
I’m not for bubble gum and duct tape. But I also realize that when I’ve got a
hammer everything begins to look like a nail.
After having made those two ambiguous statements I would ask myself, “What is
the problem I am trying to solve?” If you want to make your data available in a
linked data
Albiet a bit late, I very recently learned that the DOAJ is asking journals
like ours (Code4Lib Journal) to resubmit our application to be in the
directory. [1] From a Nature article:
Now, following criticism of its quality-control checks, the
website [DOAJ] is asking all of the journals in
> ...But there are few programmer projects that would require zero maintenance
> once finished…
This is a bit out of context, but a Buddhist monk once said, “Software is never
done. If it were, then it would be called hardware.” —Eric Morgan
The following message about the Crowdsourcing Consortium for Libraries and
Archives project is being forwarded upon request. —ELM
From: Christina
Subject: Please Distribute [Crowdsourcing Consortium for Libraries and
Archives project]
Date: August 27, 2014 at 10:15:16 AM EDT
…this mess
This is at test, and hopefully the only test of the Mr. Serials Process against
my Code4Lib mailing list archive. Delete me. —Eric
+ --ELM
On Jul 14, 2014, at 5:00 PM, Eric Lease Morgan wrote:
> Does anybody here have any experience with the Elsevier API Program? [1]
>
> [1] Elsevier API - http://www.developers.elsevier.com/cms/
I have had tiny success with the Elsevier API Program.
I first created an API key th
> On Oct 23, 2014, at 11:45 AM, Joe Hourcle
> wrote:
>
>> I found this blog post talking about CrossRef's support:
>>
>> http://www.crossref.org/CrossTech/2011/04/content_negotiation_for_crossr.html
>>
>> But I know DataCite supports it to some extent too.
>>
>> Does anyone know if there's ov
Learning Unix is not necessarily the problem to solve. Instead it is means to
an end.
To my mind, there are number of skills and technologies a person needs to know
in order to provide (digital) library service. Some of those
skills/technologies include: indexing, content management (databases
On Dec 2, 2014, at 10:01 AM, Heidi Elaine Dowding wrote:
> We are now accepting proposals for publication in our 28th issue, a special
> issue on diversity in library technology.
Cool. Have the set of editors for this diversity issue been identified, and if
so, then what are their names? —ELM
On Dec 2, 2014, at 10:08 AM, Heidi Elaine Dowding wrote:
>>> We are now accepting proposals for publication in our 28th issue, a special
>>> issue on diversity in library technology.
>>
>> Cool. Have the set of editors for this diversity issue been identified,
>> and if so, then what are their n
On Dec 8, 2014, at 12:57 PM, Dana Jemison wrote:
> Looks like the recommended hotel is already filled up. Are there any other
> options close by?
Mine is a unsolicited comment/endorsement for AirBnB as an additional source of
accommodations, if it does not hurt the conference planning process
On Dec 9, 2014, at 8:25 AM, Kyle Banerjee wrote:
> I've just started a project that involves harvesting large numbers of
> scanned PDF's and extracting information from the text from the OCR output.
> The process I've started with -- use imagemagick to convert to tiff and
> tesseract to pull out
On Dec 17, 2014, at 9:52 AM, Nicola Carboni wrote:
> I am collecting some resources (beginner level) in order to start using
> Virtuoso (OpenSource Edition) for a project I am working with. I would like
> to use it both for hosting triples and for its sponger (CSV to RDF). I
> sincerely never
On Dec 17, 2014, at 10:10 AM, Mixter,Jeff wrote:
> If you want to test out a bare-bones triple store, I would suggest 4Store
> (http://4store.org/). It has pre-compiled installs for Unix and Unix-like
> systems (although not Windows). It supports SPARQL 1.1 and is relatively easy
> to tweak/co
I don’t know about y’all, but it seems to me that things like linked data and
open access are larger trends in Europe than here in the United States. Is
there are larger commitment to sharing in Europe when compared to the United
States? If so, is this a factor based on the nonexistence of a nat
It is so cool that we have “franchises”. —Eric Morgan
I’m curious, how large is LITA (Library and Information Technology
Association)? [0] How many members does it have?
[0] LITA - http://www.ala.org/lita/
—
ELM
>> I’m curious, how large is LITA (Library and Information Technology
>> Association)? [0] How many members does it have?
>
> Apparently it has around 3000 members this year. I found this on the ALA
> membership statistics page:
>
> http://www.ala.org/membership/membershipstats_files/divisionstat
On Jan 5, 2015, at 11:25 AM, Sylvain Machefert
wrote:
>> Interesting and thank you. Code4Lib only needs fifty more subscribers to
>> equal LITA’s size. I think this just goes to show, with the advent of the
>> Internet, centralized authorities are not as necessary/useful as they once
>> used
On Jan 5, 2015, at 1:35 PM, Karen Coyle wrote:
> 1) Everyone should read at least the first chapters of the Allemang book,
> Semantic Web for the Working Ontologist:
> http://www.worldcat.org/title/semantic-web-for-the-working-ontologist-effective-modeling-in-rdfs-and-owl/oclc/73393667
+2 becau
Does anybody here know how to extract circulation statistics from an library
catalog? Specifically, given a date range, are you able to create a list of the
most frequently borrowed books ordered by the number of times they’ve been
circulated?
I have a colleague who wants to digitize sets of mo
The replies received have all been very helpful. Thank you! —Eric M.
It is a joy to manage this mailing list, and I say that with all sincerity.
—Eric Morgan
Can somebody point me to a good tutorial on how to index Word documents using
Solr?
I have a few hundred Microsoft Word documents I want to search. Through the use
of the Tika library it seems as if I ought to be able to index my Word
documents directly into Solr, but none of the tutorials I ha
On Feb 10, 2015, at 11:46 AM, Erik Hatcher wrote:
> First, with Solr 5, it’s this easy:
Where can I download Solr 5 because none of the other version seem to be
complete. —ELM
On Feb 10, 2015, at 11:46 AM, Erik Hatcher wrote:
> bin/post -c collection_name /path/to/file.doc
The almost trivial command to index a Word document in Solr, above, is most
certainly appealing, but I’m wondering about the underlying index’s schema.
Tika makes every effort to extract as much m
How do I retain diacritics in a Solr index, and how to I search for words
containing them?
I have extracted the plain text out of set of Word documents. I have then used
a Perl interface (WebService::Solr) to add the plain text to a Solr index using
a field type called text_general:
[The following announcement is being passed on by request. —ELM]
Assistant Manager – Websites and Online Engagement Digital Services
Vancouver Public Library (VPL) is seeking a dynamic, strategic, and creative
Assistant Manager to join the Digital Services department. Reporting to the
Manager,
I know the documents I’m indexing are written in Spanish, and adding the
following filters to my field definition, I believe I have resolved my problem:
In other words, my searchable content is defined thus:
And “text_general” is defined to include the filters in both the index and
q
On Feb 16, 2015, at 4:54 PM, Levy, Michael wrote:
> I think you can accomplish what you want by using ICUFoldingFilterFactory
> https://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.ICUFoldingFilterFactory
>
> which should simply perform ICU (cf http://site.icu-project.org/) based
>
Italy - https://www.facebook.com/KohaGruppoItaliano
—
Eric Lease Morgan
On Feb 25, 2015, at 2:48 PM, Esmé Cowles wrote:
>> In the non-techie library world, linked data is being talked about (perhaps
>> only in listserv traffic) as if the data (bibliographic data, for instance)
>> will reside on remote sites (as a SPARQL endpoint??? We don't know the
>> technical i
On Feb 25, 2015, at 3:12 PM, Sarah Weissman wrote:
> I am kind of new to this linked data thing, but it seems like the real
> power of it is not full-text search, but linking through the use of shared
> vocabularies. So if you have data about Jane Austen in your database and
> you are using the s
On Feb 26, 2015, at 9:48 AM, Owen Stephens wrote:
> I highly recommend Chapter 6 of the Linked Data book which details different
> design approaches for Linked Data applications - sections 6.3
> (http://linkeddatabook.com/editions/1.0/#htoc84) summarises the approaches as:
>
> 1. Crawling P
Code4Lib is now 3,082 subscribers strong. Yeah! Almost time to do some
analysis. —ELM
If a peson could denote the characteristics of both the main (female) character
as well as the protagonist, then bits of natural language processing (text
mining) might be able to address this problem. —Eric “When You Have A Hammer,
Everything Begins To Look Like a Nail” Morgan
as do to the large NLTK
library, but now I’m not so sure. It is just me, or is Python really s l o o o
w ? Is there anything I can do to improve/optimize my Python code?
—
Eric Lease Morgan
#!/usr/bin/env python2
# json2catalog.py - create a "catalog" from a set of HathiTrust json file
On May 18, 2015, at 9:23 PM, Galen Charlton wrote:
>> I have two scripts, attached. They do EXACTLY the same thing
>> in almost EXACTLY the same manner, but the Python script is
>> almost 25 times slower than the Perl script:
>
> I'm no Python expert, but I think that the difference is much more
atalog” — http://bit.ly/thoreau-catalog
[9] source code - http://ntrda.me/1Q8pPoI
[10] HathiTrust Research Center - https://sharc.hathitrust.org
—
Eric Lease Morgan, Librarian
University of Notre Dame
On May 26, 2015, at 11:30 AM, Eric Lease Morgan wrote:
> In my copious spare time I have hacked together a thing I’m calling the
> HathiTrust Research Center Workset Browser, a (fledgling) tool for doing
> “distant reading” against corpora from the HathiTrust. [0]
>
> [
On May 27, 2015, at 6:33 PM, Karen Coyle wrote:
>> In my copious spare time I have hacked together a thing I’m calling the
>> HathiTrust Research Center Workset Browser, a (fledgling) tool for doing
>> “distant reading” against corpora from the HathiTrust. [0, 1] ...
>>
>> 'Want to give it a t
On Jun 1, 2015, at 4:33 AM, davesgonechina wrote:
> If your *institutional* email address is not on their whitelist (not sure
> if it is limited to subscribing ones, they don't say) you cannot register
> using the signup form, instead you can only request an account by briefly
> explaining why yo
On Jun 1, 2015, at 10:58 AM, davesgonechina wrote:
> They just informed me I need a .edu address. Having trouble understanding
> the use of the term "public domain" here.
Gung fhpx, naq fbhaqf ernyyl fbeg bs fghcvq!! --RYZ
support for the digital humanities and text mining.
[1] HathiTrust Research Center - http://hathitrust.org/htrc
[2] blog posting describing the "Browser" - http://ntrda.me/1FUGP2g
—
Eric Lease Morgan
University of Notre Dame
I believe I have created a repository of my HTRC Workset Browser code (shell
and Python scripts) on GitHub. [1] From the Quick Start section of the README:
1. Download the software putting the bin and etc directories in the same
directory.
2. Change to the directory where the bin and etc dir
1 - 100 of 725 matches
Mail list logo