{"id":2719,"date":"2014-07-06T20:00:35","date_gmt":"2014-07-07T00:00:35","guid":{"rendered":"http:\/\/www.maineinfonet.org\/mscs\/?page_id=2719"},"modified":"2015-07-04T00:46:44","modified_gmt":"2015-07-04T04:46:44","slug":"hathiexport","status":"publish","type":"page","link":"http:\/\/www.maineinfonet.org\/mscs\/about\/people\/technical-services-subcommittee\/hathiexport\/","title":{"rendered":"Hathi Members:  Exporting  Holdings"},"content":{"rendered":"<p>HathiTrust members are required to update their holdings yearly.<\/p>\n<ul>\n<li><a style=\"font-style: normal;\" href=\"http:\/\/www.hathitrust.org\/print_holdings\">Print Holdings file requirements from HathiTrust<\/a><\/li>\n<li>Files exported yearly around July 1: <a style=\"font-style: normal;\" href=\"https:\/\/www.dropbox.com\/sh\/2aclaejp73o6sib\/AAAKZFSbRCAWyfkEjahmcB7ya\">MEU<\/a> , <a style=\"font-style: normal;\" href=\"https:\/\/www.dropbox.com\/sh\/yieg5qizq9w05oh\/AACJ02EuaoPTuv_0IqF_NWSra?dl=0\">CBY<\/a>\u00a0(includes list of OCLC numbers submitted.)<\/li>\n<li><a href=\"#createlists\">Instructions for Creating Lists<\/a><\/li>\n<li><a href=\"#export\">Instructions for\u00a0Exporting and Formatting Files <\/a><\/li>\n<li><a href=\"#reminder\">Reminder of Leader and FormItem values<\/a><\/li>\n<\/ul>\n<p><a name=\"createlists\"><\/a><strong>Instructions for Creating Lists:<\/strong><\/p>\n<p>Hathi requires 3 separate .tsv (tab separated values) files.<\/p>\n<ol>\n<li>Single-part monographic holdings &#8211; leader 07 (bib level) = m or\u00a0i<\/li>\n<li>Multi-part monographic holdings &#8211; leader 07 (bib level) = a or c<\/li>\n<li>Serial holdings \u00a0&#8211; leader 07 (bib level) =s, b, or d<\/li>\n<\/ol>\n<p>Below are links to search strategies used in CBBCat and URSUS to pull print materials lists, including government documents. \u00a0Searches include restricting to materials with a valid OCLC 001, and eliminating form items equal to non print formats. 
\u00a0Note that in Create Lists the FormItem\u00a0search fields differ by rec type. \u00a0Thus 3 separate lists are created for each of the single-part and multi-part monograph sets, one per group of rec types. (It might be more efficient to dump the MARC records as one batch and parse them with a script.) \u00a0More on FormItem can be found in <a href=\"http:\/\/csdirect.iii.com\/manual\/gmil_lists_specify_criteria_spflds_bib.html\">CSDirect<\/a>. \u00a0Note also that suppressed and withdrawn items are included, as these are still considered holdings by Hathi. (See note below on tracking withdrawn materials.)<\/p>\n<p><a title=\"Saved Searches for Hathi Export \u2013 Colby\" href=\"http:\/\/www.maineinfonet.org\/mscs\/about\/people\/technical-services-subcommittee\/hathiexport\/hathicbysavedsearches\/\">Colby Saved Searches<\/a>:<\/p>\n<table>\n<tbody>\n<tr>\n<td><i>Hathi-CBY-Single-part-monograph-at<\/i><\/td>\n<td><i>Hathi-CBY-Multi-part-monograph-at<\/i><\/td>\n<\/tr>\n<tr>\n<td><i>Hathi-CBY-Single-part-monograph-cd\u00a0<\/i><\/td>\n<td>\u00a0<i>Hathi-CBY-Multi-part-monograph-cd<\/i><\/td>\n<\/tr>\n<tr>\n<td><i>Hathi-CBY-Single-part-monograph-ef<\/i><\/td>\n<td>\u00a0<i>Hathi-CBY-Multi-part-monograph-ef<\/i><\/td>\n<\/tr>\n<tr>\n<td>\u00a0Hathi-CBY-Series<\/td>\n<td><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Note below the fields that were used for Form Item in each saved search:<br \/>\n<img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-3257\" src=\"http:\/\/www.maineinfonet.org\/mscs\/wp-content\/uploads\/SearchStrategies-300x148.png\" alt=\"SearchStrategies\" width=\"400\" height=\"197\" srcset=\"http:\/\/www.maineinfonet.org\/mscs\/wp-content\/uploads\/SearchStrategies-300x148.png 300w, http:\/\/www.maineinfonet.org\/mscs\/wp-content\/uploads\/SearchStrategies-160x79.png 160w, http:\/\/www.maineinfonet.org\/mscs\/wp-content\/uploads\/SearchStrategies.png 1006w\" sizes=\"auto, (max-width: 400px) 100vw, 400px\" \/><\/p>\n<p>University of 
Maine (Orono): Saved search: Hathi-MEU-Monographs (Used in 2014 &#8211; may need to be updated to above form item fields)<\/p>\n<pre style=\"padding-left: 30px;\">(BIBLIOGRAPHIC  MARC Tag 001  starts with  \"ocn\"    OR \r\nBIBLIOGRAPHIC  MARC Tag 001  starts with  \"ocm\"    OR \r\nBIBLIOGRAPHIC  MARC Tag 001  matches  \"^001..(|a){0,1}[0-9]+\") AND \r\n(BIBLIOGRAPHIC  BRANCH  starts with  \"d\"    OR \r\nBIBLIOGRAPHIC  BRANCH  starts with  \"o\")    AND \r\n(BIBLIOGRAPHIC  REC TYPE  equal to  \"a\"    OR \r\nBIBLIOGRAPHIC  REC TYPE  equal to  \"c\"    OR \r\nBIBLIOGRAPHIC  REC TYPE  equal to  \"d\"    OR \r\nBIBLIOGRAPHIC  REC TYPE  equal to  \"e\"    OR \r\nBIBLIOGRAPHIC  REC TYPE  equal to  \"f\"    OR \r\nBIBLIOGRAPHIC  REC TYPE  equal to  \"t\")    AND \r\n(BIBLIOGRAPHIC  BIB LEVL  equal to  \"m\"    OR \r\nBIBLIOGRAPHIC  BIB LEVL  equal to  \"i\")    AND \r\nBIBLIOGRAPHIC  FormItem  not equal to  \"a\"    AND \r\nBIBLIOGRAPHIC  FormItem  not equal to  \"b\"    AND \r\nBIBLIOGRAPHIC  FormItem  not equal to  \"c\"    AND \r\nBIBLIOGRAPHIC  FormItem  not equal to  \"o\"    AND \r\nBIBLIOGRAPHIC  FormItem  not equal to  \"q\"    AND \r\nBIBLIOGRAPHIC  FormItem  not equal to  \"s\"    AND \r\nBIBLIOGRAPHIC  BRANCH  not equal to  \"oweb \"    AND \r\nBIBLIOGRAPHIC  BRANCH  not equal to  \"owebb\"    AND \r\nBIBLIOGRAPHIC  BRANCH  not equal to  \"dweb \"    AND \r\nBIBLIOGRAPHIC  BRANCH  not equal to  \"dwebb\"\r\n<\/pre>\n<p>See also the saved searches:\u00a0<em>Hathi-MEU-Multi-part-monograph \u00a0&amp; Hathi-MEU-Series<\/em><\/p>\n<p><a name=\"export\"><\/a><strong>Instructions for Exporting and Formatting Files:<\/strong><\/p>\n<p>Data can be exported using the Saved Export &#8220;Hathi&#8221;. \u00a0 This exports 001, bib number, 022, 008 position 28 (gov doc<a href=\"#note\">*<\/a>), and item Volume fields in a tab delimited format with multiple fields separated by semicolons. 
Note that the &#8216;Field delimiter&#8217; value &lt;9&gt; (the ASCII tab character) produces the tab-delimited format.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-2750\" src=\"http:\/\/www.maineinfonet.org\/mscs\/wp-content\/uploads\/IIIHathiExport.png\" alt=\"IIIHathiExport\" width=\"627\" height=\"350\" \/><\/p>\n<p><a name=\"note\"><\/a>*Since position 28 in the 008 of music (scores) is not the Gov Doc code, this may result in a few scores being coded as gov docs for Hathi, but the numbers are negligible.<\/p>\n<p>Hathi requires different fields to be included in the 3 files, as outlined below. \u00a0 They also request that government documents be flagged as such if possible. \u00a0 The exported files can either be edited manually to conform to the required specifications, changing the gov doc values of &#8216;s&#8217; and &#8216;f&#8217; to &#8216;1&#8217; and all others (including blank) to &#8216;0&#8217;, or you may use <a href=\"https:\/\/www.dropbox.com\/s\/8w50ne2fcyddaeo\/IIIExportFormatHathi.pl\">this Perl script<\/a>. \u00a0If using the script, simply change the 4 variables at the top (infile, outfile, oclcfile and outputformat) to match your scenario.<\/p>\n<p><b>Fields and Filenames\u00a0for Hathi Submission in Tab Delimited files:<\/b><\/p>\n<p style=\"padding-left: 30px;\">Single Part Monograph (filename: {symbol}_single-part_yyyymmdd.tsv):<br \/>\nOCLC, Bib #, Holding Status (blank), Condition (blank), Gov Doc<br \/>\nMulti Part Monograph (filename: {symbol}_multi-part_yyyymmdd.tsv):<br \/>\nOCLC, Bib #, Holding Status (blank), Condition (blank), Vol\/Copy, Gov Doc<br \/>\nSeries (filename: {symbol}_serials_yyyymmdd.tsv):<br \/>\nOCLC, Bib #, ISSN, Gov Doc<\/p>\n<p>Also note that Hathi will allow some access to materials that have been lost and withdrawn. This requires the member libraries to maintain a list of withdrawn material. 
The dropbox folders listed at the top of this document include lists of OCLC numbers output, and the script mentioned in the previous paragraph will generate a list of OCLC numbers of the current export. \u00a0By comparing the two lists it would be possible to determine what has been withdrawn over the year. \u00a0 \u00a0Here&#8217;s another <a href=\"https:\/\/www.dropbox.com\/s\/nsmrrq9c731wxzg\/CompareListsofOCLC.pl\">small perl script to compare lists of OCLC numbers<\/a>, and <a href=\"https:\/\/www.dropbox.com\/s\/rxr4z4oyn4yn6cb\/AppendMissingHathi.pl?dl=0\">append missing numbers to monographs file<\/a> (assuming most of these would be monographs.)<\/p>\n<p><strong>Number of records Submitted:<\/strong><\/p>\n<table border=\"1\" cellspacing=\"2\" cellpadding=\"2\">\n<tbody>\n<tr>\n<td align=\"LEFT\" bgcolor=\"#E6E6FF\">Colby &#8211; 2015<\/td>\n<td align=\"LEFT\" bgcolor=\"#E6E6FF\"><\/td>\n<\/tr>\n<tr>\n<td align=\"LEFT\">Single-part monographs<\/td>\n<td align=\"RIGHT\">463,552<\/td>\n<\/tr>\n<tr>\n<td align=\"LEFT\">Multipart monographs<\/td>\n<td align=\"RIGHT\">89<\/td>\n<\/tr>\n<tr>\n<td align=\"LEFT\">Serials<\/td>\n<td align=\"RIGHT\">3,638<\/td>\n<\/tr>\n<tr>\n<td align=\"LEFT\"><\/td>\n<td align=\"LEFT\"><\/td>\n<\/tr>\n<tr>\n<td align=\"LEFT\" bgcolor=\"#E6E6FF\">University of Maine &#8211; 2014<\/td>\n<td align=\"LEFT\" bgcolor=\"#E6E6FF\"><\/td>\n<\/tr>\n<tr>\n<td align=\"LEFT\">Single-part monographs<\/td>\n<td align=\"RIGHT\">1,110,693<\/td>\n<\/tr>\n<tr>\n<td align=\"LEFT\">Multipart monographs<\/td>\n<td align=\"RIGHT\">825<\/td>\n<\/tr>\n<tr>\n<td align=\"LEFT\">Serials<\/td>\n<td align=\"RIGHT\">42,449<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><a name=\"reminder\"><\/a><strong>Reminder of Leader and FormItem values\u00a0<\/strong><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-2755\" src=\"http:\/\/www.maineinfonet.org\/mscs\/wp-content\/uploads\/leaderhints.png\" alt=\"leaderhints\" width=\"637\" 
height=\"299\" srcset=\"http:\/\/www.maineinfonet.org\/mscs\/wp-content\/uploads\/leaderhints.png 637w, http:\/\/www.maineinfonet.org\/mscs\/wp-content\/uploads\/leaderhints-160x75.png 160w, http:\/\/www.maineinfonet.org\/mscs\/wp-content\/uploads\/leaderhints-300x140.png 300w\" sizes=\"auto, (max-width: 637px) 100vw, 637px\" \/><\/p>\n<hr \/>\n<h6 style=\"text-align: center;\"><a href=\"http:\/\/www.maineinfonet.net\/mscs\/\">MSCS<\/a> &gt;&gt; <a href=\"http:\/\/www.maineinfonet.net\/mscs\/about\/people\/\">People<\/a> &gt;&gt; <a href=\"http:\/\/www.maineinfonet.net\/mscs\/about\/people\/technical-services-subcommittee\/\">Technical Services Subcommittee<\/a>\u00a0&gt;&gt;\u00a0Hathi Members: Export Holdings<\/h6>\n","protected":false},"excerpt":{"rendered":"<p>HathiTrust members are required to update their holdings yearly. Print Holdings file requirements from HathiTrust Files exported yearly around July 1: MEU , CBY\u00a0(includes list of&hellip;<\/p>\n","protected":false},"author":5,"featured_media":0,"parent":202,"menu_order":0,"comment_status":"open","ping_status":"open","template":"","meta":{"footnotes":""},"class_list":["post-2719","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"http:\/\/www.maineinfonet.org\/mscs\/wp-json\/wp\/v2\/pages\/2719","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/www.maineinfonet.org\/mscs\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"http:\/\/www.maineinfonet.org\/mscs\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"http:\/\/www.maineinfonet.org\/mscs\/wp-json\/wp\/v2\/users\/5"}],"replies":[{"embeddable":true,"href":"http:\/\/www.maineinfonet.org\/mscs\/wp-json\/wp\/v2\/comments?post=2719"}],"version-history":[{"count":30,"href":"http:\/\/www.maineinfonet.org\/mscs\/wp-json\/wp\/v2\/pages\/2719\/revisions"}],"predecessor-version":[{"id":3265,"href":"http:\/\/www.maineinfonet.org\/mscs\/wp-json\/wp\/v2\/pages\/2719\/revisions\/3265"}],"up":[{"embeddable":true,"hr
ef":"http:\/\/www.maineinfonet.org\/mscs\/wp-json\/wp\/v2\/pages\/202"}],"wp:attachment":[{"href":"http:\/\/www.maineinfonet.org\/mscs\/wp-json\/wp\/v2\/media?parent=2719"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}