Archive for March, 2008

Class Notes – Metadata

http://www.niso.org/standards/ resources/UnderstandingMetadata.pdf

- good resource

- database has to be merged with 21st century

- contribute to knowledge about our discipline

- data that’s reliable and structured to talk to other data – need standards

- many standards developed to meet different needs – museums, archives, libraries

- visual resources association – states – building collections and information

- depends on type of information and collection

- many metadata schemes – many different types of information, needs to meet different needs – has different needs

- need to decide what type is right for you, the institution, the collection, data type etc.

- need to follow standard, need to find a system (technology) to provide interface to the metadata

- ryerson library – search other libraries – worldwide libraries – libdex – index of libraries (search by country or opac vendors – vendors for software for library catalogs)

-OPAC – online public access catalog

- software option – will need to decide – piece of software to implement in the instittution

- GEH – TMS (relationship metadata)

- look at software packages for archives, museums

- interoperability – diff. systems can communicate/share information – usually do this through crosswalks

- crosswalks – translations/translators,

- metadata – can be separate from the object (i.e marc record)

- encoded – metadata is in the object, tagging

- understand how complex the schemas are

- Crosswalks – mapping schemas to each other – migrating data

- data conversions – big issue for most places – need to be interoperable

Exercise-

- marc to dublin core: lcweb.loc.gov/marc/marcdocz.html

- proprietary systems vs. open source (which is what we have been working with – php, myadmin)

- archivist wants to use proprietary systems

Class Notes – Building web database

Step 1 – Create Table

- phpmyadmin – http://www.imagearts.ryerson.ca/jpascoe/myadmin/

- select your database

- select table to work on – if it exists

- if not -create it:

- will lilst all tables that database already contains

- at bottom of list – option to create new table

- can modify inClass2 or start from scratch

- table name – describes contents

- number of fields: (may not know exactly, but should have fairly good idea) – figure out what info you need before you get to this point

- can add and remove at will, but be careful when removing

- Dublin Core – 15 fields

- click go – it will show you big table

- first one/field – put in primary key – My Table_ID – always INT (integer) – name by convention is your table with ID at the end . Type is always INT. No need to enter values, except under:

- Extra – auto_increment.

- column immediately to the right, click the button, makes the row primary key (called primary)

- primary key – field in a record that guarantees its unique, has to be unique.

- if the object has accession number and it is guaranteed as unique, then it can be used as primary key.

- CHAR – fixed length, VARChar – variable length up to a maximum

- 2nd Field: Author – make sure you set the length to a length that you know the authors name will stay within (max 255 – char), space is a character, if over limit, it will cut off the name.100

- Make it text if you can’t guarantee the text is below 255 – take more processing power, min. amount will be too big. Only use if your sure it will be over 255. VARCHAR and char are tiny and fast data types.

- Null: not null means that field has to have a value, has to have data entered (database will reject it saying you haven’t filled in the form correctly). Null means its ok to not fill this field in. mySQL ignores this.

- Title: text

- Date: can be vague or accurate. if data is the type that you will have accurate dates – all digital images have creation date, photographs may not. Date data type constrains you – but it will allow for calculations to be done with it.

- If you can’t guarantee that dates will be in right format, then you need to use text based (VARCHAR, CHAR) – but searches where you’re actually calculating becomes trickier.

- Only have 3 or 4 options to enter date – why you use vocabularies and authorities

- Or you could just have a year field – use INT

- Date: VARCHAR 30 characters

- intYear: INT – duplicating data but allow for calculations – allows for searches in the year/dates

- Naming the fields – rules apply

- Medium – VARCHAR – 30 – predefined listing for names (authority) – make it the maximum length of those canonical names (make up list and then count the longest)

- Users will not see these field names

- caption- text (scope) – set to null, not all images have a caption

-with text you don’t have to enter value

- alt_author – varchar – 100

- order of fields doesn’t matter – queries that you ask the database that will structure responses.

- subject_person (controlled form of the person’s name) VARCHAR 100

- subject_corporation – VARCHAR 100

- subject_topic – VARCHAR 100

- location – text

- click save

- if there is something wrong, there will be red letters

- color coding based on what part of the table

- if you want to change one field – click on the pencil

- gets this list from a database query – what you see here is what your database contains

- proofread table – check spelling

- click on insert – can enter data straight into database – can test the table

Step 2 – Move Database into Dreamweaver

- leave the phpmyadmin window open, for reference

- enter dreamweaver

- works in last 3 versions of dreamweaver – does not work in dreamweaver mx

- create new site – not ftp and rds server

- first page of site definition dialogue

give site a name – don’t put any spaces in name

inClass2

- http address can fill in later

http://www.imagearts.ryerson.ca/jpascoe/inClass2/

- want to use server technology – php mysql

- edit locally, then upload to remote testing server (80% of the time)

- where to store files – at business: default will be your username and your documents folder -on your mac will be your site

- ryerson: don’t use default – can save to weston drive

- my computer – Data d drive- create folder name same as site name

- always confirm that its correct

- connect locally (usually) or connect by ftp (in this case) – if in grad lab connect to user name

- FTP

- ftp.imagearts.ryerson.ca (find out from whoever is managing your server what the ftp is)

- folder to store files: ppcm07/jpascoe

- login: username

- password

- test connection – don’t move until you have connected to web server successfully

- URL: always have to fix this -

change ftp to http://www.imagearts.ryerson.ca/jpascoe/inClass2 (folder name that you put your site in)

- same answer as first page

- error message – click ok and move on

- site definition: no, to check in and check out

- done

- top folder is root folder

- right click on folder – create new – images (copy images into this folder)

- manage sites – edit – advanced – default images folder – navigate images folder we just created and select it

- Make a page

- Click on root folder – Create new file instead of new folder

- Descriptive title for pages – insert.php – open

- give page a title – insert all

- send data between server and page – create a form

- needs entry fields and submit button – send into database

- insert – 3rd button – forms – first button – create new form

- form – text entry fields – make connection to database

- application – databases – + – mysql connection -

- connection name- anyname you want (rules apply)

- my sql server – localhost

- user name -ryerson user name

- password – last 7 digits of student number

- click select – connecting to databases that you have – your name, and information_schema (never use this one) – select database that has same name as username

- username should appear in last field

- when you work somewhere else – database name will be different

- drum – database

- click + sign by database connection – results in three items – only one you care about is tables – click + by tables – click + by name of table to see the fields table contains

- can see the name of each field when creating insert form in site

- insert bar – forms – new form

- cursor dashing inside red square – part of form

- Form elements – text field

- Text field – single line entry – ID for text entry should match field name

- ID: filename (get caps the same) – instructions to computer

- Label: what data entry person will read – please enter the filename

- dpme

- label shows up – text field is there

- click right arrow to put cursor where u want it

- repeat

- ID: category, label: please enter the category

(will know based on ID how to link it to the table)

- Caption: need text area instead of field (next button) – because its longer

- label: type in the caption

- need two buttons: submit and reset

- button – on the forms section

- reset: highlight submit button – properties panel (bottom of page) – Action – 2 choices, submit or reset (by default it is submit) – click reset – button changes to reset – don’t need label

-SAVE

- need to tell dreamweaver what to do when buttons are clicked

- no field for primary key – don’t want to send value, already auto-increment

- need to define server behavior

- application -server behaviour – click + – insert record – which form to send (not advised to have more than one form per page) -

Insert record

automatically form 1

connection: my connection

insert table: inclass2

columns:

- database gets values from dreamweaver form

- after inserting , go to: insert2.php (select webpage you want it to go to after it sends the data – have it go back to this page)

- little golden dot at bottom of form – insert stuff into database and return to this page

- upload and test – yes to dependent files

- test it in a browser – http://www.imagearts.ryerson.ca/jpascoe/inclass2/insert2.php

- worked if now mysql errors – get question mark at end of url

- type in filename of image correctly

- create page to view records

- under site – new file

in class assignment – do it next week (niso framework document – table 2 – list of metadata) – dublin core (only 15 fields)

content standard – online template

think about database assignment – what fields you need

http://www.ncecho.org/

- viewrecords.php

- title – Viewrecords

- need to show records from database

- bind data in webpage to data in database – tab name is bindings

- application – bindings

- bindings tab – click + – recordset (query)

- think of records set as a search – defining a search

- name only matters if you have alot of recordsets – connection is the one already set up – will fit in the table with first one it finds – select table you want to grab data from – columns all – filter (none – want all of them) – sort by filename – ascending (can do author, last name etc.)

- Test will do the query for you

- inclass ID – primary keys generated by sql

- bindings – recordset (drum=table) – click +

- lightning bolts – dynamic data (database data)

- use a table to organize

- Table: insert bar – common or layout – table – one column and as many rows as the fields you want to show – i.e 4 fields you put in 4 rows

- table width – 200 min. (table will expand to fit whatever is in it)

ok

- first item you want to be the image – common -images – image – select image source – select file name from data sources – recordset will appear – filename; format – none; url will need to change, put in before angle bracket type in: images/

ok

- type something into alternate text – alt attribute – if image doesn’t appear – this text will show up (not something specific since don’t knowwhat image)

- other three rows will be info from database – drag lightning bolt fields from recordset and drop into table

SAVE

Files tabs – highlight viewrecords.php – upload and put dependent files

test webpage in browser -

if filename is wrong – get error for picture

- this is why you keep myadmin open

- set repeated region – make dreamweaver automatically repeat

- go back to dreamweaver (always test)

- select table (click inside what you want to select – dreamweaver shows hierarchy along bottom of window – click button table)

- application – server behaviour – click + – repeat region; show – all records; makes it show all records on the same page

- repeat region = grey square – everything inside will get repeated

- SAVE and UPLOAD

- refresh view on browser

- shows everything in database

- if you have stuff on the back

- have 2 fields for filename

- add one more field – add one at end of table

- field: filename_verso

- digital scan is part of the data

2 weeks to do in class assignment (do this again with all 10 of the images)

- dublin core – embedded metadata in database

Framework

Metadata-

- Metadata is structured information associated with an object.

- Different types of metadata can be added by different people at various stages of an information object’s life cycle.

- 3 basic kinds of metadata

- Descriptive metadata – helps users find objects and understand the subject or contents of objects.

- Administrative metadata – helps collection managers keep track of objects

- Structural metadata – documents relationship among objects

- creating broadly accessible descriptive metadata is a way to maximize access by current users and attract new user communities

- use of a standardized subject thesaurus provides greater precision in searching for end users and enables future functionality

- The approach to metadata standards must be made within the context of the organization’s pupose for digitizing the collection (users and intended usage, approaches adopted within the community, and the desired level of access)

Questions:

- Purpose of digital collection

- User needs and intended usage

- metadata standard selection and usage

Metadata Principle 2

-good metadata supports interoperability

- goal of interoperability: help users find and access information objects that are distributed across domains and institutions

- use of standard metadata schemes facilitates interoperability by allowing records to be exchanged and shared by systems that support the scheme

- another way is to map elements fromone scheme to those of another scheme – crosswalks

- helps users of one scheme to understand another

- allow records created according to one scheme to be coverted by program to another

for example: if a local metadata scheme is in use instead of a standard scheme, a crosswalk to some standard scheme should be develped for furture interoperability

- open archives initiative – harvestin protocol

- “metasearch” – cross-system searching – metadata remains in the source repository, but local search system accepts queries from remote search systems

3. Good metadata uses authority control and content standards such as controlled vocab

- consider vocabularies/knowledge of users

- whatever vocabularies are chosen, their use should be documented and guidelines provided to help metadata creators select terms consistently

Getty Vocabulary Program – thesauri for the visual arts

4. Good Metadata includes a clear statement on the conditions and terms of use for the digital object

5. Good Metadata supports the long-term management of objects in collections

- administration metadata: facilitate the management of resources

- preservation metadata: administration metadata simed at supporting the long-term retention of digital objects – needs to be compatible with worksflow – digital objects descriptions

- structural metadata relates the pieces of a compound object together

6. Good metadata records are objects themselves and should have the qualities of good objects

- “Meta-metadata” – stored info about metadata, should include the id of the institution that created it and what standards were used

Reading Notes – Intro to Vocabularies

 (ww.getty.edu/research/conductin_research/vocabularies/)

1. What is art and material culture?

- material culture refers to the broader realm of physicals objects produced by a culture

- cultural heritage – activities and the artifacts of activities that are a record of the life of a culture.

- many benefits of making art and material culture info available, including preserving the complexities and quality of cultural information for the use of future generations while making it accessible to people today.

- Images need the benefit of words, contextual information, otherwise it is “silent”, cannot tell the complete story

- Need contextual information to be fully appreciated.

- only when documented and interpreted can it tell the story

- Intellectual links (e.g. people, places etc) exist in art and material cultre and are made explicit in the analysis and recording of info.

- the role of documentation professions: organizing and managing information – this is essential in providing interpretation, analysis, and access to art and material culture.

Art and Material Culture Information: Ten Points

- making information available about art helps to encourage exploration by identifying universal themes, explaining contexts, and realing little-known facts.

- information about material culture provides context and interpretation necessary to “tell a story”. Makes factual information more meaninful and accessible.
- new internatioal audeiences

- Focusing on an “act of creation” : a work of art and the context of its creation form the core of art and material culture information

- primary reason for recording information about art is to document importan objects in order to protect and care for them

- also combined with additional research and related information in order to educate and entertain. raising central issues: intellectual property rights, historical accuracy, and multicultural points of view.

- Having a Point of View: the way material culture info is presented will reflect a particular point of view.

- full story of art and material culture is best told using words and pictures, not numbers.

- complexity of information presents a challenge to creating access

2. Documentation: Analyzing and Recording Information

- Documentation includes analyzing, organizing, and recording information in order to provide access to cultural heritage resources.

- dependent upon the holders of the information and the information itself

- 4 major approaches to documentation of material culture: archival, library, museum, and visual resources.

- there is a re-examining of these traditional practices in an attempt to deal with the digitization and networked access to materials.

The Archival Approach

- the arrangement and description of records, personal papers, and manuscripts.

- emphasizes the function and provenance of archival materials

- documents, images, artifacts etc.

- based on the creation of a finding aid – document that lists or describes a body of records w/in an archive, providing access to the user.

- describes collections, series and groups of related materials – instead of individual items

- based on well-established standards (MARC, RAD, APPM, EAD)

- includes methodologies

- based on the concept htat the collection “in hand” is unique material

- uses controlled vocabularies

- moving towards data-sharing initiatives – need for global access to unique and primary research material.

The Library Approach

- involves the cataloging and classification of books and other published textual materials. (bibliographic cataloging and classification)

- based on the concept that the item is one of many of the same things – data-sharing is seen as economically advantageous, copy cataloging is cheaper than original cataloging)

- guided by principles from national leader institutions (LOC)

- taught in schools of library and information science – curriculum: controlled vocabularies, authority practice, and subject analysis

- high value on subject access

- long-standing tradition of data sharing (OCLC)

- based on well-established standards: MARC format and AACR2

- authority work and controlled vocabularies (LCSH – most prevalent)

- computerized in the 1970s: public access to cataloged materials

- item-level cataloging (record is created for each item, not collection)

The Museum Approach

- involves the documentation of museum objects (works of art, artifacts, and specimens)

- complex, incoporating diverse topics of information about an object (physical description, provenance, conservation, photographic documentation and research data)

- item “in hand” is unique

- increase in data-sharing initiatives in the last several years (CHIN national inventories, 1972)

- uses classification schemes often based on departmental divisions

- incorporates images as well as textual data

- beginning to adopt authority control as basic practice

- gained importance as a valuable resource – re-purposed for public information systems – presenting info about collections to the public thru the internet.

- uses collection management systems that are geared to internal users (curators, registrar etc.) (K-EMU)

- uses diverse sources for terminology (ULAN, AAT, TGN and local lists)

- beginning to adopt standards such as CDWA

Visual Resources Approach

- provides access to images that enrich the ducation experience – many VR collections are in a university environment

- single items and set of images – complex levels of description – both item in hand and the content of the image

- working towards adoption of data standards (“Core Categories for Visual Resources” and MARC)

- vocab: AAT, ULAN, TGN and local lists

- places high value on subject access

- close ties with the museum documentation tradition in the description of iimages of museum objects

Vocabularies are the bridge

- vocabularies offer a common ground for different approaches – bring together resources

1. vocabularies being used are independent of systems, applications, and media

2. Vocabularies are structure to provide links between terms and concepts – creates an intellectual path

Follow

Get every new post delivered to your Inbox.

Join 58 other followers

%d bloggers like this: