Friday, October 23, 2009

LIS 2670 week8 Access in digital libraries part 2 reading notes

  • Chapter 1. Definition and Origins of OAI-PMH.oai-pmh-ch1.pdf
  • Todd Miller, Federated Searching: Put It in Its Place . April 15, 2004. http://www.libraryjournal.com/article/CA406012.html&
  • The Truth About Federated Searching. October 2003. http://www.infotoday.com/it/oct03/hane1.shtml
  • Lynch, Clifford A. (1997). The Z39.50 Information Retrieval Standard, Part 1: A Strategic View of its Past, Present, and Future. D-Lib Magazine, April 1997. http://www.dlib.org/dlib/april97/04lynch.html
  • Norbert Lossau, “Search Engine Technology and Digital Libraries: Libraries Need to Discover the Academic Internet” D-Lib Magazine, June 2004, Volume 10 Number 6. http://www.dlib.org/dlib/june04/lossau/06lossau.html


  • 1, OAI-PMH
    The open archives initiative protocol for metadata harvesting is designed for reach interoperability between digital libraries and facilitate efficient dissemination of information.
    This paper introduces the basic definition and purpose of OAI-PMH. I think the most important thing is to understand the meaning of OAI-PMH in digital libraries.


    2,
    Todd Miller proposes a new relationship between federated search engines and the library catalog
    It is true that catalogs are rigid in digital age, since information are not limited to physical pattern that can find in one place. And most of content that required by users are not cataloged by libraries. When I use ebay, I find that the information retrieval system is very useful and ebay can provide high relevant results to me, describe content that I need for the commodities.


    3, the truth about federated searching
    This article talks several demerits and problems about federates search engines. I agree with the author's opinions, however, it is common that new things have existed some demerits and I believe scientists will work out these problems in the near future.


    4, the Z39.50 information retrieval standard
    the paper's focus is on how and why Z39.50 developed the way it did, and the conceptual debates that have influenced its evolution and use.


    5, search engine technology and digital libraries
    academically relevant content is very interesting to me.












    LIS 2670 week7 Access in digital libraries part 1 muddist point

    1, what does the sentence "ignore documents that try to 'spam' the text " mean in ppt 54?
    2. Could you explain the image in ppt55 again? I do not understand it.

    Saturday, October 17, 2009

    LIS 2670 week7 Access in digital libraries part 1 reading notes

  • LESK chapter 4.
  • David Hawking , Web Search Engines: Part 1 and Part 2 IEEE Computer, June 2006. http://www.computer.org/portal/site/computer/menuitem.5d61c1d591162e4b0ef1bd108bcd45f3/index.jsp?&pName=computer_level1_article&TheCat=1055&path=computer/homepage/0606&file=thingswork.xml&xsl=article.xsl&;
  • M. Henzinger et al. challenges in Web Search Engines. ACM SIGIR 2002. http://portal.acm.org/citation.cfm?coll=GUIDE&dl=GUIDE&id=792553

    I do not find the readings of the first and the second one. I will put my reading notes when I find the articles. I have searched the second article in IEEE database, but I do not find this paper.

    Challenges in web search engines:
    In this paper, the author points out several challenges that faced by current web search engines. I do think spam is anywhere in today's search engines. And the precision rate is not very high. When we do some searches in search engine, only top -ranked results can meet my needs. The other results are more like trash. Now, we are in a sea of information and advertisement. Relevance is very important in current search engines.

    LIS 2670 week 6 XML and Markup Languages muddist point

    1, what dose the sentence " 80% of SGML' s power at 20% of its complexity." mean?

    2, What is the difference between the URI in an XML namespace and the URL?

    3, Which is more popular, DTD or XML schema? Speaking of digital objects, which of two do we use?

    4, Why the defaults of minOccures and maxOccurres are both 1 in PPt45? I thought the default of minOccures is 0.

    Friday, October 9, 2009

    LIS 2670 week 6 XML and Markup Languages reading notes

    1,An Introduction to the Extensible Markup Language (XML)

    This paper basically present the components of XML, the relationship between SGML and HTML.
    XML is really important in digital world and we have to learn and use XML.
    I am ready to learn it.


    2, XML Schema Tutorial

    This is interesting. The tutorial is simple but it convey lots of information about XML. And many examples are given to prove the utilization of XML.

    3,A survey of XML standards. Part.1

    The core standards -- a foundation for the wide world of XML

    In this series of articles, Uche Ogbuji provides a guide to XML standards,

    The most important is DTD. Within these standards, XML is easier to understand and more standardized.

    I need more time to figure out XML and handle it.

    Thursday, October 8, 2009

    LIS 2670 week5 metadata in digital libraries muddist point

    Identifiers in Dubline Core refers to DOI or URL?
    In the example: "
    Identifier = http://www.cs.cornell.edu/wya/DigLib/new/index.html."
    I thought this refers to url? why?

    Friday, October 2, 2009

    LIS 2670 week4 DL system Demo muddist point

    I have nothing to comment on this week.

    LIS 2670 Assignment 2 Part 1 Flickr link

    http://www.flickr.com/photos/yuqihelucky/