avocets
Avocets
rss 2.0 subscribe to this page
search


view all
•  projects
•  owners
•  tags

Great article from a reference librarian perspective on how to use LSCH subdivision terms to refine keyword searches.

"

This hosted service represents a new generation metasearch service. It displays results quickly and simply, and gives the searcher unparalleled control over the search experience. These benefits are accomplished through features such as

* simultaneous searching of up to 100 databases almost instantly
* relevance ranking, sorting and merging of records across databases
* processing of returned hits at the rate of 2000 records per second
* faceted searching by resource, subject, author, date or any library-defined element

MasterKey's simplicity and power provide a metasearch experience unequaled by any other metasearch service."

 

John discusses his program to add LC subject headings to titles in the Online Books Page based on the LC classification

From Glenn E. Patton:

"To get a sense of which MARC data elements are used in the various indexes for WorldCat, take a look at "Searching WorldCat Indexes" (http://www.oclc.org/support/documentation/worldcat/searching/searchworl
dcatindexes/). Click on "List of Fields and Subfields Indexed in Tag Order" in the navigation bar on the left of the screen to see a list of MARC elements and the indexes in which they are used."

 

 

tagged marc oclc searching by bethpc ...on 13-JUN-08

"Ten things Google has found to be true".  Is there a message for libraries?  

tagged google searching by bethpc ...on 02-MAY-08

Interesting post on federated searching. Why imporved mapping doesn't work, and what we might do to improve the results sets for users.

 

tagged future_of_catalogs searching by bethpc ...on 23-APR-08

What Have We Got to Lose? The Effect of Controlled Vocabulary on Keyword Searching Results

Tina Gross and Arlene G.Taylor

tagged keywords searching subject_authorities by bethpc ...on 15-APR-08
Tips to keep in mind when searching Medline.
JSTOR
JSTOR specializes in making available the back issues of journals in a wide variety of humanities and social science disciplines. Issues are available both as images and as text, making searching possible both within each title and across the whole database.

 

Penn's subscription currently includes all the available collections:

  • the Arts & Sciences Collection I, II, III, IV and the complement
  • the Business Collection
  • the Ecology & Botany Collection
  • the General Science Collection
  • the Language and Literature Collection
  • the Music Collection

 

Access to journals from JSTOR is restricted to current Penn faculty, staff and students.

Printing from the JSTOR database requires downloading a helper application called JPrint.
Holdings: active

tagged journal searching by polyn ...and 63 other people ...on 29-MAY-07
PsycINFO
-from CSA Databases
The American Psychological Association's comprehensive indexing and abstracting service for the professional and scholarly literature in psychology and related fields. Coverage is worldwide. Sources are in English and over thirty languages.
Holdings: 1887 to the present. Updated monthly.
tagged psychology searching by polyn ...and 33 other people ...on 06-MAR-07
ISI Citation Indexes (Web of Science)
The ISI Citation Indexes include the Science Citation Index, the Social Science Citation Index, and the Arts & Humanities Citation Index.
Search for specific articles by subject, author, journal, and/or author address, as well as for articles that cite a known author or work.

Holdings: Science Citation Index Expanded (SCI-EXPANDED)--1945-present
Social Sciences Citation Index (SSCI)--1956-present
Arts & Humanities Citation Index (A&HCI)--1975-present
Updated weekly.
tagged searching by polyn ...and 40 other people ...on 19-DEC-06
How do you find your way in an age of information overload? How can you filter streams of complex information to pull out only what you want? Why does it matter how information is structured when Google seems to magically bring up the right answer to your questions?
NC States project page -- the public page at least.

CQL::Parser is a Perl module for parsing Common Query Language statements.

CQL is a formal language for representing queries to information retrieval systems such as web indexes, bibliographic catalogs and museum collection information. The design objective is that queries be human readable and writable, and that the language be intuitive while maintaining the expressiveness of more complex languages.

CQL::Parser will allow you validate statements, and parse them into a parse tree which you can then programatically walk and use. For your convenince there are methods for converting the CQL parse tree into Swish and Lucene queries as well as XCQL (an XML representation of CQL).

  use CQL::Parser;
  my $parser = CQL::Parser->new();
  my $root = $parser->parse('dc.creator="clinton"');
  my $swish = $root->toSwish();
  my $lucene = $root->toLucene();
  my $xcql = $root->toXCQL();
tagged SRU discovery dlf_spring_2006 searching by winkler4 ...on 11-APR-06
Nice layout to look at for LoST
tagged cni_spring_2006 development lost searching by winkler4 ...on 04-APR-06
"To make this global panopticon a reality, unique identification tags are necessary but not sufficient. As we know from library science research, known-item searches account for less than half of total demand. Most users will also want to perform exploratory subject searches." woohoo!
Claims that the simplicity of google has
tagged google infomation_literacy searching by laallen ...on 27-MAR-06
We present a new method for content management and knowledge discovery using a topology-preserving neural network. The method, termed topological organization of content (TOC), can generate a taxonomy of topics from a set of unannotated, unstructured documents. The TOC consists of a hierarchy of self-organizing growing chains (GCs), each of which can develop independently in terms of size and topics. The dynamic development process is validated continuously using a proposed entropy-based Bayesian information criterion (BIC). Each chain meeting the criterion spans child chains, with reduced vocabularies and increased specializations. This results in a topological tree hierarchy, which can be browsed like a table of contents directory or web portal. A brief review is given on existing methods for document clustering and organization, and clustering validation measures. The proposed approach has been tested and compared with several existing methods on real world web page datasets. The results have clearly demonstrated the advantages and efficiency in content organization of the proposed method in terms of computational cost and representation. The TOC can be easily adapted for large-scale applications. The topology provides a unique, additional feature for retrieving related topics and confining the search space.