<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Google Search Appliance</title>
	<atom:link href="http://lib.byu.edu/sites/news/2007/01/11/google-search-appliance/feed/" rel="self" type="application/rss+xml" />
	<link>http://lib.byu.edu/sites/news/2007/01/11/google-search-appliance/</link>
	<description>Just another Lib.byu.edu weblog</description>
	<lastBuildDate>Sat, 06 Mar 2010 00:13:27 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<item>
		<title>By: Malcolm Lambe</title>
		<link>http://lib.byu.edu/sites/news/2007/01/11/google-search-appliance/comment-page-1/#comment-91</link>
		<dc:creator>Malcolm Lambe</dc:creator>
		<pubDate>Sun, 30 Sep 2007 10:56:25 +0000</pubDate>
		<guid isPermaLink="false">https://blog.lib.byu.edu/wwg/?p=21#comment-91</guid>
		<description>Can you imagine a world without search engines? Can you imagine doing any thesis work without being able to search online? I can, because I&#039;m 56 but it was really difficult researching anything 20 or 30 years ago. And the time we used to waste going to the library and looking up the index! It blows my mind when I think of how it used to be. And now of course the Search Engines are even more powerful - maybe too powerful. It scares me the power that Google has. Off topic - but I&#039;m talking about the new 2D barcodes on my site - now they&#039;re interesting - whack up a tiny QR Code anywhere - scan it with an enabled cellphone and voila! - hyperlink to a site. Will this be the new graffiti? à bientôt, Lambe, Paris.</description>
		<content:encoded><![CDATA[<p>Can you imagine a world without search engines? Can you imagine doing any thesis work without being able to search online? I can, because I&#8217;m 56 but it was really difficult researching anything 20 or 30 years ago. And the time we used to waste going to the library and looking up the index! It blows my mind when I think of how it used to be. And now of course the Search Engines are even more powerful &#8211; maybe too powerful. It scares me the power that Google has. Off topic &#8211; but I&#8217;m talking about the new 2D barcodes on my site &#8211; now they&#8217;re interesting &#8211; whack up a tiny QR Code anywhere &#8211; scan it with an enabled cellphone and voila! &#8211; hyperlink to a site. Will this be the new graffiti? à bientôt, Lambe, Paris.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ryan Gardner</title>
		<link>http://lib.byu.edu/sites/news/2007/01/11/google-search-appliance/comment-page-1/#comment-90</link>
		<dc:creator>Ryan Gardner</dc:creator>
		<pubDate>Fri, 02 Feb 2007 01:53:33 +0000</pubDate>
		<guid isPermaLink="false">https://blog.lib.byu.edu/wwg/?p=21#comment-90</guid>
		<description>The google search appliance is only as good as the data you feed it.  I set up a Google mini - which is a stripped-down version of a google search appliance, and it does have some nice tools.

Good luck getting a Google Search Appliance to parse MARC records and extract meaningful data.

What kind of searching are you trying to get it to consolidate? How do you expect it to score relevancy? Do you expect it to put item records from your library right up alongside web resources? How do you determine which book is more relevant?

The Google appliance does a good job at providing an easy-to-customize (via XSLT tinkering) search box that will catalog all your stuff. I know on the mini it wasn&#039;t even possible to have it search anything that was protected with anything more than HTTP auth in the header - but I know the search appliance has more features for that kind of stuff. Don&#039;t expect that part to be easy though.

The biggest downfall I see in the Search appliane is that the relevancy score is weighted so heavily on incoming links. When most of your pages at the bottom level have the same number of incoming links - it will fall back on metadata and page analysis.

It doesn&#039;t have a magic wand to search your content for you. There is no such thing as a &quot;magic relevancy&quot; score that can compare results from a wide variety of sources. Somehow, one of those sources is likely going to be looked at as &quot;more relevant&quot; by the search engine just based on how the content is presented to the bot than another source, and that source&#039;s information will come up much higher.

I finished up at BYU last year - but unless things have changed you still have a large and powerful Computer Science department that is full of talented students. Most of them are probably willing to work for pennies. Have them build you a Lucene-based search box that will do your dirty work for you. Heck - you could probably even get a professor to get kids to work on it for a class or something.

Buying a Google appliance is a good decision for someone for whom it is cheaper than developing one internally - or if you really want a cool-looking server to put in your datacenter. I don&#039;t know if the Google appliances come wiht T-shirts, but the mini sure does :)

I guess the other question I have is - who provides your library software? If they can&#039;t deliver a search interface that handles all your assets, why are you using them?</description>
		<content:encoded><![CDATA[<p>The google search appliance is only as good as the data you feed it.  I set up a Google mini &#8211; which is a stripped-down version of a google search appliance, and it does have some nice tools.</p>
<p>Good luck getting a Google Search Appliance to parse MARC records and extract meaningful data.</p>
<p>What kind of searching are you trying to get it to consolidate? How do you expect it to score relevancy? Do you expect it to put item records from your library right up alongside web resources? How do you determine which book is more relevant?</p>
<p>The Google appliance does a good job at providing an easy-to-customize (via XSLT tinkering) search box that will catalog all your stuff. I know on the mini it wasn&#8217;t even possible to have it search anything that was protected with anything more than HTTP auth in the header &#8211; but I know the search appliance has more features for that kind of stuff. Don&#8217;t expect that part to be easy though.</p>
<p>The biggest downfall I see in the Search appliane is that the relevancy score is weighted so heavily on incoming links. When most of your pages at the bottom level have the same number of incoming links &#8211; it will fall back on metadata and page analysis.</p>
<p>It doesn&#8217;t have a magic wand to search your content for you. There is no such thing as a &#8220;magic relevancy&#8221; score that can compare results from a wide variety of sources. Somehow, one of those sources is likely going to be looked at as &#8220;more relevant&#8221; by the search engine just based on how the content is presented to the bot than another source, and that source&#8217;s information will come up much higher.</p>
<p>I finished up at BYU last year &#8211; but unless things have changed you still have a large and powerful Computer Science department that is full of talented students. Most of them are probably willing to work for pennies. Have them build you a Lucene-based search box that will do your dirty work for you. Heck &#8211; you could probably even get a professor to get kids to work on it for a class or something.</p>
<p>Buying a Google appliance is a good decision for someone for whom it is cheaper than developing one internally &#8211; or if you really want a cool-looking server to put in your datacenter. I don&#8217;t know if the Google appliances come wiht T-shirts, but the mini sure does <img src='http://lib.byu.edu/sites/news/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>I guess the other question I have is &#8211; who provides your library software? If they can&#8217;t deliver a search interface that handles all your assets, why are you using them?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tom DeForest</title>
		<link>http://lib.byu.edu/sites/news/2007/01/11/google-search-appliance/comment-page-1/#comment-89</link>
		<dc:creator>Tom DeForest</dc:creator>
		<pubDate>Wed, 17 Jan 2007 18:07:56 +0000</pubDate>
		<guid isPermaLink="false">https://blog.lib.byu.edu/wwg/?p=21#comment-89</guid>
		<description>Thanks. We&#039;ll look into these. Have you used either of them? Any advice?</description>
		<content:encoded><![CDATA[<p>Thanks. We&#8217;ll look into these. Have you used either of them? Any advice?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rico</title>
		<link>http://lib.byu.edu/sites/news/2007/01/11/google-search-appliance/comment-page-1/#comment-88</link>
		<dc:creator>Rico</dc:creator>
		<pubDate>Wed, 17 Jan 2007 04:29:33 +0000</pubDate>
		<guid isPermaLink="false">https://blog.lib.byu.edu/wwg/?p=21#comment-88</guid>
		<description>Save your pennies for another day...
Try these on for size instead of the Google Search Appliance:

http://swish-e.org/
http://www.htdig.org/</description>
		<content:encoded><![CDATA[<p>Save your pennies for another day&#8230;<br />
Try these on for size instead of the Google Search Appliance:</p>
<p><a href="http://swish-e.org/" rel="nofollow">http://swish-e.org/</a><br />
<a href="http://www.htdig.org/" rel="nofollow">http://www.htdig.org/</a></p>
]]></content:encoded>
	</item>
</channel>
</rss>

