<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Monash, are there really only three kinds of data?</title>
	<atom:link href="http://facets.endeca.com/2010/01/monash-are-there-really-only-three-kinds-of-data/feed/" rel="self" type="application/rss+xml" />
	<link>http://facets.endeca.com/2010/01/monash-are-there-really-only-three-kinds-of-data/</link>
	<description>The many faces of discovery</description>
	<lastBuildDate>Mon, 23 Aug 2010 21:05:07 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: aferrari</title>
		<link>http://facets.endeca.com/2010/01/monash-are-there-really-only-three-kinds-of-data/comment-page-1/#comment-187</link>
		<dc:creator>aferrari</dc:creator>
		<pubDate>Fri, 05 Mar 2010 22:05:42 +0000</pubDate>
		<guid isPermaLink="false">http://facets.endeca.com/?p=75#comment-187</guid>
		<description>Dan, Thanks for the comment. I absolutely agree that there’s a ton of value in enhancing unstructured information by adding more structure! And this can be accomplished in many ways – for example, like text analytics like entity and sentiment extraction, or more basic approaches like joining information from structured and unstructured sources together. In fact many of these techniques combine quite nicely – for example, in an enterprise setting I might pull customer and employee names out of unstructured text fields, and then join on structured data from my CRM or HR databases. But these approaches really work best with a semi-structured data representation as the target as opposed to a relational store. For example, if I’m doing entity extraction it’s hard to tell which types of entities I will find, and in what quantities. If I can pull structure out of unstructured information, there are many incredibly valuable user interaction features that we can power, like faceted navigation, type-ahead search, structured range filters (e.g., geographic filters if I can pull out location data), data visualizations like tag clouds, and so on. But we need to be targeting the right kind of data model when we take these approaches; otherwise they become limited by schema complexity and commensurate query evaluation slowdowns. -Adam</description>
		<content:encoded><![CDATA[<p>Dan, Thanks for the comment. I absolutely agree that there’s a ton of value in enhancing unstructured information by adding more structure! And this can be accomplished in many ways – for example, like text analytics like entity and sentiment extraction, or more basic approaches like joining information from structured and unstructured sources together. In fact many of these techniques combine quite nicely – for example, in an enterprise setting I might pull customer and employee names out of unstructured text fields, and then join on structured data from my CRM or HR databases. But these approaches really work best with a semi-structured data representation as the target as opposed to a relational store. For example, if I’m doing entity extraction it’s hard to tell which types of entities I will find, and in what quantities. If I can pull structure out of unstructured information, there are many incredibly valuable user interaction features that we can power, like faceted navigation, type-ahead search, structured range filters (e.g., geographic filters if I can pull out location data), data visualizations like tag clouds, and so on. But we need to be targeting the right kind of data model when we take these approaches; otherwise they become limited by schema complexity and commensurate query evaluation slowdowns. -Adam</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dan Barbata</title>
		<link>http://facets.endeca.com/2010/01/monash-are-there-really-only-three-kinds-of-data/comment-page-1/#comment-185</link>
		<dc:creator>Dan Barbata</dc:creator>
		<pubDate>Fri, 05 Mar 2010 01:18:34 +0000</pubDate>
		<guid isPermaLink="false">http://facets.endeca.com/?p=75#comment-185</guid>
		<description>Is there any disadvantage you can see in trying to bring structure to unstructured information? By one way of thinking, the more structure you have the better. Wondering if you agree?</description>
		<content:encoded><![CDATA[<p>Is there any disadvantage you can see in trying to bring structure to unstructured information? By one way of thinking, the more structure you have the better. Wondering if you agree?</p>
]]></content:encoded>
	</item>
</channel>
</rss>
