Archive for the ‘databases’ Category
Hadoop + Hive + Endeca, Spotted in the Wild
by Pete Bell
In his post MapReduce just semi-good for semi-structured data, Adam Ferrari answered one of his FAQs about the relationship between Endeca and MapReduce, the popular big data cruncher. Now here’s one example of them complementing each other.
The question Adam answered was, if MapReduce is so powerful for processing big data, then what role does [...]
The hyping of the NoSQL foo
by Adam Ferrari
Adam Ferrari
Last week I discussed my impressions of the NoSQL movement, and because it’s such a fast-moving topic, I wanted to round up some interesting developments since then.
My main point in my last post was that NoSQL technology should not be defined diffusely as any of the new generation of databases. NoSQL is primarily motivated [...]
Let’s not let “NoSQL” go the way of “Web 2.0”
by Adam Ferrari
Adam Ferrari
I get asked all the time whether Endeca should be considered a “NoSQL” database. It’s a totally reasonable question. After all, our core engine shares some attributes of a NoSQL system – it’s a persistent data store, has a non-relational data model, and has convenient APIs for developing web applications. And it works at [...]
For Search and interactive BI, what’s the new disk?
by Adam Ferrari
Adam Ferrari
Since the recent New England Database Summit at MIT I’ve been thinking a lot about Storage Class Memory (SCM) – technologies like Flash and PCM that are vying to become the next persistent storage technology of choice. Buzz on this definitely picked up substantially last year with product announcements from many of the data [...]
UX matters and BI matters at new IEEE site
by Pete Bell
Pete Bell
The new IEEE Xplore Digital Library site just launched and it has an interesting type of numeric range filter, worth discussing from both a user experience and business intelligence angle. This filter is among several noteworthy faceted search designs at IEEE, including the choice to give each facet its own search box and breadcrumb.
IEEE [...]
How similar are faceted search and OLAP? See CIO Mag: 20 to Watch in 2010
by Adam Ferrari
Adam Ferrari
Endeca received a nice spot in CIO magazine’s new list, “Twenty companies to watch in 2010.” Compared with much of the coverage we get in the press, which tends to focus on one of our core market solutions such as Retail or Manufacturing, I was pleased with the breadth that CIO was able to [...]
Monash, are there really only three kinds of data?
by Adam Ferrari
Adam Ferrari
Is my data structured, unstructured, or semi-structured? Curt Monash provided yet another take on this never-ending data management question in a blog post earlier this week. This topic has generated tons of discussion over time, but despite this, common perceptions out there seem fairly straightforward. Basically, common wisdom holds that:
Structured data is the stuff [...]
MapReduce just semi-good for semi-structured data
by Adam Ferrari
Adam Ferrari
Sybase recently announced that it’s become the latest analytical database vendor to hop onto the MapReduce (MR) bandwagon. This trend has been ramping throughout the past year+, ignited by early innovators like Aster and Greenplum, followed fairly quickly by others such as Netezza and (somewhat surprisingly) Vertica.
One local outcome for yours truly has been [...]
Vertical stores for vertical web search?
by Adam Ferrari
Adam Ferrari
Before the holidays I made it over to MIT for a talk by Michael Stonebraker about his latest startup, Goby.com, a vertical search site focused on leisure and travel. Always up for a Stonebraker talk, I was particularly keen to see this one given the combination of elements – a true pioneer of database [...]