<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>PlanetMysql.ru - информация о СУБД MySQL &#187; calpont</title>
	<atom:link href="http://planetmysql.ru/category/calpont/feed/" rel="self" type="application/rss+xml" />
	<link>http://planetmysql.ru</link>
	<description>Блог о самой популярной СУБД MySQL</description>
	<lastBuildDate>Fri, 10 Feb 2012 22:53:14 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3</generator>
		<item>
		<title>No North Texas Users Group Meeting in May, new job</title>
		<link>http://dave-stokes.blogspot.com/2010/04/no-north-texas-users-group-meeting-in.html?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=no-north-texas-users-group-meeting-in-may-new-job</link>
		<comments>http://dave-stokes.blogspot.com/2010/04/no-north-texas-users-group-meeting-in.html#comments</comments>
		<pubDate>Fri, 30 Apr 2010 18:28:00 +0000</pubDate>
		<dc:creator>Dave Stokes</dc:creator>
				<category><![CDATA[calpont]]></category>
		<category><![CDATA[certification]]></category>
		<category><![CDATA[infinidb]]></category>

		<guid isPermaLink="false"></guid>
		<description><![CDATA[There will not be a May meeting of the North Texas MySQL Users Group but I am scrambling to get us a great location for the June meeting.  I have left my Job with MySQL/Sun/Oracle and am no longer responsible for MySQL Certifications.  That means my access to the Sun office were we have been meeting is ended also. I am now the Customer Service Manager for Calpont and the InfiniDB storage engine.  InfiniDB is column based, multi-threaded, and the community edition is open source.  If you run intensive reports against InnoDB or MyIsam databases, you need to investigate this product. And if you regularly run massive queries in data warehousing or business analytic operation that you need to see how the massively parallel process architecture of the enterprise product can make life easier.]]></description>
			<content:encoded><![CDATA[There will not be a May meeting of the <i>North Texas MySQL Users Group</i> but I am scrambling to get us a great location for the June meeting.  I have left my Job with MySQL/Sun/Oracle and am no longer responsible for MySQL Certifications.  That means my access to the Sun office were we have been meeting is ended also. <br /><br />I am now the Customer Service Manager for <a href="http://www.calpont.com">Calpont</a> and the InfiniDB storage engine.  <a href="http://infinidb.org">InfiniDB</a> is column based, multi-threaded, and the community edition is open source.  If you run intensive reports against InnoDB or MyIsam databases, you need to investigate this product. And if you regularly run massive queries in data warehousing or business analytic operation that you need to see how the massively parallel process architecture of the enterprise product can make life easier.<div><img width="1" height="1" src="https://blogger.googleusercontent.com/tracker/8575059197193667898-2643426545508798100?l=dave-stokes.blogspot.com" alt="" /></div><br/>PlanetMySQL Voting:
	 <a href="http://planet.mysql.com/entry/vote/?entry_id=24588&vote=1&apivote=1">Vote UP</a> /
	 <a href="http://planet.mysql.com/entry/vote/?entry_id=24588&vote=-1&apivote=1">Vote DOWN</a>]]></content:encoded>
			<wfw:commentRss>http://planetmysql.ru/2010/04/30/no-north-texas-users-group-meeting-in-may-new-job/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CAOS Theory Podcast 2010.02.05</title>
		<link>http://feedproxy.google.com/~r/451opensource/~3/AQEBenJwpxE/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=caos-theory-podcast-2010-02-05</link>
		<comments>http://feedproxy.google.com/~r/451opensource/~3/AQEBenJwpxE/#comments</comments>
		<pubDate>Fri, 05 Feb 2010 20:23:17 +0000</pubDate>
		<dc:creator>The 451 Group</dc:creator>
				<category><![CDATA[alfresco]]></category>
		<category><![CDATA[android]]></category>
		<category><![CDATA[Apache]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[calpont]]></category>
		<category><![CDATA[canonical]]></category>
		<category><![CDATA[caos theory]]></category>
		<category><![CDATA[caostheory]]></category>
		<category><![CDATA[Chris Hazelton]]></category>
		<category><![CDATA[Chrome]]></category>
		<category><![CDATA[Coraid]]></category>
		<category><![CDATA[data warehousing]]></category>
		<category><![CDATA[developers]]></category>
		<category><![CDATA[eclipse]]></category>
		<category><![CDATA[glassfish]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[gpl]]></category>
		<category><![CDATA[ipad]]></category>
		<category><![CDATA[iphone]]></category>
		<category><![CDATA[jay lyman]]></category>
		<category><![CDATA[LGPL]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[matt asay]]></category>
		<category><![CDATA[matt aslett]]></category>
		<category><![CDATA[mysql]]></category>
		<category><![CDATA[netbeans]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[OpenOffi]]></category>
		<category><![CDATA[Podcast]]></category>

		<guid isPermaLink="false">http://blogs.the451group.com/opensource/?p=1343</guid>
		<description><![CDATA[Topics for this podcast:
*Matt Asay moves from Alfresco to Canonical
*GPL fade fuels heated discussion
*Apple&#8217;s iPad and its enterprise and open source impact
*Open source in data warehousing and storage
*Our perspective on Oracle&#8217;s plans for Sun open source
iTunes or direct download (32:50, 9.2 MB)
]]></description>
			<content:encoded><![CDATA[<p>Topics for this podcast:</p>
<p>*Matt Asay moves from Alfresco to Canonical<br />
*GPL fade fuels heated discussion<br />
*Apple&#8217;s iPad and its enterprise and open source impact<br />
*Open source in data warehousing and storage<br />
*Our perspective on Oracle&#8217;s plans for Sun open source</p>
<p><a href="http://phobos.apple.com/WebObjects/MZStore.woa/wa/viewPodcast?id=280595473">iTunes</a> or <a href="http://media.libsyn.com/media/caostheory/CAOSTheory20100205.mp3">direct download</a> (32:50, 9.2 MB)</p>
<img src="http://feeds.feedburner.com/~r/451opensource/~4/AQEBenJwpxE" height="1" width="1" /><br/>PlanetMySQL Voting:
	 <a href="http://planet.mysql.com/entry/vote/?entry_id=23354&vote=1&apivote=1">Vote UP</a> /
	 <a href="http://planet.mysql.com/entry/vote/?entry_id=23354&vote=-1&apivote=1">Vote DOWN</a>]]></content:encoded>
			<wfw:commentRss>http://planetmysql.ru/2010/02/05/caos-theory-podcast-2010-02-05/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Two new open source data warehousing launches</title>
		<link>http://feedproxy.google.com/~r/451opensource/~3/CLWu6NwVYWU/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=two-new-open-source-data-warehousing-launches</link>
		<comments>http://feedproxy.google.com/~r/451opensource/~3/CLWu6NwVYWU/#comments</comments>
		<pubDate>Wed, 28 Oct 2009 11:33:53 +0000</pubDate>
		<dc:creator>The 451 Group</dc:creator>
				<category><![CDATA[451 group]]></category>
		<category><![CDATA[451caostheory]]></category>
		<category><![CDATA[451group]]></category>
		<category><![CDATA[calpont]]></category>
		<category><![CDATA[caostheory]]></category>
		<category><![CDATA[Dynamo BI]]></category>
		<category><![CDATA[dynamodb]]></category>
		<category><![CDATA[infobright]]></category>
		<category><![CDATA[John Sichi]]></category>
		<category><![CDATA[Kickfire]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[luciddb]]></category>
		<category><![CDATA[lucidera]]></category>
		<category><![CDATA[matt aslett]]></category>
		<category><![CDATA[mattaslett]]></category>
		<category><![CDATA[matthew aslett]]></category>
		<category><![CDATA[matthewaslett]]></category>
		<category><![CDATA[mysql]]></category>
		<category><![CDATA[Nicholas Goodman]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[opensource]]></category>
		<category><![CDATA[oracle]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[sun]]></category>
		<category><![CDATA[The 451 Grou]]></category>

		<guid isPermaLink="false">http://blogs.the451group.com/opensource/?p=1266</guid>
		<description><![CDATA[In our recent report on the data warehousing market we speculated that there would soon be a change in the number of vendors operating in what is a crowded market. We were anticipating that the number of vendors would go down, rather than up, but - in the short term at least - we have been proved wrong, as two new open source analytical databases emerged this week.
First came the formation of Dynamo Business Intelligence Corp, (aka Dynamo BI), a new commercially supported distribution, and sponsor, of LucidDB. Then came the launch of InfiniDB Community Edition, a new open source analytic database based on MySQL from Calpont.
Read the rest of this post on our Too Much Information blog.
]]></description>
			<content:encoded><![CDATA[<p>In our recent <a href="http://www.the451group.com/special_reports/special_report_detail.php?icid=914">report on the data warehousing market</a> we speculated that there would soon be a change in the number of vendors operating in what is a crowded market. We were anticipating that the number of vendors would go down, rather than up, but - in the short term at least - we have been proved wrong, as two new open source analytical databases emerged this week.</p>
<p>First came the <a href="http://n2.nabble.com/Introducing-Dynamo-BI-tt3883211.html">formation</a> of Dynamo Business Intelligence Corp, (aka Dynamo BI), a new commercially supported distribution, and sponsor, of LucidDB. Then came the <a href="http://www.calpont.com/press/October-26-2009.html">launch</a> of InfiniDB Community Edition, a new open source analytic database based on MySQL from Calpont.</p>
<p>Read the rest of <a href="http://blogs.the451group.com/information_management/2009/10/28/because-20-data-warehousing-vendors-is-never-enough/">this post</a> on our Too Much Information blog.</p>
<img src="http://feeds.feedburner.com/~r/451opensource/~4/CLWu6NwVYWU" height="1" width="1" /><br/>PlanetMySQL Voting:
	 <a href="http://planet.mysql.com/entry/vote/?entry_id=21952&vote=1&apivote=1">Vote UP</a> /
	 <a href="http://planet.mysql.com/entry/vote/?entry_id=21952&vote=-1&apivote=1">Vote DOWN</a>]]></content:encoded>
			<wfw:commentRss>http://planetmysql.ru/2009/10/28/two-new-open-source-data-warehousing-launches/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Interview with Robin Schumacher Discussing Calpont&#8217;s InfiniDB</title>
		<link>http://sqlnow.blogspot.com/2009/10/interview-with-robin-schumacher.html?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=interview-with-robin-schumacher-discussing-calponts-infinidb</link>
		<comments>http://sqlnow.blogspot.com/2009/10/interview-with-robin-schumacher.html#comments</comments>
		<pubDate>Tue, 27 Oct 2009 14:54:00 +0000</pubDate>
		<dc:creator>Jimmy Guerrero</dc:creator>
				<category><![CDATA[calpont]]></category>
		<category><![CDATA[infinidb]]></category>
		<category><![CDATA[mysql]]></category>

		<guid isPermaLink="false"></guid>
		<description><![CDATA[I got a chance to sit down with Calpont's new VP of Products, Robin Schumacher last week to discuss the announcement of their new open source column-oriented database, InfiniDB. To listen to the interview, click on the embedded link below.Robin has also posted part one of a two part technical article on InfiniDB in the DevZone. You can check out the article here.You can also get "up close and personal" with InfiniDB when Robin delivers a free webinar on the advantages of column oriented databases for analytics, high performance query systems and data warehousing next month. Register here.For more info on what Calpont is up to with InfiniDB, check out their community .org site here.]]></description>
			<content:encoded><![CDATA[I got a chance to sit down with Calpont's new VP of Products, Robin Schumacher last week to discuss the announcement of their new open source column-oriented database, InfiniDB. To listen to the interview, click on the embedded link below.<br /><br /><br /><br />Robin has also posted part one of a two part technical article on InfiniDB in the DevZone. You can check out the article <a href="http://dev.mysql.com/tech-resources/articles/mysql_datawarehouse_calpont1.html">here</a>.<br /><br />You can also get "up close and personal" with InfiniDB when Robin delivers a free webinar on the advantages of column oriented databases for analytics, high performance query systems and data warehousing next month. Register <a href="http://www.mysql.com/news-and-events/web-seminars/display-459.html">here</a>.<br /><br />For more info on what Calpont is up to with InfiniDB, check out their community .org site <a href="http://www.infinidb.org/index.php">here</a>.<div><img width="1" height="1" src="https://blogger.googleusercontent.com/tracker/3684039529233307799-7135187431527750869?l=sqlnow.blogspot.com" /></div><br/>PlanetMySQL Voting:
	 <a href="http://planet.mysql.com/entry/vote/?entry_id=21939&vote=1&apivote=1">Vote UP</a> /
	 <a href="http://planet.mysql.com/entry/vote/?entry_id=21939&vote=-1&apivote=1">Vote DOWN</a>]]></content:encoded>
			<wfw:commentRss>http://planetmysql.ru/2009/10/27/interview-with-robin-schumacher-discussing-calponts-infinidb/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Calpont opens up: InfiniDB Open Source Analytical Database (based on MySQL)</title>
		<link>http://rpbouman.blogspot.com/2009/10/calpont-opens-up-infinidb-open-source.html?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=calpont-opens-up-infinidb-open-source-analytical-database-based-on-mysql</link>
		<comments>http://rpbouman.blogspot.com/2009/10/calpont-opens-up-infinidb-open-source.html#comments</comments>
		<pubDate>Tue, 27 Oct 2009 13:54:00 +0000</pubDate>
		<dc:creator>Roland Bouman</dc:creator>
				<category><![CDATA[analytic databases]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[calpont]]></category>
		<category><![CDATA[column oriented databases]]></category>
		<category><![CDATA[data warehousing]]></category>
		<category><![CDATA[infobright]]></category>
		<category><![CDATA[luciddb]]></category>
		<category><![CDATA[monetdb]]></category>
		<category><![CDATA[mysql]]></category>
		<category><![CDATA[partitioning]]></category>

		<guid isPermaLink="false"></guid>
		<description><![CDATA[Open source business intelligence and data warehousing are on the rise!If you kept up with the MySQL Performance Blog, you might have noticed a number of posts comparing the open source analytical databases Infobright, LucidDB, and MonetDB. LucidDB got some more news last week when Nick Goodman announced that the Dynamo Business Intelligence Corporation will be offering services around LucidDB, branding it as DynamoDB.Now, to top if off, Calpont has just released InfiniDB, a GPLv2 open source version of its analytical database offering, which is based on the MySQL server.So, let's take a quick look at InfiniDB. I haven't yet played around with it, but the features sure look interesting:Column-oriented architecture (like all other analytical database products mentioned)Transparent compressionVertical and horizontal partitioning: on top of being column-oriented, data is also partitioned, potentially allowing for less IO to access data.MVCC and support for high concurrency. It would be interesting to see how much benefit this gives when loading data, because this is usually one of the bottle necks for column-oriented databasesSupport for ACID/TransactionsHigh performance bulkloaderNo specialized hardware - InfiniDB is a pure software solution that can run on commidity hardwareMySQL compatibleThe website sums up a few more features and benefits, but I think this covers the most important ones. Calpont also offers a closed source enterprise edition, which differs from the open source by offering support for multi-node scale-out support. By that, they do not mean regular MySQL replication scale-out. Instead, the enterprise edition features a true distributed database architecture which allows you to divide incoming requests across a layer of so-called "user modules" (MySQL front ends) and "performance modules" (the actual workhorses that partition, retrieve and cache data). In this scenario, the user modules break the queries they recieve from client applications into pieces, and send them to one or more performance modules in a parallel fashion. The performance modules then retrieve the actual data from either their cache, or from the disk, and sends those back to the user modules which re-assemble the partial and intermediate results to the final resultset which is sent back to the client. (see picture)Given the MySQL compatibility and otherwise similar features, I think it is fair to compare the open source InfiniDB offering to the Infobright community edition. Interesting differences are that InfiniDB supports all usual DML statements (INSERT, DELETE, UPDATE), and that InfiniDB offers the same bulkloader in both the community edition as well as the enterprise edition: Infobright community edition does not support DML, and offers a bulk loader that is less performant than the one included in its enterprise edition. I have not heard of an InfoBright multi-node option, so when comparing the enterprise edition featuresets, that seems like an advantage too in Calpont's offering.Please understand that I am not endorsing one of these products over the other: I'm just doing a checkbox feature list comparison here. What it mostly boils down to, is that users that need an affordable analytical database now have even more choice  than before. In addition, it adds a bit more competition for the vendors, and I expect them all to improve as a result of that. These are interesting times for the BI and data warehousing market :)]]></description>
			<content:encoded><![CDATA[Open source business intelligence and data warehousing are on the rise!<br /><br />If you kept up with the <a href="http://www.mysqlperformanceblog.com/" >MySQL Performance Blog</a>, you might have <a href="http://www.mysqlperformanceblog.com/2009/10/26/air-traffic-queries-in-luciddb/" >noticed</a> a <a href="http://www.mysqlperformanceblog.com/2009/10/02/analyzing-air-traffic-performance-with-infobright-and-monetdb/" >number</a> of <a href="http://www.mysqlperformanceblog.com/2009/09/29/quick-comparison-of-myisam-infobright-and-monetdb/" >posts</a> comparing the open source analytical databases <a href="http://www.infobright.org/">Infobright</a>, <a href="http://www.luciddb.org/" >LucidDB</a>, and <a href="http://monetdb.cwi.nl/">MonetDB</a>. LucidDB <a href="http://www.nicholasgoodman.com/bt/blog/2009/10/24/luciddb-dynamobi-is-running-with-it/" >got</a> some <a href="http://n2.nabble.com/Introducing-Dynamo-BI-td3883211.html" >more</a> news <a href="http://thinkwaitfast.blogspot.com/2009/10/introducing-dynamo-bi.html">last</a> week when <a href="http://www.nicholasgoodman.com/bt/blog/" >Nick Goodman</a> announced that the Dynamo Business Intelligence Corporation will be offering services around LucidDB, branding it as DynamoDB.<br /><br />Now, to top if off, <a href="http://www.calpont.com/" >Calpont</a> has just released <a href="http://infinidb.org/resources/what-is-infinidb" >InfiniDB</a>, a GPLv2 open source version of its analytical database offering, which is based on the MySQL server.<br /><br />So, let's take a quick look at InfiniDB. I haven't yet played around with it, but the features sure look interesting:<ul><br /><li>Column-oriented architecture (like all other analytical database products mentioned)</li><br /><li>Transparent compression</li><br /><li>Vertical and horizontal partitioning: on top of being column-oriented, data is also partitioned, potentially allowing for less IO to access data.</li><br /><li>MVCC and support for high concurrency. It would be interesting to see how much benefit this gives when loading data, because this is usually one of the bottle necks for column-oriented databases</li><br /><li>Support for ACID/Transactions</li><br /><li>High performance bulkloader</li><br /><li>No specialized hardware - InfiniDB is a pure software solution that can run on commidity hardware</li><br /><li>MySQL compatible</li><br /></ul><br />The website sums up a few more features and benefits, but I think this covers the most important ones. <br /><br />Calpont also offers a closed source enterprise edition, which differs from the open source by offering support for multi-node scale-out support. By that, they do not mean regular MySQL replication scale-out. Instead, the enterprise edition features a true distributed database architecture which allows you to divide incoming requests across a layer of so-called "user modules" (MySQL front ends) and "performance modules" (the actual workhorses that partition, retrieve and cache data). In this scenario, the user modules break the queries they recieve from client applications into pieces, and send them to one or more performance modules in a parallel fashion. The performance modules then retrieve the actual data from either their cache, or from the disk, and sends those back to the user modules which re-assemble the partial and intermediate results to the final resultset which is sent back to the client. (see picture)<br /><a href="http://www.flickr.com/photos/21931585@N07/4049476409/" title="shared-disk-arch-simple by roland.bouman, on Flickr"><img src="http://farm3.static.flickr.com/2563/4049476409_a124c2b147_o.jpg" width="821" height="465" alt="shared-disk-arch-simple" /></a><br />Given the MySQL compatibility and otherwise similar features, I think it is fair to compare the open source InfiniDB offering to the Infobright community edition. Interesting differences are that InfiniDB supports all usual DML statements (<code>INSERT</code>, <code>DELETE</code>, <code>UPDATE</code>), and that InfiniDB offers the same bulkloader in both the community edition as well as the enterprise edition: Infobright community edition does not support DML, and offers a bulk loader that is less performant than the one included in its enterprise edition. I have not heard of an InfoBright multi-node option, so when comparing the enterprise edition featuresets, that seems like an advantage too in Calpont's offering.<br /><br />Please understand that I am not endorsing one of these products over the other: I'm just doing a checkbox feature list comparison here. What it mostly boils down to, is that users that need an affordable analytical database now have even more choice  than before. In addition, it adds a bit more competition for the vendors, and I expect them all to improve as a result of that. These are interesting times for the BI and data warehousing market :)<div><img width="1" height="1" src="https://blogger.googleusercontent.com/tracker/15319370-3110806468379653967?l=rpbouman.blogspot.com" /></div><br/>PlanetMySQL Voting:
	 <a href="http://planet.mysql.com/entry/vote/?entry_id=21933&vote=1&apivote=1">Vote UP</a> /
	 <a href="http://planet.mysql.com/entry/vote/?entry_id=21933&vote=-1&apivote=1">Vote DOWN</a>]]></content:encoded>
			<wfw:commentRss>http://planetmysql.ru/2009/10/27/calpont-opens-up-infinidb-open-source-analytical-database-based-on-mysql/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>EU Should Protect MySQL-based Special Purpose Database Vendors</title>
		<link>http://rpbouman.blogspot.com/2009/09/eu-should-protect-mysql-based-special.html?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=eu-should-protect-mysql-based-special-purpose-database-vendors</link>
		<comments>http://rpbouman.blogspot.com/2009/09/eu-should-protect-mysql-based-special.html#comments</comments>
		<pubDate>Sat, 12 Sep 2009 00:35:00 +0000</pubDate>
		<dc:creator>Roland Bouman</dc:creator>
				<category><![CDATA[antitrust]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[calpont]]></category>
		<category><![CDATA[data warehousing]]></category>
		<category><![CDATA[eu]]></category>
		<category><![CDATA[exadata]]></category>
		<category><![CDATA[infobright]]></category>
		<category><![CDATA[Kickfire]]></category>
		<category><![CDATA[mysql]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Oracle / Sun deal]]></category>
		<category><![CDATA[scaledb]]></category>

		<guid isPermaLink="false"></guid>
		<description><![CDATA[In my recent post on the EU antitrust regulators' probe into the Oracle Sun merger I did not mention an important class of stakeholders: the MySQL-based special purpose database startups. By these I mean:KickfireInfobrightCalpontScaleDBI think it's safe to say the first three are comparable in the sense that they are all analytical databases: they are designed for data warehousing and business intelligence applications. ScaleDB might be a good fit for those applications, but I think it's architecture is sufficiently different from the first three to not call it an analytical database.For Kickfire and Infobright, the selling point is that they are offering a relatively cheap solution to build large data warehouses and responsive business intelligence applications. (I can't really find enough information on Calpoint pricing, although they do mention low total cost of ownership.) An extra selling point is that they are MySQL compatible, which may make some difference for some customers. But that compatibility is in my opinion not as important as the availability of a serious data warehousing solution at a really sharp price.Now, in my previous post, I mentioned that the MySQL and Oracle RDBMS products are very different, and I do not perceive them as competing. Instead of trying to kill the plain MySQL database server product, Oracle should take advantage of a huge opportunity to help shape the web by being a good steward, leading ongoing MySQL development, and in addition, enable their current Oracle Enterprise customers to build cheap LAMP-based websites (with the possibility of adding value by offering Oracle to MySQL data integration).For these analytical database solutions, things may be different though. I think these MySQL based analytical databases really are competitive to Oracle's Exadata analytical appliance. Oracle could form a serious threat to these MySQL-based analytical database vendors. After the merger, Oracle would certainly be in a position to hamper these vendors by resticting the non-GPL licensed usage of MySQL.In a recent ad, Oracle vouched to increase investments in developing Sun's hardware and operating system technology. And this would eventually put them in an even better position to create appliances like Exadata, allowing them to ditch an external hardware partner like HP (which is their Exadata hardware partner).So, all in all, in my opinion the EU should definitely take a serious look at the dynamics of the analytical database market and decide how much impact the Oracle / Sun merger could have on this particular class of MySQL OEM customers. The rise of these relatvely cheap MySQL-based analytical databases is a very interesting development for the business intelligence and data warehousing space in general, and means a big win for customers that need affordable datawarhousing / business intelligence. It would be a shame if it would be curtailed by Oracle. After the merger, Oracle sure would have the means and the motive, so if someone needs protection, I think it would be these MySQL-based vendors of analytical databases.As always, these are just my musing and opinions - speculation is free. Feel free to correct me, add applause or point out my ignorance :)]]></description>
			<content:encoded><![CDATA[In <a href="http://rpbouman.blogspot.com/2009/09/mysql-factor-in-eus-decision.html" >my recent post</a> on the EU antitrust regulators' probe into the Oracle Sun merger I did not mention an important class of stakeholders: the MySQL-based special purpose database startups. By these I mean:<br /><ul><br /><li><a href="http://www.kickfire.com/" >Kickfire</a></li><br /><li><a href="http://www.infobright.org/" >Infobright</a></li><br /><li><a href="http://www.calpont.com/" >Calpont</a></li><br /><li><a href="http://www.scaledb.com/" >ScaleDB</a></li><br /></ul><br />I think it's safe to say the first three are comparable in the sense that they are all analytical databases: they are designed for data warehousing and business intelligence applications. ScaleDB might be a good fit for those applications, but I think it's architecture is sufficiently different from the first three to not call it an analytical database.<br /><br />For Kickfire and Infobright, the selling point is that they are offering a relatively cheap solution to build large data warehouses and responsive business intelligence applications. (I can't really find enough information on Calpoint pricing, although they do mention low total cost of ownership.) An extra selling point is that they are MySQL compatible, which may make some difference for some customers. But that compatibility is in my opinion not as important as the availability of a serious data warehousing solution at a really sharp price.<br /><br />Now, in my previous post, I mentioned that the MySQL and Oracle RDBMS products are very different, and I do not perceive them as competing. Instead of trying to kill the plain MySQL database server product, Oracle should take advantage of a huge opportunity to help shape the web by being a good steward, leading ongoing MySQL development, and in addition, enable their current Oracle Enterprise customers to build cheap LAMP-based websites (with the possibility of adding value by offering Oracle to MySQL data integration).<br /><br />For these analytical database solutions, things may be different though. <br /><br />I think these MySQL based analytical databases really are competitive to Oracle's <a href="http://www.oracle.com/database/exadata.html" >Exadata</a> analytical appliance. Oracle could form a serious threat to these MySQL-based analytical database vendors. After the merger, Oracle would certainly be in a position to hamper these vendors by resticting the non-GPL licensed usage of MySQL.<br /><a href="http://www.oracle.com/features/suncustomers.html" >In a recent ad</a>, Oracle vouched to increase investments in developing Sun's hardware and operating system technology. And this would eventually put them in an even better position to create appliances like Exadata, allowing them to ditch an external hardware partner like HP (which is their Exadata hardware partner).<br /><br />So, all in all, in my opinion the EU should definitely take a serious look at the dynamics of the analytical database market and decide how much impact the Oracle / Sun merger could have on this particular class of MySQL OEM customers. The rise of these relatvely cheap MySQL-based analytical databases is a very interesting development for the business intelligence and data warehousing space in general, and means a big win for customers that need affordable datawarhousing / business intelligence. It would be a shame if it would be curtailed by Oracle. After the merger, Oracle sure would have the means and the motive, so if someone needs protection, I think it would be these MySQL-based vendors of analytical databases.<br /><br />As always, these are just my musing and opinions - speculation is free. Feel free to correct me, add applause or point out my ignorance :)<div><img width="1" height="1" src="https://blogger.googleusercontent.com/tracker/15319370-8581340365503538920?l=rpbouman.blogspot.com" /></div><br/>PlanetMySQL Voting:
	 <a href="http://planet.mysql.com/entry/vote/?entry_id=21085&vote=1&apivote=1">Vote UP</a> /
	 <a href="http://planet.mysql.com/entry/vote/?entry_id=21085&vote=-1&apivote=1">Vote DOWN</a>]]></content:encoded>
			<wfw:commentRss>http://planetmysql.ru/2009/09/12/eu-should-protect-mysql-based-special-purpose-database-vendors/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The EC is mostly, but not entirely, wrong about Oracle/MySQL</title>
		<link>http://feedproxy.google.com/~r/451opensource/~3/1XCCTPZWvmg/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=the-ec-is-mostly-but-not-entirely-wrong-about-oraclemysql</link>
		<comments>http://feedproxy.google.com/~r/451opensource/~3/1XCCTPZWvmg/#comments</comments>
		<pubDate>Fri, 04 Sep 2009 11:34:16 +0000</pubDate>
		<dc:creator>The 451 Group</dc:creator>
				<category><![CDATA[451 group]]></category>
		<category><![CDATA[451caostheory]]></category>
		<category><![CDATA[451group]]></category>
		<category><![CDATA[calpont]]></category>
		<category><![CDATA[caostheory]]></category>
		<category><![CDATA[copyright]]></category>
		<category><![CDATA[european commission]]></category>
		<category><![CDATA[gpl]]></category>
		<category><![CDATA[infobright]]></category>
		<category><![CDATA[Kickfire]]></category>
		<category><![CDATA[Licensing]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[M&A]]></category>
		<category><![CDATA[MariaDB]]></category>
		<category><![CDATA[matt aslett]]></category>
		<category><![CDATA[mattaslett]]></category>
		<category><![CDATA[matthew aslett]]></category>
		<category><![CDATA[matthewaslett]]></category>
		<category><![CDATA[monty program]]></category>
		<category><![CDATA[mysql]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[opensource]]></category>
		<category><![CDATA[oracle]]></category>
		<category><![CDATA[s]]></category>
		<category><![CDATA[Software]]></category>

		<guid isPermaLink="false">http://blogs.the451group.com/opensource/?p=1220</guid>
		<description><![CDATA[By now you are probably aware that the European Commission has decided to launch an extended investigation into Oracle&#8217;s acquisition of Sun based on concerns over MySQL.
The new has prompted a lot of criticism of the EC, much of it suggesting that the delay will do considerable harm to Sun (and therefore Oracle). This argument is valid - Sun&#8217;s already declining revenue has been in freefall since the deal was announced and one wonders how far it will fall in another 90 days of stasis.
Other criticism, (such as this from Matt Asay) focuses on the suggestion that the delay will do little to help MySQL or its users, and that the EC fails to understand open source.
This also has some validity. The EC talks about &#8220;Oracle&#8217;s incentive to further develop MySQL as an open source database&#8221; but as Matt points out &#8220;even Oracle can&#8217;t put the open-source genie back in the bottle once it has been released, as MySQL has, under the GNU General Public License.&#8221;
This is true. although I would argue, that Oracle&#8217;s potential control over MySQL is not about licensing, but copyright. The FT states that Oracle &#8220;doesn’t control the IP, since the software is available under the GPL&#8221;. That is not entirely true. The existing code will always be under the GPL but as the copyright for that code would be fully-owned by Oracle it is under no obligation to release future developments under the GPL.
I do not expect that to happen, but copyright ownership does not just impact the ability to license code, it also provides control over potential commercial uses of that code. This is where it could be argued that the EC could be right to have anti-competitive concerns over Oracle&#8217;s future ownership of MySQL (even if it doesn&#8217;t understand why, or hasn&#8217;t articulated that it does).
Criticism of the EC has also suggested that it is disproportionately focusing on a products with a tiny market share. There are various suggestions as to quite how small MySQL&#8217;s market share is, with the WSJ citing 0.2%, but also 1.5%, AHN 0.04%, the FT &#8220;around half a percentage point&#8221;.
What all these reports overlook is that MySQL&#8217;s influence is much greater than its market share, not only in terms of more widespread unpaid usage, but also in terms of the ecosystem of vendors that are building products based on MySQL to tap into its widespread adoption.
Examples include Kickfire, Infobright and Calpont in data warehousing, ScaleDB in shared-disk clustering, Tokutek in Web-application querying, and Schooner Information Technology and Virident Systems in caching appliances. 
All of these products enable mySQL to better compete with Oracle&#8217;s database products, and many of these have commercial relationships with Sun that enable them to use MySQL in proprietary products (while Infobright is itself open source, it also has a relationship with Sun).
Calpont also plans to offer an open source data warehouse based on MySQL but has put is plans on hold while it waits to see what Oracle will do with the MySQL database. Calpont&#8217;s concern is that Oracle will choose not to promote commercial relationships that use MySQL to compete more directly with Oracle&#8217;s Database business.
The MariaDB fork provides a potential alternative for these vendors, but as we previously discussed on this blog there are questions as to whether closed-source MySQL storage engines are compatible with MariaDB.
As noted in that post, ScaleDB&#8217;s Mike Hogan has argued that it can be done via an open source intermediary layer (and given that ScaleDB does not have a commercial arrangement with Sun, the company will be hoping that its analysis is correct), but MariaDB and MySQL creator Monty Widenius is not convinced: “This can only be done by buying MySQL licenses from Sun for each copy of MariaDB that is distributed.”
If Monty is correct then Oracle&#8217;s impending ownership of MySQL could theoretically have a significant impact on the emerging market for commercial products based on MySQL and their ability to compete with the Oracle Database. 
As we noted in a report on the wider implications of Oracle&#8217;s impending ownership of MySQL (451 subscribers only) &#8220;For the commercial arrangements between these vendors and Oracle to survive, they will have to show that they can provide value to MySQL without impacting Oracle.&#8221;
Is that anti-competitive? Perhaps. I would argue that it certainly warrants further investigation.
]]></description>
			<content:encoded><![CDATA[<p>By now you are probably aware that the European Commission has <a href="http://europa.eu/rapid/pressReleasesAction.do?reference=IP/09/1271&amp;format=HTML&amp;aged=0&amp;language=EN&amp;guiLanguage=en">decided</a> to launch an extended investigation into Oracle&#8217;s acquisition of Sun based on concerns over MySQL.</p>
<p>The new has prompted a lot of criticism of the EC, much of it suggesting that the delay will do considerable harm to Sun (and therefore Oracle). This argument is valid - Sun&#8217;s already declining revenue has been in freefall since the deal was announced and one wonders how far it will fall in another 90 days of stasis.</p>
<p>Other criticism, (such as <a href="http://news.cnet.com/8301-13505_3-10344537-16.html">this</a> from Matt Asay) focuses on the suggestion that the delay will do little to help MySQL or its users, and that the EC fails to understand open source.</p>
<p>This also has some validity. The EC talks about &#8220;Oracle&#8217;s incentive to further develop MySQL as an open source database&#8221; but as Matt points out &#8220;even Oracle can&#8217;t put the open-source genie back in the bottle once it has been released, as MySQL has, under the GNU General Public License.&#8221;</p>
<p>This is true. although I would argue, that Oracle&#8217;s potential control over MySQL is not about licensing, but copyright. The FT <a href="http://blogs.ft.com/techblog/2009/09/brussels-sun-delay-leaves-oracle-in-a-quandry/">states</a> that Oracle &#8220;doesn’t control the IP, since the software is available under the GPL&#8221;. That is not entirely true. The existing code will always be under the GPL but as the copyright for that code would be fully-owned by Oracle it is under no obligation to release future developments under the GPL.</p>
<p>I do not expect that to happen, but copyright ownership does not just impact the ability to license code, it also provides control over potential commercial uses of that code. This is where it could be argued that the EC could be right to have anti-competitive concerns over Oracle&#8217;s future ownership of MySQL (even if it doesn&#8217;t understand why, or hasn&#8217;t articulated that it does).</p>
<p>Criticism of the EC has also suggested that it is disproportionately focusing on a products with a tiny market share. There are various suggestions as to quite how small MySQL&#8217;s market share is, with the WSJ <a href="http://online.wsj.com/article/SB10001424052970204731804574390512306888466.html">citing</a> 0.2%, but <a href="http://online.wsj.com/article/BT-CO-20090903-713599.html">also</a> 1.5%, <a href="http://www.allheadlinenews.com/articles/7016296678?Oracle-Sun%20Merger%20Catches%20Eyes%20Of%20EU%20Competition%20Czar">AHN</a> 0.04%, <a href="http://blogs.ft.com/techblog/2009/09/brussels-sun-delay-leaves-oracle-in-a-quandry/">the FT</a> &#8220;around half a percentage point&#8221;.</p>
<p>What all these reports overlook is that MySQL&#8217;s influence is much greater than its market share, not only in terms of more widespread unpaid usage, but also in terms of the ecosystem of vendors that are building products based on MySQL to tap into its widespread adoption.</p>
<p>Examples include Kickfire, Infobright and Calpont in data warehousing, ScaleDB in shared-disk clustering, Tokutek in Web-application querying, and Schooner Information Technology and Virident Systems in caching appliances. </p>
<p>All of these products enable mySQL to better compete with Oracle&#8217;s database products, and many of these have commercial relationships with Sun that enable them to use MySQL in proprietary products (while Infobright is itself open source, it also has a relationship with Sun).</p>
<p>Calpont also plans to offer an open source data warehouse based on MySQL but has put is plans on hold while it waits to see what Oracle will do with the MySQL database. Calpont&#8217;s concern is that Oracle will choose not to promote commercial relationships that use MySQL to compete more directly with Oracle&#8217;s Database business.</p>
<p>The MariaDB fork provides a potential alternative for these vendors, but as we previously <a href="http://blogs.the451group.com/opensource/2009/05/21/are-closed-source-mysql-storage-engines-compatible-with-mariadb/">discussed</a> on this blog there are questions as to whether closed-source MySQL storage engines are compatible with MariaDB.</p>
<p>As noted in that post, ScaleDB&#8217;s Mike Hogan has argued that it can be done via an open source intermediary layer (and given that ScaleDB does not have a commercial arrangement with Sun, the company will be hoping that its analysis is correct), but MariaDB and MySQL creator Monty Widenius is not convinced: “This can only be done by buying MySQL licenses from Sun for each copy of MariaDB that is distributed.”</p>
<p>If Monty is correct then Oracle&#8217;s impending ownership of MySQL could theoretically have a significant impact on the emerging market for commercial products based on MySQL and their ability to compete with the Oracle Database. </p>
<p>As we noted in a <a href="http://the451group.com/report_view/report_view.php?entity_id=58014">report</a> on the wider implications of Oracle&#8217;s impending ownership of MySQL (451 subscribers only) &#8220;For the commercial arrangements between these vendors and Oracle to survive, they will have to show that they can provide value to MySQL without impacting Oracle.&#8221;</p>
<p>Is that anti-competitive? Perhaps. I would argue that it certainly warrants further investigation.</p>
<img src="http://feeds.feedburner.com/~r/451opensource/~4/1XCCTPZWvmg" height="1" width="1" /><br/>PlanetMySQL Voting:
	 <a href="http://planet.mysql.com/entry/vote/?entry_id=20935&vote=1&apivote=1">Vote UP</a> /
	 <a href="http://planet.mysql.com/entry/vote/?entry_id=20935&vote=-1&apivote=1">Vote DOWN</a>]]></content:encoded>
			<wfw:commentRss>http://planetmysql.ru/2009/09/04/the-ec-is-mostly-but-not-entirely-wrong-about-oraclemysql/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Open source’s role in lowering the barriers to data warehousing</title>
		<link>http://feedproxy.google.com/~r/451opensource/~3/oqDXA643QpA/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=open-source%25e2%2580%2599s-role-in-lowering-the-barriers-to-data-warehousing</link>
		<comments>http://feedproxy.google.com/~r/451opensource/~3/oqDXA643QpA/#comments</comments>
		<pubDate>Thu, 06 Aug 2009 15:45:29 +0000</pubDate>
		<dc:creator>The 451 Group</dc:creator>
				<category><![CDATA[451 group]]></category>
		<category><![CDATA[451caostheory]]></category>
		<category><![CDATA[451group]]></category>
		<category><![CDATA[aster data]]></category>
		<category><![CDATA[bizgres]]></category>
		<category><![CDATA[C-store]]></category>
		<category><![CDATA[calpont]]></category>
		<category><![CDATA[caostheory]]></category>
		<category><![CDATA[datallegro]]></category>
		<category><![CDATA[Eigenbase]]></category>
		<category><![CDATA[greenplum]]></category>
		<category><![CDATA[infobright]]></category>
		<category><![CDATA[ingres]]></category>
		<category><![CDATA[Kickfire]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[luciddb]]></category>
		<category><![CDATA[madison]]></category>
		<category><![CDATA[matt aslett]]></category>
		<category><![CDATA[mattaslett]]></category>
		<category><![CDATA[matthew aslett]]></category>
		<category><![CDATA[matthewaslett]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[netezza]]></category>
		<category><![CDATA[omne]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Software]]></category>

		<guid isPermaLink="false">http://blogs.the451group.com/opensource/?p=1201</guid>
		<description><![CDATA[As well as contributing to the CAOS research practice here at The 451 Group I am also part of the information management team, with a focus on databases, data caching, CEP, and - from the start of this year - data warehousing.
I&#8217;ve covered data warehousing before but taking a fresh look at this space in recent months it&#8217;s been fascinating to see the variety of technologies and strategies that vendors are applying to the data warehousing problem. It&#8217;s also been interesting to compare the role that open source has played in the data warehousing market, compared to the database market.
I&#8217;m preparing a major report on the data warehousing sector, for publication in the next couple of months. What follows is a rough outline of the role open source has played in the sector. Any comments or corrections much appreciated:
Unlike other sectors, where the role of open source has mostly been the disruption of incumbent proprietary vendors by commercial open source specialists, the impact of open source in the data warehousing sector has been more subtle, and arguably more pervasive.
Vendors such as Netezza and Greenplum have used the PostgreSQL database to build their data warehousing products, benefiting from the robust, mature PostgreSQL code base and reduced time to market. However, the end products of these development efforts are not open source. 
For example, Netezza built its Netezza Performance Server (NPS) data warehouse appliances around Red Hat Linux and PostgreSQL, although the BSD license used by the PostgreSQL project enabled the company to do so without its resulting database having to be made available under an open source license. Additionally, Aster Data makes use of PostgreSQL as a data store on each node of its nCluster massively parallel data warehouse.
Similarly Greenplum also used PostgreSQL as the basis for its massively-parallel Greenplum Database and also set up and supported the Bizgres distribution with business intelligence and data warehousing specific contributions made available under the BSD license. However that project fizzled out and the website is now closed, although Greenplum’s use of PostgreSQL continues.
Another example of PostgreSQL usage comes from Paraccel, which used the PostgreSQL optimizer code in version 1.0 of its Analytic Database in order to improve time to market. That is now being replaced by a new optimizer called Omne, which is specifically designed to support the MPP columnar architecture of Paraccel and its compression capabilities, unlike the SMP PostgreSQL optimizer, which was extended to support MPP. While Omne retains some elements of the open source PostgreSQL optimizer code base, Paraccel claims it will remove all PostgreSQL code from its products with an update to the Omne technology in 2010.
Additionally Vertica, which was founded by Mike Stonebraker, creator of PostgreSQL and Ingres, is a commercial implementation of the C-Store academic research project, which was also licensed under BSD.
It is also worth mentioning that prior to its acquisition by Microsoft, DATAllegro made use of a commercial license of the open source Ingres database within its data warehousing appliances. DATAllegro actually did most of the early development work for its first appliance using PostgreSQL, but decided to change to Ingres late in 2004 to make use of partitioning capabilities, backup utilities and optimizer features. Needless to day Ingres is being replaced by Microsoft SQL Server in Microsoft’s forthcoming Madison data warehouse appliances.
LucidDB is another, often overlooked open source database, and was purpose-built for data warehousing. Based on technology developed by Broadbase Software, the code was picked up by erstwhile business intelligence SaaS provider LucidEra and combined with the Eigenbase data management framework to create LucidDB. Following LucidEra’s recent demise the LucidDB code is not currently commercially supported, although the non-profit Eigenbase Foundation is continuing to sponsor its development.
Despite this rampant use of open source code, it was not until Infobright launched Infobright Community Edition (ICE) in 2008 that we saw the first commercial open source vendor delivering its core warehouse software under an open source license. The Infobright columnar database acts as a storage engine for the MySQL database turning it  into a realistic option for data warehouses of more than 200GB according to Infobright (Sun maintains that MySQL can perform as a stand-alone data-warehousing platform up to 2TB with the default MyISAM non-transactional storage engine).
While MySQL is not well known as a platform for data warehousing, Sun’s internal surveys indicate that data warehousing is the fifth-most-common use case for MySQL, which explains why it is not just Infobright that is looking to build a data warehousing business around MySQL.
Kickfire emerged in April 2008 with a beta version of its MySQL Appliance, which is built around the MySQL database and its SQL chip, which provides native instruction execution while operating directly out of memory on compressed data. Kickfire is targeting deployments in the 100GB-3TB range, while Infobright acts as a MySQL storage engine to enable use with up to 30TB of data. Infobright is developing a shared-everything, peer-to-peer architecture that will support up to 100 concurrent users and 100TB of data. Delivery is scheduled for the fourth quarter.
It remains to be seen whether Oracle will retain its commercial relationships with Kickfire and Infobright once its acquisition of Sun, and therefore MySQL, closes, but one company that has already been impacted by the acquisition its Calpont, which had planned to make a big splash at the recent MySQL Conference &#38; Expo with the launch of its new strategy to provide a data-warehousing storage engine for the MySQL database.
The plan, to offer an open source column-oriented storage engine that will provide the MySQL database with the capabilities to function as a data warehouse, scaling from capacities of 100GB to 100TB, remains in place, although the storage engine will be in beta testing for the foreseeable future while Calpont waits to see what Oracle will do.
The most recent open source entrant into the data warehousing market is Ingres, which has teamed up with VectorWise, a database-engine spin-off from Amsterdam&#8217;s Centrum Wiskunde &#38; Informatica (CWI) scientific research establishment, to collaborate on a new database-kernel project designed to better enable it to be positioned as a platform for data-warehouse and analytic workloads. he resulting software will be fully open source although Ingres does not have detailed plans for the productization of the technology at this stage.
While open source is playing an increasing role in the data warehousing market, PostgreSQL has primarily taken the role of lowering barriers to entry for new vendors by providing a platform for the development of data warehouse-specific capabilities on a proven database platform.
MySQL serves a similar role for Infobright, Kickfire and Calpont, but could also play a significant role in lowering barriers to entry for new data warehousing customers with small volumes of data.
Calpont turned its attention to MySQL and the midrange market in order to exploit the requirement for scalable data-warehousing capabilities from MySQL&#8217;s estimated 11 million users, as well as the fact that the low-end of the market has not been well-supported by the existing data-warehousing vendors.
Sun estimates that 90% of all data warehouses have 6TB of data or less, while Kickfire estimates there are 17,000 addressable accounts that are trying to use MySQL to create data warehouses with volumes greater than 50GB.
These estimates explain why Sun et al see an opportunity for MySQL-based warehouses to grab a slice of the market based on a low cost systems targeting a large number of customers and small amounts of data – the complete inverse of the traditional focus for data warehousing requirements, which is based on high cost systems supporting large amounts of data and a relatively small number of potential customers.
Additionally, Kickfire, Infobright and Calpont are looking to replicate the strategy MySQL successful followed in the database market by targeting a market niche that is not being served by the incumbents and avoid competing head on with the likes of Teradata, IBM, Oracle and Netezza.
]]></description>
			<content:encoded><![CDATA[<p>As well as contributing to the CAOS research practice here at The 451 Group I am also part of the information management team, with a focus on databases, data caching, CEP, and - from the start of this year - data warehousing.</p>
<p>I&#8217;ve covered data warehousing before but taking a fresh look at this space in recent months it&#8217;s been fascinating to see the variety of technologies and strategies that vendors are applying to the data warehousing problem. It&#8217;s also been interesting to compare the role that open source has played in the data warehousing market, compared to the <a href="http://www.the451group.com/caos/caos_detail.php?icid=539">database market</a>.</p>
<p>I&#8217;m preparing a major report on the data warehousing sector, for publication in the next couple of months. What follows is a rough outline of the role open source has played in the sector. Any comments or corrections much appreciated:</p>
<p>Unlike other sectors, where the role of open source has mostly been the disruption of incumbent proprietary vendors by commercial open source specialists, the impact of open source in the data warehousing sector has been more subtle, and arguably more pervasive.</p>
<p>Vendors such as <a href="http://www.netezza.com/twinfin.aspx">Netezza</a> and <a href="http://www.greenplum.com/">Greenplum</a> have used the <a href="http://www.postgresql.org/">PostgreSQL</a> database to build their data warehousing products, benefiting from the robust, mature PostgreSQL code base and reduced time to market. However, the end products of these development efforts are not open source. </p>
<p>For example, Netezza built its Netezza Performance Server (NPS) data warehouse appliances around Red Hat Linux and PostgreSQL, although the BSD license used by the PostgreSQL project enabled the company to do so without its resulting database having to be made available under an open source license. Additionally, <a href="http://www.asterdata.com/">Aster Data</a> makes use of PostgreSQL as a data store on each node of its nCluster massively parallel data warehouse.</p>
<p>Similarly Greenplum also used PostgreSQL as the basis for its massively-parallel Greenplum Database and also set up and supported the Bizgres distribution with business intelligence and data warehousing specific contributions made available under the BSD license. However that project <a href="http://pgfoundry.org/pipermail/bizgres-general/2008-February/date.html#start">fizzled out</a> and the website is now closed, although Greenplum’s use of PostgreSQL continues.</p>
<p>Another example of PostgreSQL usage comes from <a href="http://www.paraccel.com/">Paraccel</a>, which used the PostgreSQL optimizer code in version 1.0 of its Analytic Database in order to improve time to market. That is now being replaced by a new optimizer called Omne, which is specifically designed to support the MPP columnar architecture of Paraccel and its compression capabilities, unlike the SMP PostgreSQL optimizer, which was extended to support MPP. While Omne retains some elements of the open source PostgreSQL optimizer code base, Paraccel claims it will remove all PostgreSQL code from its products with an update to the Omne technology in 2010.</p>
<p>Additionally <a href="http://www.vertica.com/">Vertica</a>, which was founded by Mike Stonebraker, creator of PostgreSQL and Ingres, is a commercial implementation of the <a href="http://db.csail.mit.edu/projects/cstore/">C-Store</a> academic research project, which was also licensed under BSD.</p>
<p>It is also worth mentioning that prior to its <a href="http://www.datallegro.com/pr/7_24_08_microsoft_acquisition.asp">acquisition</a> by Microsoft, DATAllegro <a href="http://www.datallegro.com/pr/2_20_07_datallegro_ingres.asp">made use</a> of a commercial license of the open source Ingres database within its data warehousing appliances. DATAllegro actually did most of the early development work for its first appliance using PostgreSQL, but decided to change to Ingres late in 2004 to make use of partitioning capabilities, backup utilities and optimizer features. Needless to day Ingres is being replaced by Microsoft SQL Server in Microsoft’s forthcoming <a href="http://www.microsoft.com/sqlserver/2008/en/us/madison.aspx">Madison</a> data warehouse appliances.</p>
<p><a href="http://www.luciddb.org/">LucidDB</a> is another, often overlooked open source database, and was purpose-built for data warehousing. Based on technology developed by Broadbase Software, the code was picked up by erstwhile business intelligence SaaS provider LucidEra and combined with the <a href="http://www.eigenbase.org/">Eigenbase</a> data management framework to create LucidDB. Following LucidEra’s recent demise the LucidDB code is not currently commercially supported, although the non-profit Eigenbase Foundation is continuing to sponsor its development.</p>
<p>Despite this rampant use of open source code, it was not until <a href="http://www.infobright.com/">Infobright</a> launched Infobright Community Edition (ICE) in 2008 that we saw the first commercial open source vendor delivering its core warehouse software under an open source license. The Infobright columnar database acts as a storage engine for the MySQL database turning it  into a realistic option for data warehouses of more than 200GB according to Infobright (Sun maintains that MySQL can perform as a stand-alone data-warehousing platform up to 2TB with the default <a href="http://dev.mysql.com/doc/refman/5.1/en/myisam-storage-engine.html">MyISAM</a> non-transactional storage engine).</p>
<p>While <a href="http://www.mysql.com/">MySQL</a> is not well known as a platform for data warehousing, Sun’s internal surveys indicate that data warehousing is the fifth-most-common use case for MySQL, which explains why it is not just Infobright that is looking to build a data warehousing business around MySQL.</p>
<p><a href="http://www.kickfire.com/">Kickfire</a> emerged in April 2008 with a beta version of its MySQL Appliance, which is built around the MySQL database and its SQL chip, which provides native instruction execution while operating directly out of memory on compressed data. Kickfire is targeting deployments in the 100GB-3TB range, while Infobright acts as a MySQL storage engine to enable use with up to 30TB of data. Infobright is developing a shared-everything, peer-to-peer architecture that will support up to 100 concurrent users and 100TB of data. Delivery is scheduled for the fourth quarter.</p>
<p>It <a href="http://dealbook.blogs.nytimes.com/2009/05/18/what-will-oracle-do-with-mysql/">remains to be seen</a> whether Oracle will retain its commercial relationships with Kickfire and Infobright once its acquisition of Sun, and therefore MySQL, closes, but one company that has already been impacted by the acquisition its <a href="http://www.calpont.com/">Calpont</a>, which had planned to make a big splash at the recent MySQL Conference &#038; Expo with the launch of its new strategy to provide a data-warehousing storage engine for the MySQL database.</p>
<p>The plan, to offer an open source column-oriented storage engine that will provide the MySQL database with the capabilities to function as a data warehouse, scaling from capacities of 100GB to 100TB, remains in place, although the storage engine will be in beta testing for the foreseeable future while Calpont waits to see what Oracle will do.</p>
<p>The most recent open source entrant into the data warehousing market is <a href="http://www.ingres.com/">Ingres</a>, which has teamed up with <a href="http://www.vectorwise.com/index_js.php?page=mission_overview">VectorWise</a>, a database-engine spin-off from Amsterdam&#8217;s Centrum Wiskunde &#038; Informatica (CWI) scientific research establishment, to <a href="http://blogs.the451group.com/information_management/2009/07/29/ingres-launches-project-for-in-memory-columnar-vectorized-database-engine/">collaborate</a> on a new database-kernel <a href="http://www.ingres.com/vectorwise/">project</a> designed to better enable it to be positioned as a platform for data-warehouse and analytic workloads. he resulting software will be fully open source although Ingres does not have detailed plans for the productization of the technology at this stage.</p>
<p>While open source is playing an increasing role in the data warehousing market, PostgreSQL has primarily taken the role of lowering barriers to entry for new vendors by providing a platform for the development of data warehouse-specific capabilities on a proven database platform.</p>
<p>MySQL serves a similar role for Infobright, Kickfire and Calpont, but could also play a significant role in lowering barriers to entry for new data warehousing customers with small volumes of data.</p>
<p>Calpont turned its attention to MySQL and the midrange market in order to exploit the requirement for scalable data-warehousing capabilities from MySQL&#8217;s estimated 11 million users, as well as the fact that the low-end of the market has not been well-supported by the existing data-warehousing vendors.</p>
<p>Sun estimates that 90% of all data warehouses have 6TB of data or less, while Kickfire estimates there are 17,000 addressable accounts that are trying to use MySQL to create data warehouses with volumes greater than 50GB.</p>
<p>These estimates explain why Sun et al see an opportunity for MySQL-based warehouses to grab a slice of the market based on a low cost systems targeting a large number of customers and small amounts of data – the complete inverse of the traditional focus for data warehousing requirements, which is based on high cost systems supporting large amounts of data and a relatively small number of potential customers.</p>
<p>Additionally, Kickfire, Infobright and Calpont are looking to replicate the strategy MySQL successful followed in the database market by targeting a market niche that is not being served by the incumbents and avoid competing head on with the likes of Teradata, IBM, Oracle and Netezza.</p>
<img src="http://feeds.feedburner.com/~r/451opensource/~4/oqDXA643QpA" height="1" width="1" /><br/>PlanetMySQL Voting:
	 <a href="http://planet.mysql.com/entry/vote/?entry_id=20528&vote=1&apivote=1">Vote UP</a> /
	 <a href="http://planet.mysql.com/entry/vote/?entry_id=20528&vote=-1&apivote=1">Vote DOWN</a>]]></content:encoded>
			<wfw:commentRss>http://planetmysql.ru/2009/08/06/open-source%e2%80%99s-role-in-lowering-the-barriers-to-data-warehousing/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

