Archive for the ‘rackspace’ Category

451 CAOS Links. 2011.12.02

Декабрь 2nd, 2011

Talend delivers v5. Zentyal raises series A. The TCO of OSS. And more.

# Talend announced version 5 of its data integration suite, adding business process management capabilities via an OEM relationship with BonitaSoft. Yves De Montcheuil explained the name changes in version 5.

# Zentyal closed a series A venture capital funding of over $1m by Open Ocean Capital.

# The London School of Economics released a report on the total cost of ownership of open source software.

# Couchbase announced the availability of the Couchbase Hadoop Connector, developed in conjunction with Cloudera.

# Rackspace announced the private beta of Rackspace MySQL Cloud Database.

# The debate over the role of open source foundations in the Git era continued, including a follow-up by the instigator, Mikael Rogers, a rallying cry for autonomy from Ceki Gülcü, and Simon Phipps warning about throwing the baby out with the bathwater.

# Marco Abis is stepping down as CEO of Sourcesense.

# NGINX usage has grown almost 300% over the last year, according to Netcraft figures discussed by Royal Pingdom.

# The Wireless Innovation Forum announced the formation of the Open Source Framework for Commercial Baseband Software project.


PlanetMySQL Voting: Vote UP / Vote DOWN

451 CAOS Links 2011.10.07

Октябрь 7th, 2011

OpenStack Foundation. New Pentaho CEO. And more.

# Rackspace announced its intention to form an independent OpenStack Foundation.

# HP has chosen Ubuntu as the lead host and guest operating system for its Public Cloud.

# Pentaho appointed Quentin Gallivan as its new CEO.

# Hortonworks continued the discussion about contributions to Apache Hadoop.

# Bob Bickel explained why CloudBees is not, itself, open source.

# Google announced the limited preview release of Google Cloud SQL.

# Eucalyptus Systems, Nebula and Virtual Bridges joined the Linux Foundation.

# Dave Neary discussed the different types of community in relation to the Tizen project.

# Akamai joined the OpenStack community.

# Daniel Abadi provided his perspective on Oracle’s NoSQL Database.

# One more thing…
Apple’s relationship with open source may be somewhat tenuous – Paul Rooney provides some background – but given the impact Steve Jobs has made on the industry as a whole it seems wrong not to mark his passing in some way. We’ll leave the words to the company he created.


PlanetMySQL Voting: Vote UP / Vote DOWN

Clearing the global status

Ноябрь 21st, 2010

If you want to clear local session status variables this is easy, simply execute FLUSH STATUS.  Unfortunately doing the same for global status variables is not as easy.

That is until now that is.  In the Drizzle7 beta release 2010-11-08 is a new command: FLUSH GLOBAL STATUS

I created this command to fulfil a Rackspace feature request but it appears this has also been a MySQL feature request in the past.  The implementation would be very different in MySQL since the status variables and sessions work differently, but it would certainly be good to see in a future release, and it appears patches already exist for it.


PlanetMySQL Voting: Vote UP / Vote DOWN

Rackspace Rookie-O (in Hong Kong!)

Октябрь 25th, 2010

I’d meant to finish writing this way back in July… but I failed at that. Now is a good time to talk about Rookie-O as my again new colleague Andrew Hutchings (Buy his and Sergei’s book on MySQL 5.1 Plugin Development!) just went through the same thing (but in London instead of Hong Kong) given by the same trainer (Hi Eddie!).

Rackspace is the second employer I’ve had that has some kind of new hire training (the first being Sun). I am, of course, not quite counting Salmiakki as new-hire training for MySQL (although I probably should). To quote from the Wikipedia article: “Although the rumor of the heart attack was a hoax, the drink may still cause harm. The strong flavor almost completely masks the presence of ethanol, and the drinker may not realize he is consuming a drink almost 40% alcohol by volume (80-proof), leading to possible alcohol poisoning.” A promising introduction to the company.

Monty, Mårten and Kaj with Salmiakki singing Helan Går at the MySQL User Conference Japan in 2007

Monty, Mårten and Kaj with Salmiakki singing Helan Går at the MySQL User Conference Japan in 2007

I could possibly say something about the Sun New-Hire training… but I’m just trying to find something positive to say – and I can’t. I got a bit of hacking done? Seriously.

Actually coordinating a time to attend a Rookie-O (Rookie Orientation, the Rackspace name for new hire training) was rather tricky. There was one right before the MySQL User Conference back in April (not the best of timing), one during an upcoming team meeting (again, not ideal) and one that got organised in the middle of everything for the office in Hong Kong. So, I headed to Hong Kong.

Hong Kong streetlife

The Hong Kong office is relatively new (late 2008) and there were people there who hadn’t gone through the standard Rackspace Rookie-O (Orientation).

Rackers walking Hong Kong at Night

It was rather cool to hang out with other people who worked for the company – and in totally different areas than I do. I did get a better understanding for how the rest of the company operates and the people involved. The training itself was useful and substantially less geared towards not-my-job than Sun’s was.

The good news is that Andrew thought it was useful too. Pretty impressed so far.


PlanetMySQL Voting: Vote UP / Vote DOWN

Open source in the clouds and in the debates

Сентябрь 8th, 2010

We continue to see more evidence of the themes we discuss in our latest CAOS special report, Seeding the Clouds, which examines the open source software used in cloud computing, the vendors backing open source, the cloud providers using it and the impact on the industry.

First, as usual, we are seeing consistencies between our own research — which indicates open source is a huge part of today’s cloud computing offerings from major providers like Amazon, Google, Rackspace, Terremark and VMware — and that of code analysis and management vendor Black Duck. In its analysis of code that runs the cloud, Black Duck also found a preponderance of open source pieces, in many cases the same projects we profile in our report.

Indeed, open source software is an important part of the infrastructure, data and application layers of today’s cloud computing stacks with significant use of Linux, open source hypervisors KVM and Xen, open source data technologies such as MySQL, PostgreSQL, Hadoop, NoSQL and memcached and open source languages such as Java, PHP, Python and Ruby on Rails.

There will be plenty of users and customers content to use non-open source options that serve as the defacto standards, but we do see a move to higher-level, production and mission critical use, which represents continued commercial opportunity for open source and other vendors.

One of the more subtle effects of all this open source in the cloud, as covered in Seeding the Clouds, is the impact on discussions, debates and downright fights in the market. There is much scrutiny on claims of being open, technical aspects of open and what ‘open cloud’ means. A prime example is the Twisticuffs that have gone on between Simon Crosby of XenSource and Citrix, discussing OpenCloud and the response from Open Cloud Initiative co-founder Sam Johnston, who claims this is misuse of the open label.

We already saw open source playing a role in the discussions and debates about open clouds, open APIs and open data, and this latest confrontation is evidence that role continues to be significant. We still wonder though about the question of open enough as we contemplate openness in the clouds.


PlanetMySQL Voting: Vote UP / Vote DOWN

HOWTO screw up launching a free software project

Июль 27th, 2010

Josh Berkus gave a great talk at linux.conf.au 2010 (the CFP for linux.conf.au 2011 is open until August 7th) entitled “How to destroy your community” (lwn coverage). It was a simple, patented, 10 step program, finely homed over time to have maximum effect. Each step is simple and we can all name a dozen companies that have done at least three of them.

Simon Phipps this past week at OSCON talked about Open Source Continuity in practice – specifically mentioning some open source software projects that were at Sun but have since been abandoned by Oracle and different strategies you can put in place to ensure your software survives, and check lists for software you use to see if it will survive.

So what can you do to not destroy your community, but ensure you never get one to begin with?

Similar to destroying your community, you can just make it hard: “#1 is to make the project depend as much as possible on difficult tools.

#1 A Contributor License Agreement and Copyright Assignment.

If you happen to be in the unfortunate situation of being employed, this means you get to talk to lawyers. While your employer may well have an excellent Open Source Contribution Policy that lets you hack on GPL software on nights and weekends without a problem – if you’re handing over all the rights to another company – there gets to be lawyer time.

Your 1hr of contribution has now just ballooned. You’re going to use up resources of your employer (hey, lawyers are not cheap), it’s going to suck up your work time talking to them, and if you can get away from this in under several hours over a few weeks, you’re doing amazingly well – especially if you work for a large company.

If you are the kind of person with strong moral convictions, this is a non-starter. It is completely valid to not want to waste your employers’ time and money for a weekend project.

People scratching their own itch, however small is how free software gets to be so awesome.

I think we got this almost right with OpenStack. If you compare the agreement to the Apache License, there’s so much common wording it ends up pretty much saying that you agree you are able to submit things to the project under the Apache license.  This (of course) makes the entire thing pretty redundant as if people are going to be dishonest about submitting things under the Apache licnese there’s no reason they’re not going to be dishonest and sign this too.

You could also never make it about people – just make it about your company.

#2 Make it all about the company, and never about the project

People are not going to show up, do free work for you to make your company big, huge and yourself rich.

People are self serving. They see software they want only a few patches away, they see software that serves their company only a few patches away. They see software that is an excellent starting point for something totally different.

I’m not sure why this is down at number three… it’s possibly the biggest one for danger signs that you’re going to destroy something that doesn’t even yet exist…

#3 Open Core

This pretty much automatically means that you’re not going to accept certain patches for reasons of increasing your own company’s short term profit. i.e. software is no longer judged on technical merits, but rather political ones.

There is enough politics in free software as it is, creating more is not a feature.

So when people ask me about how I think the OpenStack launch went, I really want people to know how amazing it can be to just not fuck it up to begin with. Initial damage is very, very hard to ever undo. The number of Open Source software projects originally coming out of a company that are long running, have a wide variety of contributors and survive the original company are much smaller than you think.

PostgreSQL has survived many companies coming and going around it, and is stronger than ever. MySQL only has a developer community around it almost in spite of the companies that have shepherded the project. With Drizzle I think we’ve been doing okay – I think we need to work on some things, but they’re more generic to teams of people working on software in general rather than anything to do with a company.


PlanetMySQL Voting: Vote UP / Vote DOWN

At OSCON

Июль 20th, 2010

I’m at OSCON this week. Come say hi and talk Drizzle, Rackspace, cloud, photography, vegan food or brewing.


PlanetMySQL Voting: Vote UP / Vote DOWN

Is OpenStack Cloud Computing Rocket Science?

Июль 19th, 2010

There’s a real explosion of cloud platforms and management tools, it seems you can’t swing a dead cat without hitting one these days. In the commercial proprietary solutions space you have – CA’s 3Terra AppLogic, Enomaly, Nimbula, RightScale. In open source there are EucalyptusCloud.com, Open Nebula and Ubuntu Enterprise Cloud. There are a bunch more that I failed to mention. It makes you wonder do we really need another one? How much different can they be?Rackspace Champion's Open Source Cloud I am not sure but the newest one appears to be rather significant.

Today Rackspace has thrown their hat in the ring with their new OpenStack initiative in collaboration with NASA — as in rocket scientists, smartest guys in the world. Unlike Amazon’s EC2 which preaches open APIs, Rackspace is working to develop an open source platform that compliments their hosted cloud offering. They also have a strong open source partner in NASA who has been working on their own cloud computing platform, NASA Nebula. NASA Nebula will now become the cornerstone for the OpenStack initiative.

The goal of OpenStack is to allow any organization to create and offer cloud computing capabilities using open source software running on standard hardware. The project boasts both a compute and storage component. OpenStack Compute is software for automatically creating and managing large groups of virtual private servers and is available as a developer’s preview with a release target of October. OpenStack Storage is software for creating redundant, scalable object storage using clusters of commodity servers to store terabytes or even petabytes of data. Also available as a developer preview the OpenStack Storage project expects to release a production ready version in mid-September.

Adding the Rackspace hosting model to a strong open source project makes this approach to cloud computing especially interesting. Giving private cloud users a logical migration path to public cloud use. The question effect will this initiative have to truly drive open cloud computing standards.

OpenStack, A Foundation for Hybrid Clouds?

This initiative while founded on open source is not necessarily the cure for lock-in but it does go much farther than anyone else offering a fully accessible reference architecture available as open source. The closest comparison I see is Eucalyptus that is mimicks the Amazon EC2 cloud compute architecture (though not AmazOpen Stack - Open Source Cloud Computingon S3), though Amazon and Eucalyptus don’t seem to share a commonly agreed upon road map but rather a leader-follower relationship.

OpenStack’s formula is more coordinated and with a respectable user to champion it, NASA. The U.S space agency has one of the most compelling publicly documented private cloud computing stories.  NASA has gone so far as to package their solution in small footprint shipping containers to distribute among NASA research centers. These portable data centers are a model that many organizations looking to build private clouds are watching with interest.

I like that private clouds built on the OpenStack reference architecture should be fully compatible with Rackspace hosting services. Giving users the choice to run their own cloud or host or adopt a hybrid model. It’s not unlike open source adoption models were users download a free software version that has compatibility with a commercially supported version.  Plus this is not Rackspace’s only foray into open source distributed computing as they support the Apache-hosted Cassandra project, a highly scalable distributed database, and have been showing their support at numerous cloud and open source events.

OpenStack Web Interface

The OpenStack Web Interface

The strong message accompanying the launch is one of open standards and prevention of cloud lock-in. Lew Moorman, President, Cloud and CSO at Rackspace states this clearly that OpenStack wants to prevent vendor lock-in:

“We are founding the OpenStack initiative to help drive industry standards, prevent vendor lock-in and generally increase the velocity of innovation in cloud technologies.”

This is not unlike VMware who echoed that sentiment with an announcement this spring to collaborate with Google AppEngine.  VMware’s CTO Steve Herrod stated that they too were committed to open standards and preventing login:

“Our shared vision is to make it easy to build, run, and manage applications for the cloud, and to do so in a way that makes the applications portable across clouds. The rich applications should be able to run in an enterprise’s private cloud, on Google’s AppEngine, or on other public clouds committed to similar openness.”

So with all this openness and commitment to open standards is the ability to move from cloud to cloud seamlessly just around the corner? This remains to be seen it but the initiatives all seem to be well-intentioned and moving in the right direction.

Does Open Source Prevent Cloud Lock-in?

I don’t know whether Rackspace’s OpenStack will truly prevent cloud lock-in but it does seem to be well-intentioned. Though I  believe the following things need to happen to insure cloud lock-in doesn’t become a rampant problem:

  • Virtualization Portability – At a very simple level users need to be able to move from virtualization technologies including those hosted in the cloud need to be able to migrate seamlessly, that includes VMs running in VMware , Xen, HyperV and KVM. Then once in the cloud they need to be able to move across clouds both public and private unencumbered — Amazon, Rackspace, Eucalyptus, Ubuntu Enterprise Cloud and others. Adoption of a widespread virtualization standard like Open Virtualization Format (OVF) could help (OpenStack does seem to already support OVF, a good sign).
  • Data Portability – Just as compute environments move so should data, but not only move but be accessible across network and cloud infrastructures with high fidelity.  Security of that data goes without saying but adds another layer of complexity.
  • Cross Environment Tools – Finally, the tools to managing these environments need to manage both cloud and legacy architectures to insure that  the management of these new computing paradigms don’t make things even more complicated.

I hope OpenStack helps drive this vision. However to deliver on the true vision of true portability across cloud platforms other cloud providers and vendors other than Rackspace will have to participate.

Related Articles

Technorati Tags: , , , , , , , , , , , ,


PlanetMySQL Voting: Vote UP / Vote DOWN

linux.conf.au 2011 CFP Open!

Июль 15th, 2010

Head on over to http://lca2011.linux.org.au/ and check it out!

You’ve got until August 7th to put in a paper, miniconf, poster or tutorial.

Things I’d like to see come from my kinda world:

  • topics on running large numbers of machines
  • latest in large scale web infrastructure
  • latest going on in the IO space: (SSD, filesystems, SSD as L2 cache)
  • Applications of above technologies and what it means for application performance
  • Scalable and massive tcp daemons (i.e. Eric should come talk on scalestack)
  • exploration of pain points in current technologies and discussion on ways to fix them (from people really in the know)
  • A Hydra tutorial: starting with stock Ubuntu lucid, and exiting the tutorial with some analysis running on my project.
  • Something that completely takes me off guard and is awesome.

I’d love to see people from the MySQL, Drizzle and Rackspace worlds have a decent presence. For those who’ve never heard of/been to an LCA before: we reject at least another whole conference worth of papers. It’s the conference on the calendar that everything else moves around.


PlanetMySQL Voting: Vote UP / Vote DOWN

A review of Cloud Application Architectures by George Reese

Июль 5th, 2010
Cloud Application Architectures

Cloud Application Architectures

Cloud Application Architectures. By George Reese, O’Reilly 2009. (Here’s a link to the publisher’s site).

This is a great book on how to build apps in the cloud! I was happy to see how much depth it went into. It’s short — 150 pages plus some appendixes — so I was expecting it to be a superficial overview. But it isn’t. It is thorough. And it is also obviously built on his own experience building very specific applications that he uses to run his business — he isn’t preaching about stuff he doesn’t know first-hand. Finally, George Reese is a good writer! It’s impressive. This is how he covers so much ground with so much depth in so few pages, and it all makes sense. He takes a side trip every now and then, but it’s always in the right place at the right time — how to do a snapshot for backups, for example — and isn’t distracting. For a technical book, it has an amazing narrative flow.

The book begins with an intro to cloud computing in general, with definitions and an explanation of different models, plus cost estimates of traditional IT, managed hosting, and cloud computing for an app. There’s a brief overview of the Amazon platform. This book is mostly about Amazon, and states that up front. There are references and comparisons to other providers throughout, and later there’ll be two appendixes on GoGrid and Rackspace, each written by a representative of that company. I was happy that the author brought in people to write those, instead of doing it himself. They are non-promotional in nature, and quite short. That adds value to the book, which would have been fine without them, honestly.

Back to chapter two now — a deeper introduction to Amazon, moving through all the major components, but especially EC2, S3, and EBS. Here we also start to see a focus on the platform as a whole — availability zones, security, redundancy, reliability. These topics are treated fairly and woven into every chapter. It’s clear that the author doesn’t want to isolate these topics, but rather explain them in context so your mind is always on them as each new topic is introduced. Chapter 3 picks all this up again: considering a move into the cloud? More cost comparisons, more explanations of concepts such as availability and how they translate into the Amazon cloud. Performance, disaster recovery and a few other topics show up here.

Chapter 4 is about how to build an app in the cloud: web app design, making multiple machines work together, handling failure, building AMIs, privacy, and operating databases (especially MySQL) in the cloud. The privacy section is particularly good. I’d recommend this to anyone building an app that might process personally identifiable information or financial information, in or out of the cloud. And as I said already, this is one of the types of things he weaves into the whole book. Chapter 5 picks right up and keeps going: it’s about security. Data security, regulatory compliance, network security, host security, how to respond if there’s a breach. And then Chapter 6 is on disaster recovery: planning, implementing, managing.

Chapter 7 is titled “scaling,” but it’s more than that. It starts with capacity planning. Here’s one of my favorite quotes: “some think they no longer need to engage in capacity planning… [others] think of tens or hundreds of thousands of dollars in consulting fees. Both thoughts are dangerous myths…” There’s a reference to John Allspaw’s excellent book on capacity planning. (I saw that he was a tech reviewer for this book, too.) This chapter covers how you predict and provision for capacity needs in the cloud, including the “automatic scaling” holy grail, how it can bite you, and how to keep that from happening. It also talks about how you scale vertically in the cloud. It doesn’t talk about why it’s hard to really be sure about your capacity needs in the cloud, but that’s okay given the other material covered in the chapter.

And that’s it! After this, it’s 3 appendixes. One is an AWS reference, and then there’s the two on GoGrid and Rackspace.

What’s to criticize? Well, not a lot really. I read every word in this book, I promise. Here’s what I noticed: he talked about database corruption from unexpected shutdowns — he should have said “use InnoDB,” because that’s pretty much a MyISAM problem. He talked about taking backups from replication slaves — he should have said “don’t just trust replication, verify it with mk-table-checksum.” I also think he encourages a little too much trust that cloud providers are always magically going to have the capacity you need; it felt a bit naive, but this is actually a fundamental point in whether you’re going to use the cloud or not. Nobody knows how much excess capacity Amazon has, and as we know, weird things happen. But if you’re going to embrace a cloud platform, you’re going to have to trust to a certain extent.

A couple other things to nitpick: in Chapter 1, when talking about availability, he writes “[if] even 1 minute of downtime in a year is entirely unacceptable, you almost certainly want to opt for a managed services environment… [if] 99.995% is good enough, you can’t beat the cloud.” But these numbers are unrealistic and don’t have enough context to explain what he means. Finally, in a couple of places he talks about algorithms for generating unique identifiers and dealing with concurrent access, but these don’t have a deep enough explanation to prevent novices from shooting themselves in the foot with wrong assumptions such as a timestamp will always increase between each subsequent access. But a savvy developer will recognize those problems and won’t be bitten.

This book is the first one to go onto my list of essential books in a while. I’ll be keeping this one on my own bookshelf.

Related posts:

  1. Review of Scalable Internet Architectures by Theo Schlossnagle
  2. Under-provisioning: the curse of the cloud
  3. A review of The Art of Capacity Planning by John Allspaw
  4. A review of Web Operations by John Allspaw and Jesse Robbins
  5. Book Review: Building powerful and robust websites with Drupal 6


PlanetMySQL Voting: Vote UP / Vote DOWN