Archive for the ‘PostgreSQL’ Category

International Women’s Day

Март 8th, 2010

If you do not know what International Women’s Day is: http://www.internationalwomensday.com/

Start planning your blog posts for Ada Lovelace day now (March 24th, http://findingada.com/ Ada Lovelace Day is an international day of blogging (videologging, podcasting, comic drawing etc.!) to draw attention to the achievements of women in technology and science.)

To that end, I would like to point out all the women currently in science and tech fields that I admire and think are doing great things. I think it would be great if everyone, male or female, made a list like this:

The women that have taught me science/tech along the way:

High School:
Mary Lou Ciavarra (Physics)
Maria Petretti (Pre-Algebra, and Academic Decathlon)
Reneé Fishman (Biology)
Lisa Acquaire (Economics during Academic Decathlon)

College:
Professor Kalpana White (Biology), and in whose fruit fly lab I worked for 2 semesters.
Professor Eve Marder (Introductory Neuroscience)

Though Brandeis does have female faculty in the Computer Science department, I did not manage to have any classes with female Computer Science faculty members.

My current female DBA co-workers at Pythian: Isabel Pinarci (Oracle), Michelle Gutzait (SQL Server), Catherine Chow (Oracle) and Jasmine Wen (Oracle).

And to folks in the greater MySQL/tech community and tech co-workers past and present, especially those I have been inspired and helped by: Tracy Gangwer, Selena Deckelmann (Postgres), Amy Rich, Anne Cross, and more (If I have forgotten you, I apologize!).


PlanetMySQL Voting: Vote UP / Vote DOWN

International Women’s Day

Март 8th, 2010

If you do not know what International Women’s Day is: http://www.internationalwomensday.com/

Start planning your blog posts for Ada Lovelace day now (March 24th, http://findingada.com/ Ada Lovelace Day is an international day of blogging (videologging, podcasting, comic drawing etc.!) to draw attention to the achievements of women in technology and science.)

To that end, I would like to point out all the women currently in science and tech fields that I admire and think are doing great things. I think it would be great if everyone, male or female, made a list like this:

The women that have taught me science/tech along the way:

High School:
Mary Lou Ciavarra (Physics)
Maria Petretti (Pre-Algebra, and Academic Decathlon)
Reneé Fishman (Biology)
Lisa Acquaire (Economics during Academic Decathlon)

College:
Professor Kalpana White (Biology), and in whose fruit fly lab I worked for 2 semesters.
Professor Eve Marder (Introductory Neuroscience)

Though Brandeis does have female faculty in the Computer Science department, I did not manage to have any classes with female Computer Science faculty members.

My current female DBA co-workers at Pythian: Isabel Pinarci (Oracle), Michelle Gutzait (SQL Server), Catherine Chow (Oracle) and Jasmine Wen (Oracle).

And to folks in the greater MySQL/tech community and tech co-workers past and present, especially those I have been inspired and helped by: Tracy Gangwer, Selena Deckelmann (Postgres), Amy Rich, Anne Cross, and more (If I have forgotten you, I apologize!).


PlanetMySQL Voting: Vote UP / Vote DOWN

Log Buffer #181: a Carnival of the Vanities for DBAs

Март 5th, 2010

The 181st edition of Log Buffer has been published by Gary Myers on his Sydney Oracle Lab.

Having recently moved his blog, Gary approached the Log Buffer coordinator to volunteer for an edition because he knows that, with LB being a popular and established destination in the database blogoshphere, it would help him broadcast his new blog and welcome readers to it. You can do it too–simply send an email to the Log Buffer coordinator.

Here’s Gary’s Log Buffer #181.


PlanetMySQL Voting: Vote UP / Vote DOWN

Log Buffer #181: a Carnival of the Vanities for DBAs

Март 5th, 2010

The 181st edition of Log Buffer has been published by Gary Myers on his Sydney Oracle Lab.

Having recently moved his blog, Gary approached the Log Buffer coordinator to volunteer for an edition because he knows that, with LB being a popular and established destination in the database blogoshphere, it would help him broadcast his new blog and welcome readers to it. You can do it too–simply send an email to the Log Buffer coordinator.

Here’s Gary’s Log Buffer #181.


PlanetMySQL Voting: Vote UP / Vote DOWN

Log Buffer #180: a Carnival of the Vanities for DBAs

Февраль 26th, 2010

Hello and welcome to Log Buffer #180. Time’s a-wastin’, so let’s go!

Oracle

There was so much Oracle stuff this week that I’ve decided to cram a little more of it into Log Buffer by providing a little less context than usual.

Jonathan Lewis shares an explication of aliases: “I was asked the following question recently: ‘Does the use of table aliases affect performance?’ To which the best answer is probably ‘Yes, though in general you probably won’t notice the difference and there are reasons more imporant [sic] than performance for using table aliases.’”

Doug Burns continues his most recent series: Statistics on Partitioned Tables – Part 2, and Statistics on Partitioned Tables – Part 3.

Charles Schultz demonstrates how VPD + bad ANYDATA practices can really bite: “The point of my blog was that using CAST can really screw up your data. Oracle Support is filing a bug on this behavior, as it looks like an overflow problem.”

Pythian’s Gleb Otochkin begins a series on Oracle GoldenGate installation.

Guy Harrison provides a thorough introduction and recommendations on memory management for Oracle databases on VMWare ESX.

Robert Vollman returns to blogging and offers his 10-point plan on improving your SQL queries.

Jared Still sheds some light on a cool but unknown RMAN feature.

Richard Foote knocks holes in another myth: “One of the great myths in Oracle is that bitmap indexes are only suitable and should only be used with columns that have so-called low cardinality (few distinct) values.

Alexander Kornbrust shares a link to a really good whitepaper about “Hacking Oracle from the Web” by Sumit Siddarth.

Eddie Awad shares a link to a SQL injection prevention cheat sheet.

Charles Hooper answers the question, What is the meaning of the %CPU column in an explain plan?.

Meanwhile, Harald van Breederode does the same for this one: Why does the size of my ORACLE_HOME increase?

SQL Server

Thomas LaRock gives an recap of MS’s 2010 MVP Summit. Quotable take-away: “If I had to compare SQL 2008 R2 to SQL Server 4.0, I would say the difference is the same as comparing an F1 race car to a Chevy Vega.”

Half a world away, there is the SQLSocial Event – London March 16th, as advertised by Simon Sabin.

Simon also shares a script to get indexes and their included columns, beginning, “I get increasingly frustrated with the lack of visibility of included columns in management studio and from the system stored procedures sp_… This is a query that returns all indexes and there key and include columns[.]”

Andy Leonard throws us another nourishing SSIS snack: conditional split outputs.

Here’s Rob Farley with a book review of an oldie but a goodie: Inside SQL 2005 Query Tuning and Optimization, by Kalen Delaney et al. “If you spend any time tuning SQL Server databases, then this book will feel much thicker than it really is, and you’ll be finding useful information on just about every page.”

Thomas LaRock, meanwhile, writes that SQL Server 2008 Query Performance Tuning Distilled is
a good way to start your day. “Each morning, while I wait for my desktop to boot, I pick up their book, turn to any page, and just start reading.”

MySQL

Sticking with the theme a little longer, here is Baron Schwartz with a review of Understanding MySQL Internals by Sasha Pachev. “I should have read this book a long time ago, and it’s my loss that I didn’t.  . . .  Overall, this book is easily a high 4 stars on a scale of 5, and again, anyone seriously using MySQL should have it.”

Baron also shares a link to Oracle guy Cary Millsap’s Thinking Clearly about Performance paper.

Brian “Krow” Aker starts an extensive conversation with his post, Protocols, The GPL, Influences from MySQL. His thesis, “MySQL was the company that had the most influence on how companies and investors viewed the GPL.”

Paul Vallée of Pythian responds with his ideas on product management, effective developers, and the future of MySQL. “ . . . the future of MySQL, Drizzle, Monty Program, the Percona fork, etc.” to be more precise.

Colin Charles provides news of what’s been happening recently in MariaDB #1.

Mohammad Lahlouh wonders, can I use latin1 to store utf8 data? and gets several answers from his readers.

He might have asked Ronald Bradford, who knows this stuff. Here is his post on migrating MySQL latin1 to utf8 – character set options.

Pursuing a similar matter (collations), Roland Bouman opines, the best stored routine is the one you don’t write.

PostgreSQL

Baron Schwartz again! He announces, mk-query-digest now supports Postgres logs.

David Fetter says, part(ition)ing is such sweet sorrow. “There are excellent references on partitioning tables that depend on one table, but what happens when you need to partition the referenced table? Let’s find out!”

Bruce Momjian is here with news on the Python driver confusion.

Jon Jensen of End Point’s Blog posts a HOWTO on PostgreSQL EC2/EBS/RAID 0 snapshot backup.

NoSQL, Etc.

Chen Shapira has been at the compass and protractor, mapping the NoSQL space and returns from terra incognita unscathed.

Ronald Bradford has been getting started with Cassandra, one of the outposts on Chen’s map, and shares his steps.

Arnie Rowland says, “Mark your calendar! Portland SQLSaturday/CodeCamp/Barcamp 2010 is scheduled for May 22, 2010, at the University of Portland campus.  . . .  Portland SQLSaturday is encouraging presentations related to interoperability of any of the SQL platforms, including T-SQL (SQL Server), PostgreSQL, MySQL, and PL-SQL. Abstracts for Platform specific sessions are also encouraged.”

Okay, that is all for this edition. You guys are running me ragged! Fortunately, Gary Myers picks it up next week on his Sydney Oracle Lab. Till then!


PlanetMySQL Voting: Vote UP / Vote DOWN

Cary Millsap: Thinking Clearly about Performance

Февраль 23rd, 2010

Cary Millsap has a concise, readable paper on performance. Anyone involved in database performance optimization should read it. Cary’s writing has heavily influenced my mk-query-digest tool for analyzing MySQL/PostgreSQL/Memcached/HTTP query performance, and I think you’ll get a lot more from mk-query-digest if you read this paper — and you should also read his book, reviewed here. It’s one of the top books on my Essential Books List.

Related posts:

  1. A review of Optimizing Oracle Performance by Cary Millsap Optimizing
  2. Sessions of interest at the Percona Performance Conference Having wri
  3. An ongoing thread of blogs on MySQL performance In the las

Related posts brought to you by Yet Another Related Posts Plugin.


PlanetMySQL Voting: Vote UP / Vote DOWN

Learn about mk-query-digest at PgEast 2010

Февраль 20th, 2010

I’ll be attending PgEast this year, as I’ve done for the last couple of years, and this year I’ll also be speaking. The topic is query analysis with mk-query-digest. The official description of my talk is as follows:

mk-query-digest is a powerful open-source tool for capturing, filtering, transforming, and aggregating queries, with the ability to do all sorts of other advanced tasks too. By default, it aggregates similar queries together and presents a designed-for-DBAs report with statistics about the most important queries, so you can see where to focus your optimization efforts. This talk shows you how to use mk-query-digest to analyze your Postgres server’s workload.

PgEast 2010 has an impressive lineup of talks, which isn’t even complete yet. I’ve thoroughly enjoyed this conference in the past, and it looks like this year will be better than ever.

Related posts:

  1. mk-query-digest now supports Postgres logs Maatkit do
  2. mk-query-digest now understands HTTP You used t
  3. Learn about Maatkit at the MySQL Conference I’m

Related posts brought to you by Yet Another Related Posts Plugin.


PlanetMySQL Voting: Vote UP / Vote DOWN

mk-query-digest now supports Postgres logs

Февраль 20th, 2010

Maatkit does more than just MySQL. I’ve just committed a new version of mk-query-digest, a powerful log analysis tool, with support for Posgtres logs, in both syslog and stderr format. I’m hoping that people will give this a spin in the real world. I have lots of test cases but that’s never enough.

A brief tutorial:

# Get it
$ wget http://www.maatkit.org/trunk/mk-query-digest

# Run it
$ perl mk-query-digest --type pglog /path/to/log/file

# Learn about it (search for the string "pglog")
$ perldoc mk-query-digest

I’m going to close comments on this blog post so I don’t get bug reports in the comments. If you have feedback, please post it to the Maatkit mailing list, or the Maatkit issue tracker. Or reply to the thread I just started on the Postgres mailing list.

Related posts:

  1. mk-query-digest now understands HTTP You used t
  2. I’m a Postgres user, as it turns out Someone re
  3. How MySQL really executes a query WARNING: n

Related posts brought to you by Yet Another Related Posts Plugin.


PlanetMySQL Voting: Vote UP / Vote DOWN

Log Buffer #179: a Carnival of the Vanities for DBAs

Февраль 19th, 2010

You have found the 179th edition of Log Buffer, the weekly review of database blogs. Welcome. Enjoy your stay. We begin with . . . 

SQL Server

Merrill Alrich gets going with a fresh juxtaposition–his thoughts on motorcycles and access. “Many DBAs,” he writes, “have been called in to rescue people, or teams, or projects who have mission critical Access applications gone horribly wrong. It’s very unpleasant, especially the typical discussion we have to have with the Access afficionado . . . ”

Brent Ozar is in on this discussion too. Here he gives his top 10 reasons why access still doesn’t rock.

Brent’s blog also has an interview with Joe Sack, “ . . . public face for the SQL MCM program.”

Aaron Bertrand has a couple new items in his Bad Habits to Kick series: inconsistent table aliasing and blind SQL Server installs.

The Rambling DBA Jonathan Kehayias, advises, pay attention to maintenance cleanup job configuration, specifically with regard to backup files.

Jeremiah Peschka show how a simple refactoring of functions in the WHERE clause can turn a performance disaster into success. He says, “Putting functions in the where clause of a SQL Statement can cause performance problems. If you aren’t careful, these problems can add up and bring a powerful production system to its knees.  . . .  [If] we move the function from the table to the variable . . . [this] is going to be orders of magnitude faster . . . ”

John Paul Cook introduces his favorite free tool, SQL Heartbeat. “There are many great scripts available,” writes John, “for monitoring SQL Server. But admit it, don’t you miss animation and color? I’ve been using SQL Heartbeat for about two weeks and I really appreciate the animation and color. These features make the output highly effective and easy to grasp.”

MySQL

Mark Callaghan gets our MySQL stuff started. “What is the future for MyISAM?” He asks. “MySQL has invested a lot in storage engines over the past few years (Falcon, Maria) and it isn’t clear that anything will come from those efforts. A lot of effort has been put into InnoDB and much will come from that. There has not been a significant effort to improve MyISAM (other than hot backup). What could be done with it?” The post is Save MyISAM.

The ThetaJoin Blog features a review of the MySQL Administrator’s Bible. The author writes, “On the front cover of MySQL Administrator’s Bible is a sentence that reads: ‘The book you need to succeed!’ I must say, I do agree.”

On the MySQL Performance Blog, Vadim compares READ-COMMITED vs REPETABLE-READ in tpcc-like load. “[The] question, what is better isolation level is poping up again and again. Recently it was discussed in InnoDB : Any real performance improvement when using READ COMMITED isolation level ? and in Repeatable read versus read committed for InnoDB . Serge in his post explains why READ COMMITED is better for TPCC load, so why don’t we take tpcc-mysql bencmark and check on results.”

Giuseppe Maxia, the Data Charmer, reports on the Linux MySQL distros meeting in Brussels. He says, “When I saw Shlomi’s post on why not to use apt-get or yum for MySQL, I thought immediately that his conclusions are quite reasonable. What you get from the Linux distributions is not the same thing that you find in the official MySQL downloads page.  . . .  We at the MySQL team have organized a meeting with the Linux distributions with the intent of finding out which differences and problems we may have with each other, and to solve them by improving communication.”

Speaking of distros, here is Falko’s How To Set Up MySQL Database Replication With SSL Encryption On CentOS 5.4 on HOWTOForge.

PostgreSQL

Meanwhile, in the PG ’sphere, Jeremiah Peschka walks through installing PostgreSQL on Mac OS X, “ . . . a pretty simple process, but one that I thought I would document because I ran into a few gotchas along the way.”

Leo Hsu and Regina Obe introduce the subject of regular expressions in PostgreSQL, on the Postgres OnLine Journal. “Every programmer,” they say, “should embrace and use regular expressions (INCLUDING Database programmers). There are many places where regular expressions can be used to reduce a 20 line piece of code into a 1 liner. Why write 20 lines of code when you can write 1.”

Marc Balmer’s PostgreSQL blog looks into using PostgreSQL to decouple applications, or, OO meets SQL (Part I): “What works at the object level in an object oriented programming language can be applied to distributed PostgreSQL applications as well, allowing for proper decoupling of applications or application parts. The tools needed for this are PostgreSQL’s trigger procedures and the asynchronous notification mechanism. In this first installment of a small series of blog entries, I will talk about the basics needed to implement a distributed decoupled software solution.”

Oracle

The Oracle world is full of sturm und drang this week. Accusations, recriminations and enigmas–of the technical kind–abound.

Kerry Osborne asserts that autotrace lies too, following up an older post of his on explain plan’s untruths: “Autotrace is another commonly used tool that suffers from the same basic character flaw (i.e. it lies too). The reason is simple. It uses the Explain Plan command under the covers.”

Richard Foote, for his part, looks for answers to this puzzler: how does an execution plan suddenly change when the statistics (and everything else) remains the same?

Charles Hooper, meanwhile, wonders aloud, V$FILESTAT is Wrong?

According to James Morle, The Oracle Wait Interface Is Useless (sometimes), the third in a series. “In this part I will finally get to the point, and talk about some alternative techniques for determining the reasons for poor performance for our example user session.” Pythian’s Mark Brinsmead says this is “hyper-cool.”

On The Blog from the DBA Classroom, Joel Goodman shows how to tell RAC to leave your leaves alone. “One use of sequences about which most DBAs and developers are aware is that of generating primary keys for a table and naturally the key data for the underlying index.  . . .  But the choice of sequence parameters can have an effect on performance when using high volume insert applications in a RAC database environment. The problem is one of Index Leaf Block contention . . . ”

Doug Burns is here with the first part of a series on statistics on partitioned tables, which he introduces thus: “We’ve encountered a few problems at work recently and I decided it would be an idea to put together a series of posts covering the basics of what can become quite an involved topic because it’s not difficult to find yourself going round in circles reading the documentation, Oracle Support Notes, blog posts, forum threads and the rest until you don’t know whether you’re coming or going!”

Martin Widlake has a thoughtful post on the DBA and his or her job, called, Making Things Better Makes Things Worse. He say, “I’m encountering a phenomenon that I have talked about with Dennis Adams a couple of times. It probably has a proper term, but basically it is the odd situation that when you make things better, you get more complaints.”

That’s all for this issue. See you for LB#180!


PlanetMySQL Voting: Vote UP / Vote DOWN

Log Buffer #179: a Carnival of the Vanities for DBAs

Февраль 19th, 2010

You have found the 179th edition of Log Buffer, the weekly review of database blogs. Welcome. Enjoy your stay. We begin with . . . 

SQL Server

Merrill Alrich gets going with a fresh juxtaposition–his thoughts on motorcycles and access. “Many DBAs,” he writes, “have been called in to rescue people, or teams, or projects who have mission critical Access applications gone horribly wrong. It’s very unpleasant, especially the typical discussion we have to have with the Access afficionado . . . ”

Brent Ozar is in on this discussion too. Here he gives his top 10 reasons why access still doesn’t rock.

Brent’s blog also has an interview with Joe Sack, “ . . . public face for the SQL MCM program.”

Aaron Bertrand has a couple new items in his Bad Habits to Kick series: inconsistent table aliasing and blind SQL Server installs.

The Rambling DBA Jonathan Kehayias, advises, pay attention to maintenance cleanup job configuration, specifically with regard to backup files.

Jeremiah Peschka show how a simple refactoring of functions in the WHERE clause can turn a performance disaster into success. He says, “Putting functions in the where clause of a SQL Statement can cause performance problems. If you aren’t careful, these problems can add up and bring a powerful production system to its knees.  . . .  [If] we move the function from the table to the variable . . . [this] is going to be orders of magnitude faster . . . ”

John Paul Cook introduces his favorite free tool, SQL Heartbeat. “There are many great scripts available,” writes John, “for monitoring SQL Server. But admit it, don’t you miss animation and color? I’ve been using SQL Heartbeat for about two weeks and I really appreciate the animation and color. These features make the output highly effective and easy to grasp.”

MySQL

Mark Callaghan gets our MySQL stuff started. “What is the future for MyISAM?” He asks. “MySQL has invested a lot in storage engines over the past few years (Falcon, Maria) and it isn’t clear that anything will come from those efforts. A lot of effort has been put into InnoDB and much will come from that. There has not been a significant effort to improve MyISAM (other than hot backup). What could be done with it?” The post is Save MyISAM.

The ThetaJoin Blog features a review of the MySQL Administrator’s Bible. The author writes, “On the front cover of MySQL Administrator’s Bible is a sentence that reads: ‘The book you need to succeed!’ I must say, I do agree.”

On the MySQL Performance Blog, Vadim compares READ-COMMITED vs REPETABLE-READ in tpcc-like load. “[The] question, what is better isolation level is poping up again and again. Recently it was discussed in InnoDB : Any real performance improvement when using READ COMMITED isolation level ? and in Repeatable read versus read committed for InnoDB . Serge in his post explains why READ COMMITED is better for TPCC load, so why don’t we take tpcc-mysql bencmark and check on results.”

Giuseppe Maxia, the Data Charmer, reports on the Linux MySQL distros meeting in Brussels. He says, “When I saw Shlomi’s post on why not to use apt-get or yum for MySQL, I thought immediately that his conclusions are quite reasonable. What you get from the Linux distributions is not the same thing that you find in the official MySQL downloads page.  . . .  We at the MySQL team have organized a meeting with the Linux distributions with the intent of finding out which differences and problems we may have with each other, and to solve them by improving communication.”

Speaking of distros, here is Falko’s How To Set Up MySQL Database Replication With SSL Encryption On CentOS 5.4 on HOWTOForge.

PostgreSQL

Meanwhile, in the PG ’sphere, Jeremiah Peschka walks through installing PostgreSQL on Mac OS X, “ . . . a pretty simple process, but one that I thought I would document because I ran into a few gotchas along the way.”

Leo Hsu and Regina Obe introduce the subject of regular expressions in PostgreSQL, on the Postgres OnLine Journal. “Every programmer,” they say, “should embrace and use regular expressions (INCLUDING Database programmers). There are many places where regular expressions can be used to reduce a 20 line piece of code into a 1 liner. Why write 20 lines of code when you can write 1.”

Marc Balmer’s PostgreSQL blog looks into using PostgreSQL to decouple applications, or, OO meets SQL (Part I): “What works at the object level in an object oriented programming language can be applied to distributed PostgreSQL applications as well, allowing for proper decoupling of applications or application parts. The tools needed for this are PostgreSQL’s trigger procedures and the asynchronous notification mechanism. In this first installment of a small series of blog entries, I will talk about the basics needed to implement a distributed decoupled software solution.”

Oracle

The Oracle world is full of sturm und drang this week. Accusations, recriminations and enigmas–of the technical kind–abound.

Kerry Osborne asserts that autotrace lies too, following up an older post of his on explain plan’s untruths: “Autotrace is another commonly used tool that suffers from the same basic character flaw (i.e. it lies too). The reason is simple. It uses the Explain Plan command under the covers.”

Richard Foote, for his part, looks for answers to this puzzler: how does an execution plan suddenly change when the statistics (and everything else) remains the same?

Charles Hooper, meanwhile, wonders aloud, V$FILESTAT is Wrong?

According to James Morle, The Oracle Wait Interface Is Useless (sometimes), the third in a series. “In this part I will finally get to the point, and talk about some alternative techniques for determining the reasons for poor performance for our example user session.” Pythian’s Mark Brinsmead says this is “hyper-cool.”

On The Blog from the DBA Classroom, Joel Goodman shows how to tell RAC to leave your leaves alone. “One use of sequences about which most DBAs and developers are aware is that of generating primary keys for a table and naturally the key data for the underlying index.  . . .  But the choice of sequence parameters can have an effect on performance when using high volume insert applications in a RAC database environment. The problem is one of Index Leaf Block contention . . . ”

Doug Burns is here with the first part of a series on statistics on partitioned tables, which he introduces thus: “We’ve encountered a few problems at work recently and I decided it would be an idea to put together a series of posts covering the basics of what can become quite an involved topic because it’s not difficult to find yourself going round in circles reading the documentation, Oracle Support Notes, blog posts, forum threads and the rest until you don’t know whether you’re coming or going!”

Martin Widlake has a thoughtful post on the DBA and his or her job, called, Making Things Better Makes Things Worse. He say, “I’m encountering a phenomenon that I have talked about with Dennis Adams a couple of times. It probably has a proper term, but basically it is the odd situation that when you make things better, you get more complaints.”

That’s all for this issue. See you for LB#180!


PlanetMySQL Voting: Vote UP / Vote DOWN