Writing Database Code In Rails: Migrations and Neutrility
Posted Wed, 23 Nov 2005 17:12:24 GMT to Posted in Programming Tags migrations, rails, ruby
As I recently said, I switched TigerEvents over from using a MySQL input file to migrations. There were many decisions behind this choice.
- I wanted the code to be database neutral. Using migrations would allow me to automatically support MySQL, PostgreSQL, and SQLite by making changes to one file instead of 3.
- I know that the database for TigerEvents is going to change at some point in the near future. Rather than write multiple upgrade scripts manually, I could just use the built in migration functions to do it better.
- If for some reason I make a mistake and problems occur on the production site, it is easier to roll back to a previous version using migrations than dump the database, roll back the code, reload the database, etc. (Note, we still make sure to back up the database, cause not doing that is just tempting fate)
- Migrations will, in theory, make development easier. Someone made a change to the schema? Just run rake migrate, and your database is automatically updated. I think this is easier than reinitializing a database and then loading it with data repeatedly (though the second part of this can be handled with Fixtures).
While an awesome introduction to getting started with Migrations has already been written, I feel that the ‘database neutral’ stuff is missing. Using my own experience, several items must be taken into account, mostly due to MySQL usage and certain programming that were being used.
Already Existant Database
So we already had a database. A somewhat large database with a number of tables. I didn’t want to spend a lot of time manually creating the migration code from scratch. Was there a quicker way? You betcha. There was a rake task called db_schema_dump which will dump all the table data from your database to a file called schema.rb. So, once we do that, all we have to do is copy this information to the self.up section of a migration and we are done, right? Unfortunatly, it is NOT that easy. The problem is (in my case) that MySQL data types do not necessarily behave the same. Mostly, this deals with constraints, but also has to do with the boolean data type itself.
Constraints
In MySQL, you can do int(8). In rails, this would be translated into something like:integer, :limit => 8. However, PostgreSQL only has smallint, integer, and bigint, not constraint values. Therefore, the previous rails code returns an error, as PostgreSQL chokes on the SQL command that rails sends it. Therefore, while it is true that you don’t need to know specific SQL syntax, you DO need to know what kind of data you can feed various databases. For TigerEvents, I just generalized items and made the following changes:
:integer, :limit => 8 became :integer
:integer, :limit => 1 which translated to tinyint(1) in MySQL became :boolean
:timestamp became :datetime (which gave me extra functionality). Now, I by no means know if these are the optimal solutions, but they did work.
Boolean Data
We have a few boolean flags in our database. However, MySQL does not have a boolean data type. Instead, tinyint(1) is generally used. Well, generally that is ok, as ActiveRecord will automatically use tinyint(1) when connecting to MySQL and boolean when connecting to PostgreSQL and SQLite. However, since we started out with MySQL, there were numerous statements such as@newgroups = Group.find(:all, :conditions => ["approved = 0"])@newgroups = Group.find(:all, :conditions => ["approved = ?", false])So once again, while you don’t need to know specific SQL query usage, it is important to keep data types general, and passed values abstracted if you want to create database code which is database neutral (at least for MySQL, PostgreSQL, and SQLite).

I don’t understand it, but fuck if that isn’t sexy as hell.
Honey, you’re every geek’s dream. And Anne, it’s cool that you support him.
=P
Hey, have you noticed whether the active migration supports ENUM datatypes, or what I should use instead of enums? I like using enums for status flags. But when I try to db_schema_dump by table with enums in it, it chokes.
I am having a similar problem with blob data abstraction. I want to store large-ish files in my database and migration converts it to plain ol “blob” which chokes on my files. MySQL is expecting LONGBLOB but I am not sure how/if I can you your limit example to accomplish the same thing.
Thanks!
Gary, I unfortunatly haven’t tried using ENUM datatypes yet. Of course, in the near future, I am most likely going to require them. Ah…..... refactoring. The following page http://wiki.rubyonrails.com/rails/pages/HowtoUseSetAndEnumColumns says that ENUM is not supported, but gives some workarounds (I haven’t tried them).
heavysixer, like ENUMS, I haven’t used blob data either. http://wiki.rubyonrails.com/rails/pages/HowtoStoreFilesInTheDatabase indicates they can be used, but isn’t very verbose. The apidocs seem to indicate that it is binary data (http://api.rubyonrails.com/classes/ActiveRecord/ConnectionAdapters/MysqlAdapter.html) under native_database_types(), but I am not sure if you can use limit. My best guess would be to use :limit => 4294967300. This is based off reading the MySQL docs on storage size, where longblob storage is L+4 where L
“However, PostgreSQL only has smallint, integer, and bigint, not constraint values.”
Thanks. I have been banging my head for a couple of hours trying to figure out why
table.column :myint, :integer, :limit => 10
wasn’t working with Postgresql. I wonder if these limitations are all documented somewhere or if you just have to figure it out from the DB you are working with.
Eden: I am fairly certain this requires a certain level of familiarity with the database you are using, though a single page which has these items, and ties them in with what is possible for migrations would be nice.
Nice for the schema. But how to migrate the data from one db to another as well? Say from MySQL to Sqlite?
[...] http://torch.cs.dal.ca/~ssmith/?p=41 [...]
Hi!
... concerning your LARGEBLOB problems.. just use:
t.column :data, :binary, :limit => 10.megabyte
in your migration to define bigger BLOBs-aaalex