Feed on
Posts
Comments

Archive for the ‘data’ Category

Amazon today announced that later this year, Windows Server woud be available on EC2. No details on cost and licensing etc. but this is major.  Up until now, that portion of the business world who are pure MS shops (a very large percentage especially amongst SMEs) were excluded from taking advantage of Amazon’s amazing (and [...]

Read Full Post »

I’m a fan of most things Web2.0, not just for personal use but as business tools.  Over the last four years or so I’ve enthusiastically embraced Wikis, IM (Google Talk), RSS Readers et al. I could see the benefit and attraction of social network sites such as Facebook even if I’ve not partaken as such. [...]

Read Full Post »

Just back from a break in Clifden, Connemara, summer is nearly over, the kids return to school today, back to work.
Counties Galway and Mayo were like the rest of the country last week, a tad wet, but unlike the developed east of the island, flooding was not a problem; a problematic drainage area is called [...]

Read Full Post »

… well, at least for me.  Let me explain.
For most of my datasmithing career, I’ve had access to corporate Oracle databases and now with the availability of  Oracle10g  Express I can even run my own Oracle instances at home or on EC2.  The combination of a powerful SQL engine, expressive scripting language (PL/SQL) ,OS independence, [...]

Read Full Post »

Although I had decided to use Talend (Java version) as my primary datasmithing tool I still had one major problem with it, its lack of a scripting tool.  Kettle (Pentaho PDI) has Javascript, Excel has VBA, Picalo has (well OK, is) Python and Talend in its Perl version has Perl.  I could have gone (and [...]

Read Full Post »

Image via Wikipedia
Although my data-smithing tool box is full to the brim with powerful tools such as Talend, Kettle PDI, Picalo and Excel, all backed by the cloud infrastructure of Amazon’s S3, SImpleDB and EC2, there’s one simple yet powerful tool that I always seem to gravitate back to, that tool is SQLite.
Now obviously being [...]

Read Full Post »

These last two weeks, Python has been on my mind. First off, last week I decided to make time to fully investigate Picalo, an open-source Python-based data analysis tool, and then, this week, Google announced their long awaited cloud-computing offering, Google Apps Engine, with the language at its core.
Python was the first of [...]

Read Full Post »

Friday, last week, 15th Feb, two of the services I most depend on, failed. Now as it turned out, neither really concerned me at the time, as that same day my brother was taken seriously ill (he’s now doing fine and on the way to recovery). It’s only now I’ve had the time [...]

Read Full Post »

What if you’re a major player in the IT world and suddenly the internet’s equivalent of your local bookshop releases a mould-breaking cloud-based database service, SimpleDB. This is on top of Amazon’s highly acclaimed document data store service, S3!
Well, if you’re IBM you hire Damien Katz the person behind CouchDB. I think 2008 [...]

Read Full Post »

While discussing SimpleDB ,Nick Carr points to the polar opposite views that the two computing behemoths, Google and Microsoft, hold as to the future direction of cloud computing. Google’s Schmidt sees an eventual 90/10 split with the cloud being the home to most data and processes while as expected, Microsoft’s Raikes points to the [...]

Read Full Post »

I’m a database man. I’ve worked on or about most variations on the theme, from roll-your-own flat files, to hierarchical, to CODASYL network databases, to the current crop of relational and MOLAP platforms. Of late, I’ve being investigating what I think will be the future of database technology, the distributed document-centric database. [...]

Read Full Post »

DATA + HMRC = GUBU ?

I was tempted to use the 1980’s era Irish political acronym GUBU (Grotesque, Unbelievable, Bizarre and Unprecedented ) to describe the announcement by Chancellor Darling yesterday of the loss of 25 million UK citizens’ data records. Grotesque yes; bizarre - putting 25 million private records on two un-encrypted CD/DVD disks and sending it to London, [...]

Read Full Post »

I’ve said it before and I’m going to repeat myself; learning Ruby has proven to be a great investment, not so much for the language itself but for the insights it gives into other technologies. As soon as a new ‘cool’ technology or idea hits the street some smart Rubyist is bound to attack [...]

Read Full Post »

You know when you come across something so simple, so obvious and so brilliant you wonder, why didn’t I think of that? Well for personal/small business data backup I’ve just had one of those moments.
CrashPlan is a consumer/SMB orientated backup service following in the footsteps of Mozy (a service I’ve used in the past [...]

Read Full Post »

Let there be no doubt about it, Amazon’s S3 online storage system is wonderful; it’s secure (both from an technology point of view and from Amazon’s status as one of the web’s most trusted sites i.e. one you wouldn’t worry about giving your credit card to), it’s cheap, it’s pay-as-you-go and it has first mover [...]

Read Full Post »

CouchDB - document centric ODS

While the potential of column-oriented DBMSs within BI projects is obvious given the popularity of MOLAP ( a form of column-oriented data store) the potential for the other new kid on the block, the document-oriented database, is less so. One such DBMS,CouchDb, is the latest wunderkid to bubble to the surface, helped by the [...]

Read Full Post »

According to Michael Stonebraker , one of the pioneers of relational database technology, the future of  DBMSs lies with column-oriented databases such as C-Store or Google’s BigTable. In the BI sphere, MOLAP column-oriented data-stores are increasingly the norm. But the fact table implementations of most ROLAP star-schemas tend to favour a row-oriented “wide [...]

Read Full Post »

In Memory OLAP

The consolidation within the BI market continues, this time with the purchase of Applix by Cognos. As Timo Elliott points out, the interesting bit is the Applix TM1 memory-centric OLAP product. For the vast majority of OLAP users (i.e. the millions of Excel Pivot table jockeys) in-memory OLAP is nothing new, but traditionally [...]

Read Full Post »

Moved to blog.gobansaor.com

Over the weekend I transferred this blog over to my own sub-domain, http://blog.gobansaor.com. The blog continues to be hosted by WordPress.com and the old http://gobansaor.wordpress.com addresses will continue to work. Most RSS readers will also gracefully (I hope) handle the transfer of the RSS feed, but if not, you may wish to [...]

Read Full Post »

Web Offline - all data lost!

…just a warning, get a life and get a data-backup strategy

Read Full Post »

Older Posts »