Cube design - ROLAP considerations vs. MOLAP

2k views Asked by At

Does anyone have resources that give a list of things to consider when designing a ROLAP cube, as opposed to MOLAP (I'm doing it in Pentaho, but I guess the principles are not dis-similar for other implementations). For example, I'm thinking of things like:

  1. should extra transformational work be done at the ETL stage to reduce computational work when querying the cube?

  2. should all my dimension tables be in the same database as my cube?

3

There are 3 answers

0
Feris Thia On BEST ANSWER

I'm a Pentaho implementor in Indonesia. First, of course you should try to aggregate all your measures group by surrogate keys involved.

And in Mondrian, you can "cache" some computations using additional aggregate tables. You can do it in Pentaho Aggregate Designer. But after that you will need additional work in your data warehouse / ETL stage.

Regards,

Feris

http://pentaho-en.phi-integration.com

0
davek On

Thanks to Feris for the link and input, but in the end I went for this book:

http://www.amazon.com/Pentaho-Solutions-Business-Intelligence-Warehousing/dp/0470484322/ref=sr_1_1?ie=UTF8&s=books&qid=1258408259&sr=8-1

I had a good long look at the Mondrian site + docs, but the book seems more comprehensive.

1
KenFar On

First off - the designs are similar but they are driven by different performance & scalability strategies.

Secondly - the etl process is pretty much the same. Except - you'll typically see a lot more data in a rolap cube than a molap cube because of scalability features in relational databases. And you'll often see a rolap cube within a non-rolap database (warehouse, even transactional database) that does more than just support rolap.

Lastly, you'll typically generate aggregate table if you've got much data volume. That aggregation can be done a lot of different ways, but I'd say it is not typically driven by your ETL process unless you lack the ability to manage a separate asychronous process or have data volumes that make it impractical to run period summary jobs.