Skip to main content

What is my thesis all about

The last weekend I discussed in a forum what my thesis is about. I spent some time in the answer, so why not posting it in my blog to give people a better idea what I'm working on:

When users first see Palo and Pentaho, they don't really know there is such a big difference, both claim to be a BI suite. It was the same for me, when I first started my thesis.
In my thesis I'm working with Palo OLAP Server, Palo Worksheet Server and also with Palo ETL Server compared to the Pentaho BI Suite (without Weka), otherwise it wouldn't be a good comparison. I created a test scenario, and try to provide a solution with both Palo and Pentaho.

The focus of my thesis is not to say, Palo is good, Pentaho isn't or the other way around. the result might be "if you focus on planning, you better use Palo, if you do lots of reporting, Pentaho provides more options,...

The scenario is more the practical side of an implementation: How to get data into the system (ETL), how can you design your data (Modeller, Schema Workbench, Metadata Editor,...), how can you create reports, especially parameterized and time related ones, how can you manage users and groups and their rights, do the tools provide planning posibilities. Are there special features? Since there is so much stuff to look at I can't always go into all the details of course.


Popular posts from this blog

Pentaho Data Integration - Multi-part Form submission with file upload using the User Defined Java Class Step

I recently needed to use Pentaho Data Integration (PDI) to send a file to a server for processing using HTTP Post. I spent several hours trying to use the existing steps HTTP Post, HTTP Client & Rest Client but I couldn't get it to work. After some more research I came across the issue PDI-10120 - Support for Multi-part Form Submittal In Web Service Steps  and I thought I was out of luck. I previously wrote a small Java client for a similar use case and remembered the PDI has a step called User Defined Java Class  (UDJC). After reading this great tutorial I created the following basic transaction. I have a dataset with the URL and the full file path and use the UDJC to make the HTTP call. HTTP Post using User Defined Java Class The Java class handles the actual HTTP Post. It uses 2 input variables, the URL (url) which is used for the call and the file name (longFileName). The HTTP call then contains the file (line 30) and the file name (line 31). I included some basi

Products you don't expect to be 'Made in China' - Del Monte fruit cups

Since I moved to Canada back in March I have started to realize how many products are actually made in China. Back in Germany you could also buy lots of stuff from China but you mostly had the choice between German or Europe products and Chinese products. When I went to Food Basics in Oakville a couple weeks ago to get some apples I stood in front of a huge tray of Chinese apples! Aren't there enough apples in Ontario, Canada or the US? Even Mexico would probably be closer than China. Another day my wife bought Del Monte fruit cups in the grocery store. I checked the label when I was going to eat it and i decided to leave it in the fridge. First of all it is 'Made in China' (again I guess no other country in this world has fruit) and second it contains artificial flavor. How bad must the fruit inside be that you need artificial flavor (and does anybody in China controls how it is made)? For my part I'll check the labels more closely whenever I buy any kind of product

Connect Facebook & (almost) everything from Google

Yesterday I tried to figure out how to post my blog posts on my Facebook profile. First I tried different application but all they could do was adding a box on your profile. After searching for on google for a couple minutes I found one video on youtube that lead me the way: Import Blog/RSS in Facebook profile . You can do it even easier as in the video. When you are on your profile in Facebook, you just have to click on Import (as shown in the picture) and you can import not only a RSS feed from your blog, but also integrate lots of different websites like Youtube, Google Reader and most important for me: Picasa. Finally you can use all the different great tools from Google and integrate them all into one Profile. UPDATE: Finally there is also a blog post in the offical blog from blogger about the topic: Blogger Buzz: Facebook Your Blog