
Thursday, November 27, 2014

Students of the ETL and DW Course Using Kettle/PDI (São Paulo, November 25, 26 and 27, 2014)

Class photo of the students of the ETL and Data Warehouse course using Kettle/PDI (Pentaho Data Integration), held in São Paulo on November 25, 26 and 27, 2014.

Congratulations to the students Marcus Vinicius de Oliveira, Flavio Roberto Freitas Senos (Carioca), José Augusto Peregrino da Silva, Renato Augusto Souza Martins and Sergio Luiz Ferreira Maia.

Success with Kettle/PDI in your projects!!!

Tuesday, November 18, 2014

PCM14 – Pentaho Community Meeting 2014 in Antwerp, Belgium

Hi folks,

The guys from did a great job writing about the PCM14 – Pentaho Community Meeting 2014 in Antwerp, Belgium in their blog.

See all the comments below about the community meeting event:

Pedro Alves

Pedro was the lucky one to open the Pentaho Community Meeting 2014. He started by talking about the new features in version 5.2.

First of all, the new features in the marketplace.

The marketplace has been completely rewritten and some constants have been added, mainly so that customers can see the difference between Pentaho-supported and community features. Plugins supported by Pentaho are marked in blue; community plugins are marked in green. The CTools and Sparkl plugins are also already included in the new version.

A new feature is the web marketplace, where you can browse all available plugins for the server or PDI. The goal of this new site is to let users view all plugins even when they are not on a server.

Web Marketplace


The PRD crosstabs story got a major reset: in version 5.2, crosstabs are working!

New Kettle step

A new Kettle step was developed to make integration with the platform easier. The new step is called ‘Call endpoint’. From the step you can point to the Pentaho web server with a username and password, through which you can communicate with the platform, for example to retrieve schedule and role information.
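To make the idea of calling a platform endpoint with a username and password more concrete, here is a minimal Python sketch. It is not the ‘Call endpoint’ step itself. The server address, the endpoint path, the credentials and the use of HTTP Basic authentication are all assumptions made for illustration. The request is only built, not sent.

```python
import base64
import urllib.request


def build_platform_request(base_url, endpoint, username, password):
    """Build (but do not send) an authenticated GET request for a server endpoint."""
    url = base_url.rstrip("/") + "/" + endpoint.lstrip("/")
    token = base64.b64encode(f"{username}:{password}".encode("utf-8")).decode("ascii")
    request = urllib.request.Request(url, method="GET")
    # HTTP Basic authentication is an assumption made for this sketch.
    request.add_header("Authorization", "Basic " + token)
    return request


# Hypothetical server address, endpoint path and credentials.
req = build_platform_request(
    "http://localhost:8080/pentaho", "/api/example/roles", "admin", "password"
)
print(req.full_url)
```

Passing the request to `urllib.request.urlopen` would send it to a running server.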

Community startup tabs

CST will be available as a plugin in the marketplace.

Two new features were added:

  • The option to have a launcher page with the startup tabs instead of having them open automatically
  • Integration with the favorites feature

CTools improvements

Bootstrap is now the default layout system for CDE dashboards. The biggest advantage is the ability to rescale to different devices. The CDE layout has been improved to work better with Bootstrap, for example with the Panel. To show how well Bootstrap works, Pedro’s entire presentation was a CDE dashboard: when he resized the presentation, everything scaled well.

Jens Bleuel
Kettle Telemetry and Usage Statistics

Jens is gathering information about what’s happening in the field. He created a job to analyze transformation and job entries. The idea is that you download the transformation from GitHub, run it, and send him the results anonymously. He is still experimenting with the possibilities and the gathered results.

PDI Telemetry

Roland Bouman
Roland talked about his Pentaho Analysis editor: Phase. There are a lot of challenges with all the external model editors, mainly the communication and the alignment between the editor and the server. When using the internal editors, you will also encounter difficulties or reach the limits of the editor.

Roland created Phase, an internal editor, which makes things easy for the end user. You can download Phase from the marketplace. Phase looks a lot like Schema Workbench but is far less complex. Complicated information about the cube is removed. For example, the join statements are removed from the cube and assembled in an overview of the database connections.

In Phase you can get an instant overview of your schema via the tab Diagram. This tab provides a way to view the total schema and edit it at the same time.

Phase is not fully finished; Roland is still working on several things, such as the Annotation and Properties tabs.

Will Gorman
Architectural Vision for 6.0 and Beyond + Pentaho Engineering Update

Will talked about Pentaho's roadmap for the coming years. The main goals are:

  • Blending two worlds together
  • Tailored Visual experiences
  • Ongoing Product investment supporting Big Data Blueprints
  • Governed Data Delivery
  • Big Data integration@Scale

In the future they want to move Pentaho toward a simpler architecture that is more scalable and more customizable by the user, with a strong focus on user experience.

Matt Casters
Matt hosted a user input session to discover the missing components and cracks in Kettle. The main goal is to document these cases and file Jira cases to improve Kettle.

The example Matt gave is mixed row layouts: when you combine two steps with different layouts, Kettle only throws an error when you create the hop. It doesn’t throw an error if the hops already exist and you then add extra fields.

Mixing row layout
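The kind of check that is missing here can be sketched in a few lines of Python. This is not Kettle code. The function name, the step names and the field lists are hypothetical. The sketch only illustrates comparing the row layouts of all incoming steps, a check that could be repeated whenever a field is added rather than only when a hop is created.

```python
def layout_mismatches(layouts):
    """Compare the row layouts of several incoming steps against the first one.

    `layouts` maps a step name to its ordered list of (field_name, field_type).
    Returns a list of human-readable problems; an empty list means they match.
    """
    problems = []
    items = list(layouts.items())
    if not items:
        return problems
    ref_name, ref_fields = items[0]
    for name, fields in items[1:]:
        if len(fields) != len(ref_fields):
            problems.append(
                f"{name}: {len(fields)} fields, but {ref_name} has {len(ref_fields)}"
            )
            continue
        for position, (expected, actual) in enumerate(zip(ref_fields, fields)):
            if expected != actual:
                problems.append(
                    f"{name}: field {position} is {actual}, but {ref_name} has {expected}"
                )
    return problems


# Hypothetical steps: the second one gained an extra field after the hop existed.
incoming = {
    "Table input": [("id", "Integer"), ("name", "String")],
    "CSV input": [("id", "Integer"), ("name", "String"), ("city", "String")],
}
for problem in layout_mismatches(incoming):
    print(problem)
```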

Some users suggested improvements for Kettle based on their experience:

  • Saving the comment in the Kettle property file
  • Order of elements in metadata
  • Life cycle management
  • The time it takes to create a note
  • More OS X support
  • Test sets
  • Editor for the JNDI file

Matt is putting in a lot of effort so that Kettle meets the requirements of its users.

Uwe Geercken

Rule engine PDI Step and Adjacent Tools

Uwe developed the rule engine step, which you can download from the marketplace.

The rule engine can be used to filter data. The step takes the data and checks it against all defined rules without adding additional rows. The result is added directly to the row itself.
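As a rough illustration of that behaviour, the following Python sketch checks each row against every rule and writes the outcome back onto the row, so the number of rows stays the same. It is not Uwe's rule engine. The rule names, the `failed_rules` field and the sample data are hypothetical.

```python
def apply_rules(rows, rules):
    """Check every row against every rule and record the outcome on the row.

    No rows are added or dropped; each row gains a `failed_rules` field.
    """
    checked = []
    for row in rows:
        failed = [name for name, predicate in rules.items() if not predicate(row)]
        checked.append({**row, "failed_rules": failed})
    return checked


# Hypothetical rules and data, for illustration only.
rules = {
    "age_is_positive": lambda row: row["age"] > 0,
    "name_not_empty": lambda row: bool(row["name"].strip()),
}
rows = [{"name": "Ana", "age": 31}, {"name": " ", "age": -4}]
result = apply_rules(rows, rules)
for row in result:
    print(row)
```

A downstream filter can then keep only the rows whose `failed_rules` list is empty.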

Each rule engine step is based on a rule group with different rules and rule groups. These rules can be maintained with an online maintenance tool.

The rule engine can also be used to check data quality and data updates. It can be used in several places: PDI, a web application, a standalone server …

Joel Latino

Ivy Git Sync

Joel created a Git integration for all BI solutions. There are different kinds of Git steps: a Git info step and a Git operations step.

The plugin is built in Sparkl with some Java endpoints. It has two advantages: the client can use the different tags to install the right version, and the techie can easily roll back.

Thomas Morgner

Simple reports fastified – designing reports for lightning fast exports

Thomas talked about the main improvements in PRD:

  • Instant query meta-data at design-time
  • Non-blocking design time queries for slow mode
  • Bulk-data exports (aka fast-mode)

With these improvements, reports are now 5 times faster. Over the next few releases everything will be stabilized.

Raimonds Simanovskis

Raimonds, from eazyBI, uses JRuby, Ruby on Rails, mondrian-olap and many other technologies for his business intelligence development. In his session he showed us that it is possible to create an entire MDX query or define a cube in JRuby. In this cube you can define the cells, properties and members.

Raimonds showed us some other ways to create an MDX schema or a cube, first in JavaScript and then in CoffeeScript. CoffeeScript is a little language that compiles into JavaScript, but it’s simpler than JavaScript.

Marcello Pontes

Sparkl Plugin – Analysis Security Made Easy

Marcello created a plugin intended to provide a dynamic security layer for the Mondrian OLAP engine.

With this plugin, developers can use the ETL tool to customize the access level. At the moment it’s not yet in the marketplace, but it’s already available on GitHub.

You can maintain the security for each dataset via a dashboard.

You can change the file to add a property for the username, so you can define security for your own users.

The main advantages of the plugin are:

  • Session variables to customize the result set or XML
  • No need to change the Java code
  • More flexibility on the datasource

André Simões
Flume was originally a log collector, aggregator and housecleaner. André presented a driver that provides integration between Pentaho Data Integration and Apache Flume. The plugin provides a way to integrate Flume without writing your own Java code.

Flume has three parts: a source, which converts a log file; a channel; and a sink, which is the output.
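The source, channel and sink flow can be pictured with a small Python sketch. It uses neither Apache Flume nor the kettle-flume-driver. The function names and the log line format are assumptions chosen purely to illustrate how events move from one part to the next.

```python
import queue


def source(log_lines, channel):
    """Convert raw log lines into events and put them on the channel."""
    for line in log_lines:
        level, _, message = line.partition(" ")
        channel.put({"level": level, "message": message})


def sink(channel, output):
    """Drain the channel and write each event to the output."""
    while not channel.empty():
        output.append(channel.get())


channel = queue.Queue()  # buffers events between the source and the sink
delivered = []
source(["INFO job started", "ERROR step failed"], channel)
sink(channel, delivered)
print(delivered)
```

The channel decouples the two ends: the source can keep accepting log lines even if the sink is temporarily slower.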

You can download the kettle-flume-driver from GitHub. So far it’s only a proof of concept.

Nelson Sousa
A Generic ETL Machine – The Mother of All ETLs

Nelson was tired of always developing the same ETL, so he developed GEM. GEM is a framework to develop ETL tasks, a flexible and fast loader … You still need to understand PDI before starting with GEM.

GEM is not finished yet: it has no front end, it isn’t Hadoop-ready …

GEM uses a property file where you describe your database, mail and other properties. The properties are used to execute the generic ETL. Therefore you can easily switch between your input steps, queries and other settings.
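A property-driven setup of this kind can be sketched as follows. This is not GEM's actual file format. The property names (`db.host`, `mail.to`, `input.query` and so on) are hypothetical, since the real keys are not given here. The sketch only shows how one generic job can change behaviour by reading different values.

```python
def load_properties(text):
    """Parse simple `key=value` lines, skipping blank lines and `#` comments."""
    properties = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        properties[key.strip()] = value.strip()
    return properties


# Hypothetical property names; the real GEM keys are not given in this post.
EXAMPLE = """
# database
db.host=localhost
db.name=staging
# mail
mail.to=etl-team@example.com
input.query=SELECT * FROM orders
"""

props = load_properties(EXAMPLE)
print(f"Loading from {props['db.name']} on {props['db.host']}")
print(f"Running: {props['input.query']}")
```

Switching to another source then means editing the property file rather than the ETL itself.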

Diethard Steiner

Pentaho Sparkl – Supercharge your Business Analytics Server

Diethard created the Bissol Table Data Editor in Sparkl. The plugin allows you to update and add data to small database tables via an easy-to-use web interface. It’s not only a data editor: the plugin also allows you to alter the table definition.

Table data editor

Pedro Teixeira

Event Bus

Pedro developed a plugin that provides the ability to push notifications to the BA server. The plugin is an EventListener-based mechanism with a publish/subscribe style.

Pedro built this as a Sparkl plugin; at the moment, development is still in progress.

The plugin lets you send different push notifications to any of the users on the server. The notifications can take various forms, such as an email, a Twitter message or a normal notification.
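For readers unfamiliar with the publish/subscribe style, here is a minimal in-process sketch in Python. It is not Pedro's plugin. The `EventBus` class, the topic name and the two listeners are hypothetical, and real delivery by mail or Twitter is replaced by appending to a list.

```python
from collections import defaultdict


class EventBus:
    """Minimal in-process publish/subscribe bus."""

    def __init__(self):
        self._listeners = defaultdict(list)

    def subscribe(self, topic, listener):
        """Register a callable to be invoked for every event on the topic."""
        self._listeners[topic].append(listener)

    def publish(self, topic, payload):
        """Call every listener registered for the topic; return how many ran."""
        listeners = list(self._listeners.get(topic, ()))
        for listener in listeners:
            listener(payload)
        return len(listeners)


bus = EventBus()
inbox = []
# Two hypothetical notification channels subscribed to the same topic.
bus.subscribe("report.finished", lambda payload: inbox.append(("mail", payload)))
bus.subscribe("report.finished", lambda payload: inbox.append(("notification", payload)))
count = bus.publish("report.finished", {"user": "admin", "report": "sales"})
print(count, inbox)
```

The publisher does not need to know which channels exist; adding a new kind of notification only means adding another subscriber.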


Thursday, November 13, 2014

How to Install Pentaho CE Audit Plugin on Pentaho BI Server CE 5.2

What is Pentaho CE Audit Plugin?

It is an open source project that helps Pentaho BI Admin users to visualize statistics like:

  • Who is logging in the BI Portal (Pentaho BI Server)?
  • When are the users using the BI Portal?
  • Which content is accessed the most?
  • And a lot of other information about the use of the BI Portal.

Learn more
Feel free to use it in a production environment if it works correctly for you.

This plug-in automatically applies an environment configuration. If you want to know more about it, please visit the link

For Pentaho BA Users, please use Pentaho Marketplace to install it.

If you need support, please feel free to create an issue here; we will do our best to help you.

Pentaho CE Audit Plugin is free and open source software. It is available under the terms of the Apache License Version 2. A copy is attached for your convenience.


Microsoft fixes serious flaw that has affected Windows versions for 19 years

Dear reader,

How many times has someone asked you whether the system you developed is 100% secure, or whether the system you use is 100% secure?

Well, I am constantly put on the spot, and the answer I give is always the same: there is no such thing as a 100% secure system.

Putting software into production is a huge challenge, and I believe nobody manages to get something running that is 100% secure. Even our electronic voting machines are being investigated...

Today I came across a news story with the title: "Microsoft fixes serious flaw that has affected Windows versions for 19 years".

That's right: the mighty Microsoft, beloved by many people, fixed a major security flaw that left Windows exposed for 19 years!!!

Seriously, 19 years!!! It sounds like a joke but it isn't: this flaw really was found by IBM and was fixed today, after 19 years.

So if you use Windows as a desktop or server, the big day has come to fix this flaw by updating your Windows through Windows Update.

To learn more, see:

Does anyone know of another company that took 19 years to fix a security flaw, and, what's more, one so serious that it left servers and desktops vulnerable for years?