← Blog Home
June 30, 2014
One of Pentaho’s great passions is to empower organizations to take advantage of amazing innovations in Big Data to solve new challenges using the existing skill sets they have in their organizations today. Our Pentaho Labs’ innovations around natively integrating data engineering and analytics with Big Data platforms like Hadoop and Storm have already led dozens of customers to deploy next-generation Big Data solutions. Examples of these solutions include optimizing data warehousing architectures, leveraging Hadoop as a cost effective data refinery , and performing advanced analytics on diverse data sources to achieve a broader 360-degree view of customers . Not since the early days of Hadoop have we seen so much excitement around a new Big Data technology as we see right now with Apache Spark . Spark is a Hadoop-compatible computing system that makes big data analysis drastically faster, through in-memory computation, and simpler to write, through easy APIs in Java, Scala and Python. With the second annual Spark Summit taking place this week in San Francisco, I wanted to share some of the early work Pentaho Labs and our partners over at Databricks are collaborating on to deeply...
June 25, 2014
I heard a news story on the radio today about stock markets going quiet during World Cup events, especially when the home country is on the field. This made me think about how live activities affect the major markets. My colleague Bo Borland at Pentaho posed an interesting question on this topic just yesterday at MongoDB World in New York, “ Do real time Tweets have an affect on the stock markets ?” Working for a Big Data integration and analytics company, Bo of course used Pentaho tools to see if there was indeed a correlation. A cool idea, but what resulted was even cooler than I’d imagined…. Using Pentaho Data Integration, Bo easily pulled minute-by-minute stock tick data which is highly structured, and blended it with unstructured Twitter data. Next, he pushed the blended data into a MongoDB collection to take advantage of its flexibility. (Note: Bo is also the author of Pentaho Analytics for MongoDB ). Taking the integration and analysis a step further, he scored the tweet sentiment by including a Weka predictive algorithm as part of the data ingestion process from Twitter. Once the data was...
June 23, 2014
You can’t predict tomorrow with yesterday’s tools. At Pentaho, this has been a core tenant in staying nimble and innovating in this disruptive market. Today, at MongoDB World in New York, we announced Pentaho Business Analytics 5.1 , a culmination of speed of innovation and community and customer engagement. Pentaho 5.1 supports our ongoing strategy to make big data analytics faster—at scale—and easier and more accessible for more users. The most powerful insights are revealed when Big Data can be accessed and blended data at the source. 5.1 enables users to do this in a seamless way eliminating the need for specialized set of skills and bridging the data-to-analytics divide. Our recent Data Science Pack blog post, references analyst research estimating that the top two time-consuming big data tasks are solving data quality and consistency issues (46%) and preparing data for integration (52%). We know a huge amount of resources are spent just getting data ‘ready’ to discover the greatest land mine or gold mine of data. In 5.1 we are streamlining the big data process and making big data a reality for all with three innovations including: Direct analytics...
June 23, 2014
One of my favorite aspects of being CEO of Pentaho is the opportunity to talk to our customers around the world. Innovative and motivated individuals and teams are turning data into value and making a major impact for their organization, and in some cases for the better of society. We are proud to announce first annual Pentaho Excellence Awards to recognize and honor our customers and users, rewarding those that have deployed Pentaho technologies in impressive and innovative ways. The Pentaho Excellence Awards offer an opportunity for you and your team to receive industry recognition for your expertise in analytics and big data deployments and thought leadership. While we know your teams are busy helping to make faster and smarter business decisions, here is the link to more information about the Pentaho Customer Excellence Awards and our short nomination process: http://bit.ly/PWorldPEA . Nominations are open until July 11th . A panel of expert judges will pick a winner in six different categories. Category winners receive a free pass to PentahoWorld in Orlando October 8-10, 2014, along with several additional unique opportunities at the event such as a VIP dinner, speaking...
June 18, 2014
The past few weeks we’ve been giddy with excitement about several awards we've received celebrating our big data technology and how customers are applying it to reap big benefits. The latest awards Pentaho along with our customers have added to our growing trophy case include: The CRN Big Data 100 list identifies vendors that have demonstrated an ability to innovate in bringing to market products and services that help businesses work with big data. Pentaho is proud to be named to the Big Data 100 list for the second year in the business analytics category . The award noted Pentaho’s record 83 percent bookings growth in 2013 for big data and embedded analytics products. In addition, the addition of Christopher Dziekan, previously head of analytics product strategy at IBM as Pentaho’s new Chief Product Officer. Each year the SD Times 100 recognizes companies, non-commercial organizations, open source projects and other initiatives for their innovation and leadership. Judged by the editors of SD Times, the SD Times 100 recognizes the top innovators and leaders in multiple software development industry areas. Pentaho was selected as a top 10 leader for Big Data...
June 16, 2014
Once upon a time, (not so) long ago in 2004, two young technologies were born from the same open source origins – Hadoop and Pentaho. Both evolved quickly from the market’s demand for better, larger-scale analytics, that could be adopted faster to benefit more players Most who adopt Hadoop want to be disruptive leaders in their market without breaking the bank. Earlier this month at Hadoop Summit 2014 , I talked to many people who told me, “I’d like to get off of <insert old proprietary software here> for my new big data applications and that’s why we’re looking at Pentaho.” It’s simple – no company is going to adopt Hadoop and then turn around and pay the likes of Informatica, Oracle or SAS outrageous amounts for data engineering or analytics. Big data is the asteroid that has hit the tech market and changed its landscape forever, giving life to new business models and architectures based on open source technologies. First the ancient dinosaurs ignored open source, then they fought it and now they are trying to embrace it. But the mighty force of evolution had other plans. Dinosaurs are...
June 10, 2014
We’re thrilled to announce that registration is open for PentahoWorld , our first global conference that brings together Pentaho users, advocates, and partners to help each other solve challenges around data integration, big data, and embedded analytics. You can register here , but if you need a bit more convincing, here are a few of the top reasons we’re excited about PentahoWorld – and why you should be too. Getting on top of industry trends – PentahoWorld will be a unique gathering of hundreds of people who, every day, are solving challenging problems on the cutting edge of a rapidly changing data landscape. With all those brilliant minds in one room, you’ll learn about solutions they’re crafting today, what they see coming in the future, and what you should be thinking about for your own company. And we have no doubt you’ll contribute some unique insights of your own. Meet the experts – If you’ve got questions, this is the place for answers. Between Pentaho product experts, power users, advocates, community leaders, and people who’ve applied Pentaho in every imaginable way, we can’t imagine a denser concentration of Pentaho expertise...
June 6, 2014
While most of this year’s Hadoop Summit sessions still conveyed ‘developer conference,’ rife with command-line driven demos and Java, Scala, and Python code snippets, I noticed the ‘commercial’ uniform of khakis, blazers and Docksiders starting to creep in. Indeed, the themes I noticed most at the Summit were “enterprise ready” and “next-generation data platform.” So if the Summit’s days as an all-out geekfest are history, what does this say about Hadoop? I happen to think it’s great news: it says Hadoop is going mainstream and being embraced as core to the enterprise data platform. Nothing drives this home more convincingly than the fact that the Hadoop “enterprise ready” ecosystem has exploded from less than ten vendors five years ago to more than 80 vendor sponsors at this year’s show. In this our fifth year sponsoring the Summit, we were just as pumped as we were attending and sponsoring our first Hadoop Summit back in June 2010 right after the launched our first Hadoop product set . This year saw a record crowd (3,200+ attendees from 1,100 different companies), informative breakout sessions, fun parties, and lots of energy and passion throughout...
June 3, 2014
If you are or have a data scientist in house you’re in for good news. Today at Hadoop Summit in San Jose, Pentaho unveiled a toolkit built specifically for data scientists to simplify the messy, time-consuming data preparation, cleansing and orchestration of analytic data sets. Don’t just take it from us... The Ventana Research Big Data Analytics Benchmark Research estimates the top two time-consuming big data tasks are solving data quality and consistency issues (46%) and preparing data for integration (52%). That’s a whopping amount of time just spent getting data prepped and cleansed, not to mention the time spent in post processing results. Imagine if time spent preparing, managing and orchestrating these processes could be handed off to a personal assistant leaving more time to focus on analyzing and applying advanced and predictive algorithms to data (i.e. doing what a data scientist is paid to do). Enter the Pentaho Data Science Pack, the personal assistant to the data scientist. Built to help operationalize advanced analytic models as part of a big data flow, the data science pack leverages familiar tools like R, the most-used tool for data scientists and...