InfoSphere Streams version 3.2.1 was released last week and contains some enhancements to the Streams Studio development environment. The enhancements are: get all logs action, exported streams content assist and Streams Explorer information and parallel region identifier in graphical editor.
With the release of InfoSphere Streams 3.2.1, a new collection of operators to support reading from and writing to the Hadoop Distributed File System (HDFS) are available. The new HDFS2 operators add a range of new features while continuing to support the same functionality that is available in the existing HDFS operators.
I was thinking today that a tip of week blog would be useful. I decided to ask some people in the Streams development organization what a good tip of the week would be. I did this for a couple of reasons, one so I could find people to help me keep this going with a new tip every week and secondly to see if I would learn anything. The first person I asked told me something I did not know.
We are always looking for ways to make Streams easier and more productive for our developers! Streams Quick Start edition was the first step — we wanted to make Streams available at no charge without technical limitations so you can prove how much value it brings you and your company. The streamsDev community was the second step — we wanted to have a place to share deep technical content and exchange ideas with an interactive forum. But now we are going much further!
We have started an organization on GitHub called IBMStreams. This will be a place for our rapidly growing developer community to create, share, and extend Streams toolkits.
This lab uses the Streams toolkit com.ibm.streams.rproject to execute an R script on tuples flowing through a Streams application. The lab is based on a fairly simple network flow data (NetFlow) example.