Protocol Buffers (Protobuf) is a platform neutral serialization library implemented and widely used by Google. However, once you start working with Protobuf you quickly realise that the trade-off for high performance serialization means having to accept severe restrictions in the way you are able to model your data, the tendency for Protobuf constructs to 'leak out' all over your codebase and new challenges in terms of managing data model version evolution over time. By combining...

Low data quality creates significant costs to large and small enterprises alike. Not only does malformed data have the potential to cause impact on production systems, it is also very expensive to manually correct, whilst at the same time severely hampers the effectiveness of advanced techniques such as machine-learning and other innovative uses of enterprise data. The causes of low data quality are varied and myriad. Typically these result from fragmentation and a large number...

A key TeraHelix offering is the ability to integrate and run third-party analytics libraries as a standard part of the platform. So rather than taking the data to the code, TeraHelix allows you to bring your code to the data. There are many analytics library choices out there, however in this blog we will explore a concrete integration example using OpenGamma’s excellent open source Strata library. OpenGamma’s Strata Strata is an open source analytics and...

The design and implementation of new web APIs and applications require security to be built-in from the start rather somehow being engineered in later as an afterthought. Yet, when one starts development, is not always that straightforward to set up your local environment such that you can be confident that what 'works on my machine' will be translated to when the application is deployed in an environment. Local Development Security Pitfalls Most developers typically start...