Palantir Finance Applied to Log4J Data
August 26th, 2009 |
In a previous post, Eric W. covered how we analyze polled system health information. Now we’ll look at pushed information, in the form of logging events.
Use Cases & Constraints
We decided on three kinds of questions we wanted to answer:
- What is the health of the deployment?
- Example: What errors have occurred in the last 24 hours?
- Which parts of the platform are our users engaged with?
- Example: How much time do users spend in each application?
- How is our server performing over time?
- Example: What is the average wait on a search query?
The chief constraint was that we build our platform on Log4J. We already use Log4J all over the project, so converting the logging was out of the question. Besides, Log4J provides a guideline for the kind of metadata our events should support, and Log4J makes it easy to record events to a database.
That left us with two problems to solve: how to store structured data with a Log4j message, and how to analyze the collected data.
Analysis is the easy part: just use Palantir! After all, a sequence of logging events has a lot in common with a time series. The rest is explained below.




