TL;DR: In March 2020, we established the official CNCF’s Special Interest Group (SIG) for Observability. Last week, the TOC vote for the first Tech Lead for this SIG was closed, and results were published. I am proud to announce that I have been elected! ๐Ÿคฉ

In this blog post, I would like to briefly explain what is the idea behind SIGs, what our “newly” created SIG for Observability is for. Last but not the least, I will share why I decided to help and what tech lead role in the SIG context means.

EDIT: Since writing this the CNCF changed the name of all groups from SIG to TAG (Technical Advisory Group).

Special Interest Groups for the Cloud Native Computing Foundation

Around April 2019, the CNCF approved the initiative called Special Interest Groups (SIGs). The main idea behind SIGs is to scale contributions. What does it mean? Well, CNCF is growing rapidly. The current number of projects donated or related to the CNCF is just enormous.

CNCF Landscape: How to manage this open source machine?

The popularity of the CNCF universe is undoubtedly high. The last pre-COVID CNCF conference was in San Diego in 2019, and it was one of the largest. It had nearly 10k physical (!) attendees. Looking at the latest stats, the activity in all CNCF’s Projects on just GitHub (code, reviews, comments) every hour is 500 on average with peaks up to 1k. And that does not even count activity in thousands of open source dependencies, tools and projects related to the main repositories, and all non-GitHub activity like participating or organizing events (meetups, conferences), mentoring, teaching, etc.

Peak of 1k hourly activities on main GitHub repos? That gives a single, meaningful human contribution event every 3.5999928 seconds!

Given the number of projects and activities, before SIGs, there was a major slowdown in the various decisions making, innovations, and organizational activities. Simply put, the TOC model did not scale. Having a dozen experienced and amazing people from different companies that control CNCF space is amazing, but there are just too many topics. There has to be some method of delegating some duties and efforts to further people that are passionate in one of many “cloud” areas, specific to their domain expertise. The similar problem was already solved by the Kubernetes project itself with like “divide & conquer” in a form of creating domain-specific interest groups.

BTW the whole concept isn’t new! The first SIG was created in 1961 for the Association for Computing Machinery (ACM) society.

This is how the various SIGs were slowly created on the CNCF side too! As of September 2020 we have 7 SIGs. Each with some TOC members as the Liaisons:

Name TOC Liaisons
SIG Security Liz Rice, Justin Cormack
SIG Storage Xiang Li
SIG App Delivery Michelle Noorali, Katie Gamanji
SIG Network Matt Klein
SIG Runtime Brendan Burns, Alena Prokharchyk
SIG Contributor Strategy Matt Klein
SIG Observability Jeff Brewer, Brendan Burns

Liaison: a person who acts as a link to assist communication or cooperation between people.

Next to the Liaison, SIG consists of Chairs, Tech Leads, and Members. (Fun fact: Each SIG can also define its own specific roles)

General SIG Objectives

Each of the SIGs in the CNCF has, in general, the following objectives. The objectives you can find on the TOC repo are self-explanatory:

  • Strengthen the project ecosystem to meet the needs of end users and project contributors.
  • Identify gaps in the CNCF project portfolio. Find and attract projects to fill these gaps.
  • Educate and inform users with unbiased, effective, and practically useful information.
  • Focus attention & resources on helping foster project maturity, systematically across CNCF projects.
  • Clarify the relationship between projects, CNCF project staff, and community volunteers.
  • Engage more communities and create an on-ramp to effective TOC contribution & recognition.
  • Reduce some project workload on TOC while retaining executive control & tonal integrity with this elected body.
  • Avoid creating a platform for politics between vendors.

Observability SIG

Around February 2020, most of the CNCF projects had each corresponding SIG created. This means they had better support and for any potential graduations, annual reviews, or initiatives. The number of topics from Observability related projects piled up, so it was about the time, Matt Young started conversations about potential Observability SIG:

First interaction

Long story short, thanks to amazing people listed here, we completed detailed CNCF SIG Observability Charter and got approved by the TOC as an official Interest Group. ๐Ÿค—

I won’t bore you with the details, you can read the full manifest here, but overall our objectives are exactly the same as provided above. The only difference is our specialization for Cloud Native Observability topics and projects related to observability. For example, those hosted currently under the CNCF umbrella:

Project’s under the CNCF in different stages

For comprehensive read-up and TL;DR of SIG Observability, I really recommend Richi’s amazing post on the CNCF blog.

How You Can Reach us?

Everyone is welcome to ask questions / propose topics to talk through! You can do that in many ways:

Why I Proposed to do (Another) Technical Leadership in my ~Free Time.

Let’s be honest, similar to other roles (TOC Members, SIG Chairs, Members, Contributors), being a SIG Tech Lead is just voluntary, free work. So why would you do that, given other amazing things you could do? (For example: Resting during the weekend on Santorini, Greece instead of writing this blog post. Don’t do this ๐Ÿ™ƒ)

  • I think the main reason is that there is just a strong need for work in this area. Someone has to offload TOC members and the CNCF itself in their duties. And observability is arguably one of the most important, tricky, and costly element of every web application (in some cases more expensive than monitored applications itself!)
  • Given my experience and passion to observability, infrastructure & open source I thought that if I can help with something let’s do it fully! (:
  • I have already worked with many CNCF projects, communities, but also some of the staff: the CNCF CTO Chris, Program Manager Amye, our Event Hero Nanci developer advocate Ihor and more. The thing is that working with each of those people is incredibly amazing: fruitful, productive and at the end: fun! On top of this, the CNCF hosts projects I maintain, contribute and use every day (Kubernetes, Thanos, Prometheus and more). It feels just so fun and relevant to contribute more in this space, so I am grateful for this opportunity. The people active in the SIG Observability so far are amazing too, you should meet them as well! After all, it’s all about people.
  • I have seen a good SIG Tech Lead examples in my life and just got inspired by the value they were adding. Here I would love to give a shout-out to ex-colleague Frederic, Tech Lead of the Kubernetes SIG Instrumentation, who mentored me a bit and introduced in 2019 to Instrumentation SIG. While I was mostly passive there, I learned a lot from Frederic. Seeing good tech leading skills in SIG contexts, gave me motivation to do similar in the CNCF space.
  • Last but not the least, I am grateful to work at Red Hat which instead of punishing me for spending bit of my work time on SIG contributions (like 98% close source companies would do), this company actually highly value contributions like this, giving me room to do technical leadership outside of the Red Hat and core projects we maintain as well! ๐Ÿ’ช

Red Hat value, seen with my own eyes & definitely no joke: To be the catalyst in communities of customers, contributors, and partners creating better technology the open source way.

So… What’s the Plan?

Don’t get me wrong, overall I can have many ideas, but in the end, it’s really up to the community and all the SIG members what we can achieve! (:

Technically SIG Tech Lead duties are:

  • To support projects in the SIGโ€™s area.
  • To have the time and ability to perform deep technical dives on projects. Projects may include formal CNCF projects or other projects in the area covered by the SIG.

It’s really more about the hard work and the responsibility than any major decision making. And that’s actually quite nice as I would love to stay out of politics as much as possible. (: Instead, let’s talk about actions, usability, and the things that we can improve for better… observability!

As I mentioned, overall it’s all about where the SIG members will focus, however since you are still reading this ๐Ÿ˜ˆ, I can briefly inject some cool stuff that we can improve in the CNCF Observability world!

I mentioned most of the things in my nomination doc here, but let’s go quickly through some of those:

Supporting Projects

The key responsibility of the SIG Observability is to help the CNCF projects, to provide guidance for any technical decisions and support the areas where they need help. Additionally, for a project to proceed through different stages (Sandbox, Incubated, Graduated); it has to fulfil a couple of, more or less, strict rules (see Due Diligence doc). Those rules are actually quite solid, they aim for project growth, fairness, reliability, and open-source values. My role is to review, point out gaps, and help to resolve those.

Action Item: If you are a member / contributor of the CNCF project within SIG Observability OR you have any questions related to observability for any other open-source project (e.g Kubernetes) reach us!

Connecting Passionated People; Sharing Knowledge Between Projects

Depending on the origins of the projects, the CNCF projects collaborate with each other. Some more, some less. For example, both Cortex, Thanos, and OpenMetrics projects were created by Prometheus maintainers, thus it’s natural we share some code pieces and patterns. Overall we work and communicate with each other a lot!

This, however, is not always the case between other projects; even though we are all hosted under the CNCF. The truth is, that there is a huge potential of helping each other much more than what we have now. For example, while maintaining a big open source project like Thanos, there are many things we had to invent or build from scratch to suit our additional needs, such as:

  • A suite of static analysis tools and Go style guide
  • Auto-generated documentation (recently shaping it out of our bash into mdox tooling).
  • Development tooling e.g bingo,
  • Multi-arch artifact building e.g promu,
  • End-to-end test frameworks (we share an awesome e2e library with Cortex),
  • A website with versioned documentation, search and blog post space,
  • Arrangements and best practices for Mentorship/Internships e.g Student Office Hours,
  • A solid Open Governance model,
  • Various configuration practices and methodologies,
  • Plugins,
  • And marketing activities (Active Twitter account, Talks),

…and much more! This, sometimes annoying, stuff is vital for the project to be usable, reliable, and easy to start with. So why almost every project rebuild these tooling and facilities from scratch rather than reuse them? (: It would be nice to get together and learn from each other, especially when we are part of the same foundation and domain! (:

Action Item: If you are active in a CNCF project and need help, try to check out if a similar problem has already been solved in another project! You can use SIG Observability space for this freely, so reach us!

Be Open Minded for Outside World

Another point I want to highlight is that, as the SIG Observability, we are not limited to a few hosted projects only. There are certainly missing pieces``` in cloud-native observability portfolio. Things that we could learn. Projects that already comply, integrate or even are already extremely useful for the whole observability journey in the CNCF ecosystem but not directly under the CNCF governance. I am looking at you: Grafana, Loki and ConProf!

It would be awesome to maintain a good relationship with such projects. Allow even better integration with them, communicate more, and help each other!

Action Item: If you feel that some projects or initiatives are extremely useful, however missing in the CNCF Observability portfolio, or something we can learn from, please reach out to us as well!

Summary

View from Pyrgos village on Santorini island, Greece

I hope with this blog post, you have learned a bit what this magical SIG concept is all about. And how YOU can help within the domain you are passionated about!

Thanks to all who voted for me and especially thanks to Richi for some epic mentoring in this area ๐Ÿ’ช

See Ya on our SIG Observability Meetings ๐Ÿ‘‹