Embrace Risk
A newsletter for automation, reliability engineering practices and culture, scalability, availability, incidents and more!
Sign Up To This NewsletterRecent Summaries
Embrace Risk | Revison 106 | December 23, 2024 | Opened: 0
Takeaways:
-
Observability in Tech: Understand how tech giants leverage eBPF to enhance observability metrics.
-
Kubernetes Testing: Explore the application of Testcontainers with Python for effective end-to-end testing within Kubernetes environments.
-
KEDA Implementation: Learn about scaling applications to zero on Google Kubernetes Engine using KEDA.
-
Kubernetes Updates: Familiarize yourself with Kubernetes v1.32’s new CPU Manager static policy for strict CPU reservation.
-
Data Engineer Strategies: Delve into best practices for testing, monitoring, and observability in data engineering.
-
Telemetry Management: Gain insights on sending telemetry data to AWS S3 and the role of the OTel Collector.
-
OpenTelemetry Standards: Engage with the discussion surrounding OpenTelemetry and its significance in establishing formal standards.
-
EKS Auto Mode: Clarify what EKS Auto Mode is and its implications for managing Kubernetes clusters.
Links:
Embrace Risk | Revision 105 | December 16, 2024 | Opened: 3
Takeaways:
-
New updates on Open Policy Agent usage in Skipper Ingress.
-
Insights on mapping reliability to accountability in systems.
-
Exploration of tools and strategies for optimizing observability in Kubernetes and carbon footprint management.
-
Best practices for keeping User Journey SLOs up-to-date with end-to-end testing in microservices architectures.
-
Key highlights from AWS re:Invent2024 related to Amazon CloudWatch and Kubernetes v1.32.
Links:
Embrace Risk | Revision 104 | December 09, 2024 | Opened: 7
Takeaways:
-
Microservices can lead to technical debt, increasing complexity in system management.
-
Amazon EKS is adapting with new features such as hybrid nodes and auto mode for streamlined Kubernetes management.
-
Data-driven incident management is crucial; leveraging SLOs can significantly enhance performance during incidents.
-
Keeping systems updated and understanding timing issues in distributed systems can prevent operational problems.
-
The latest DORA Accelerate State of DevOps report is available for insights into current trends in DevOps.
Links:
Embrace Risk | Revision 103 | November 18, 2024 | Opened: 12
Takeaways:
-
Kubernetes RBAC improvements can enhance security posture in K8s environments.
-
Recognize challenges and limitations when moving away from Kubernetes.
-
Building an AI agent can optimize Site Reliability Engineering (SRE) practices.
-
Celebrating Go’s15th anniversary highlights its evolution and community impact.
-
Utilizing metrics from the OpenTelemetry collector aids in effective scaling solutions.
-
SQL-based observability is evolving to address complex data insights and management.
-
Understanding IT engineers’ mental models can improve team dynamics and workflows.
-
Cautionary tales highlight the importance of following cloud service guidelines (e.g., AWS Amplify).
Links:
Embrace Risk | Revision 102 | November 11, 2024 | Opened: 17
Takeaways:
-
Kubernetes Operations Redefined: The Karpenter Effect reveals advancements in Kubernetes management and operational efficiency.
-
Observability Enhancement: eBPF technology strengthens observability in cloud-native environments, improving performance monitoring.
-
OpenTelemetry Expansion: OpenTelemetry is enhancing its capabilities into CI/CD observability, streamlining monitoring and feedback loops in development.
Links: