POPULARITY
Maxwell, a solution architect at xMatters, took a winding road to get to where he is. After a computer engineering education, he held jobs as field support engineer, product manager, SRE, and finally his current role as a solutions architect, where he serves as something of an SRE for SREs, helping them solve incident management problems with the help of xMatters. When he moved to the SRE role, Maxwell wanted to get back to doing technical work. It was a lateral move within his company, which was migrating an on-prem solution into the cloud. It's a journey that plenty of companies are making now: breaking an application into microservices, running processes in containers, and using Kubernetes to orchestrate the whole thing. Non-production environments would go down and waste SRE time, making it harder to address problems in the production pipeline. At the heart of their issues was the incident response process. They had several bottlenecks that prevented them from delivering value to their customers quickly. Incidents would send emails to the relevant engineers, sometimes 20 on a single email, which made it easy for any one engineer to ignore the problem—someone else has got this. They had a bad silo problem, where escalating to the right person across groups became an issue of its own. And of course, most of this was manual. Their MTTR—mean time to resolve—was lagging. Maxwell moved over to xMatters because they managed to solve these problems through clever automation. Their product automates the scheduling and notification process so that the right person knows about the incident as soon as possible. At the core of this process was a different MTTR—mean time to respond. Once an engineer started working to resolve a problem, it was all down to runbooks and skill. But the lag between the initial incident and that start was the real slowdown. It's not just the response from the first SRE on call. It's the other escalations down the line—to data engineers, for example—that can eat away time. They've worked hard to make escalation configuration easy. It not only handles who's responsible for specific services and metrics, but who's in the escalation chain from there. When the incident hits, the notifications go out through a series of configured channels; maybe it tries a chat program first, then email, then SMS. The on-call process is often a source of dread, but automating the escalation process can take some of the sting out of it. Check out the episode to learn more.
Maxwell, a solution architect at xMatters, took a winding road to get to where he is. After a computer engineering education, he held jobs as field support engineer, product manager, SRE, and finally his current role as a solutions architect, where he serves as something of an SRE for SREs, helping them solve incident management problems with the help of xMatters. When he moved to the SRE role, Maxwell wanted to get back to doing technical work. It was a lateral move within his company, which was migrating an on-prem solution into the cloud. It's a journey that plenty of companies are making now: breaking an application into microservices, running processes in containers, and using Kubernetes to orchestrate the whole thing. Non-production environments would go down and waste SRE time, making it harder to address problems in the production pipeline. At the heart of their issues was the incident response process. They had several bottlenecks that prevented them from delivering value to their customers quickly. Incidents would send emails to the relevant engineers, sometimes 20 on a single email, which made it easy for any one engineer to ignore the problem—someone else has got this. They had a bad silo problem, where escalating to the right person across groups became an issue of its own. And of course, most of this was manual. Their MTTR—mean time to resolve—was lagging. Maxwell moved over to xMatters because they managed to solve these problems through clever automation. Their product automates the scheduling and notification process so that the right person knows about the incident as soon as possible. At the core of this process was a different MTTR—mean time to respond. Once an engineer started working to resolve a problem, it was all down to runbooks and skill. But the lag between the initial incident and that start was the real slowdown. It's not just the response from the first SRE on call. It's the other escalations down the line—to data engineers, for example—that can eat away time. They've worked hard to make escalation configuration easy. It not only handles who's responsible for specific services and metrics, but who's in the escalation chain from there. When the incident hits, the notifications go out through a series of configured channels; maybe it tries a chat program first, then email, then SMS. The on-call process is often a source of dread, but automating the escalation process can take some of the sting out of it. Check out the episode to learn more.
This week's episode is brought to you by Oracle NetSuite (sign up for a personalized product tour at www.netsuite.com/scale) and Indeed (get a $75 credit for your job post at indeed.com/podcast). Abbas is the VP of Customers Success and Value Engineering at Segment (now a Twilio company) where he's building a cross-functional global value engineering team. Prior to Segment, Abbas had a 10 year stint at xMatters as CTO where he led the technology team. Abbas embodies the pay it forward mentality by mentoring his time and giving back to the community through his involvement at PlatoHQ (the largest engineering and product mentorship community), as well as EverWise (now a torch company). Connect with Abbas Haider Ali: https://www.linkedin.com/in/abbashaiderali/ Connect with Poya Osgouei: https://www.linkedin.com/in/poyaosgouei/ Connect with Robby Allen: https://www.linkedin.com/in/robbyallen/ --- Support this podcast: https://anchor.fm/uncharted1/support
CEO of xMatters, Troy McAlpin, joins this episode to discuss xMatters’ transition onto GCP, what uniquely differentiates them from other competitors, and how GCP enables them in such a way that no other Cloud Provider can. McAlpin also touches on his customer experience philosophy and how earning and exceeding customer satisfaction, day in and day out, should be the top priority for any SaaS company. Host: Tony SafoianGuest: Troy McAlpin Connect on Twitter: https://twitter.com/cloudnclear https://twitter.com/SADA https://twitter.com/Safoian https://twitter.com/tmcalpinxm Connect on LinkedIn: https://www.linkedin.com/company/sada/ https://www.linkedin.com/in/safoian/ https://www.linkedin.com/in/troy-mcalpin-2a936b6/ To learn more, visit SADA.com.
It’s all about data management this week on the podcast as Brian Dorsey and Mark Mirchandani talk to Google Cloud Product Marketing Manager, Amy Krishnamohan. Amy starts the show by explaining that Cloud SQL is a fully managed relational database service that recently added Microsoft SQL Server to its repertoire. We talk about SQL Server’s migration from 2008R2 to a newer version, the process involved, and how it’s effecting customers. Luckily, Cloud SQL for SQL Server is very backwards compatible, making the process easy for Google Cloud customers! Cloud SQL also offers other tools to make using Microsoft SQL Server easier with Google Cloud, including shortcuts to set up the high availability function. Amy talks later in the show about what companies are a good fit for Microsoft SQL Servers on Google Cloud. She explains the steps to set up and tear down, how licensing works, and what the best use cases are for Microsoft SQL Servers on Google Cloud. In the future, Cloud SQL will have a managed AD service available. A multi-cloud strategy is important, according to Amy. It is up to each company to research cloud services and pick the best vendors and products for themselves and their clients. Cloud SQL for SQL Server is a way to bring two great products together for the benefit of consumers. Amy Krishnamohan Amy Krishnamohan is Product Marketing Manager at Google Cloud responsible for databases. She has diverse experience across product marketing, marketing strategy and product management from leading enterprise software companies such as MariaDB, Teradata, SAP, Accenture, Cisco and Intuit. Amy received her Masters in Software Management from Carnegie Mellon University. Cool things of the week Introducing Cloud AI Platform Pipelines blog Finding a problem at the bottom of the Google stack blog Larger local SSD storage available now blog Compute Engine gets machine images blog Google Cloud Next Update site Interview Microsoft SQL Server site Google Cloud SQL site BigQuery site GCE site Question of the week Lift and shift, move and improve, or re-architect: How do we “move and improve”? GCP Podcast Episode 211: Digital Services with xMatters podcast Importing virtual disks docs Create machine image from virtual appliance file (OVA/OVF) docs Tutorial: Getting started with Migrate for Compute Engine docs Whitepaper: Velostrata technology for mass migrations into Google Cloud Platform whitepaper Where can you find us next? We’re working from home for a while! Brian will be looking at getting a kind of weekly “reading group” of people who work with VMs and want to get better. Ping him on Twitter if you’re interested! Mark will be working on more video content and a cool nickname for Brian!
Priyanka Vergadia joins Mark Mirchandani today to talk shop with Travis DePuy about all things digital services. Travis is a product evangelist for xMatters, a company that provides digital services for clients in a way that makes it easy for them to “limit the blast radius” as they build and use their projects. At xMatters, customers can build an incident management workflow for their custom services and integrate the tools of their choice. Travis talks about service degradation and how xMatters helps clients optimize and manage their services to control instances of degradation. With programs like Google Stackdriver, xMatters can set limits and get alerts when thresholds are met, then use that information to fix performance. Later in the show, Travis talks about moving a large enterprise like xMatters to the cloud. Travis DePuy Travis DePuy is a Tinkerer of Things, Master of Hats and Father of Kitties. He is currently Head Product Evangelist at xMatters where he gets to talk to people about how they are doing Incident Management, DevOps notifications and anything else involving humans, processes and tools. Travis balances the stationary computer work with the fluid moving of Chen Taichi and is often found in the sun flowing the forms of the old ways. Cool things of the week Join us for Google Cloud Next ‘20: Digital Connect blog Connecting businesses and educators with advanced Hangouts Meet capabilities blog Interview xMatters site Site Reliability Engineering Book site Stackdriver site Migrating a monolith to GKE - Customer Story (Get Cooking in Cloud) video Question of the week How can I improve reliability/availability with the least amount of work? Codelabs site Migrating a monolithic application to microservices on Google Kubernetes Engine article Migrating a Monolith to Google Kubernetes Engine — An Overview blog Where can you find us next? You can find Priyanka online in her video series Get Cooking in Cloud and her series on Pub/Sub. You can see Mark in recently released Stack Chat videos.
Troy and Tobias work at xMatters providing tools that help devops engineers manage things when they go wrong. Their tools at xMatters provide information that makes is easier to track down problems. Running a service like xMattters also allows them to become experts in how workflows should go to empower people to fix issues. Follow UpCloud on Twitter: @UpCloud Follow Troy on Twitter: @Tmcalpinxm Connect with Tobias on LinkedIn Panelist Nell Shamrell-Harrington Charles Max Wood Guests Troy McAlpin Tobias Dunn-Krahn Sponsors UpCloud Use promo code " devchat" for $25 off CacheFly _______ "The MaxCoders Guide to Finding Your Dream Developer Job" by Charles Max Wood is now available on Amazon. Get Your Copy Today! ______________________________________ Links OODA loop (x)matters Picks Nell Shamrell-Harrington: A Higher Standard Charles Max Wood: Contigo Water Bottles Conference Swag SOCKS!!! Tobias Dunn-Krahn: BigQuery Chat Bots Tobias Dunn-Krahn: Delivering Happiness Best Friends
Troy and Tobias work at xMatters providing tools that help devops engineers manage things when they go wrong. Their tools at xMatters provide information that makes is easier to track down problems. Running a service like xMattters also allows them to become experts in how workflows should go to empower people to fix issues. Follow UpCloud on Twitter: @UpCloud Follow Troy on Twitter: @Tmcalpinxm Connect with Tobias on LinkedIn Panelist Nell Shamrell-Harrington Charles Max Wood Guests Troy McAlpin Tobias Dunn-Krahn Sponsors UpCloud Use promo code " devchat" for $25 off CacheFly _______ "The MaxCoders Guide to Finding Your Dream Developer Job" by Charles Max Wood is now available on Amazon. Get Your Copy Today! ______________________________________ Links OODA loop (x)matters Picks Nell Shamrell-Harrington: A Higher Standard Charles Max Wood: Contigo Water Bottles Conference Swag SOCKS!!! Tobias Dunn-Krahn: BigQuery Chat Bots Tobias Dunn-Krahn: Delivering Happiness Best Friends
Travis Depuy of xMatters speaks to Leandro & Brian about how to leverage tools like xMatters to enable proactive alerting and remediation throughout your code life-cycle.
Travis Depuy of xMatters speaks to Leandro & Brian about how to leverage tools like xMatters to enable proactive alerting and remediation throughout your code life-cycle.
Travis Depuy of xMatters speaks to Leandro & Brian about how to leverage tools like xMatters to enable proactive alerting and remediation throughout your code life-cycle
Travis Depuy of xMatters speaks to Leandro & Brian about how to leverage tools like xMatters to enable proactive alerting and remediation throughout your code life-cycle
In this episode, you’ll hear Greg, Conor and Travis Depuy from xMatters discuss closed-loop APIs and how AI can be applied to see the holes in the cognitive blind spots of large enterprises. They look at how we can build durable DevOps teams and overcome variables within the natural process of skill acquisition. https://www.xmatters.com/ https://www.mthree.com/
Introducing chaotic, unpredictable test software into your methodical testing regime is a good idea, right? Yes, it's a branch of testing called Chaos Engineering, or Chaos Testing. Netflix's Chaos Monkey famously introduced many of us to the idea that resilient systems, networks and software become more resilient and less brittle if we use chaotic testing methods to find their weak points before customers do. An innovative engineer at xMatters launched a new open source chaos testing tool named after H.P. Lovecraft's nightmarish character Cthulhu. Cthulhu is designed to test across multiple cloud providers, initially supporting Google Cloud with plans to support Amazon Web Services. It's open source, free to use, looking for more contributors, and is available on GitBub. We are joined on this DevOps Chats by Tobias Dunn-Krahn, CTO, and Gabrielle Gasse, lead engineer on the Cthulhu open source project, both at xMatters.
This week we explain everything you need to know about monitoring and compliance. Plus, we review this history of the monolith and how it led to microservices. Forget AWS vs. Azure, it’s WholeFoods vs. H-E-B that’s what will divide families! H-E-B buys Favor. (https://www.americaninno.com/austin/inno-news/texas-grocery-giant-h-e-b-is-acquiring-favors-on-demand-delivery-business/) Amazon extends 5% back Prime credit card benefits to Whole Foods purchases (https://www.geekwire.com/2018/amazon-extends-5-back-prime-credit-card-benefits-whole-foods-purchases/) Hard-hustle & shameless self-promotion Brandon interviews JJ on this weeks Software Defined Interviews (http://www.softwaredefinedinterviews.com/63). (http://www.softwaredefinedinterviews.com/63) Make sure to subscribe Email us at stickers@softwaredefinedtalk.com for free stickers. This episode brought to you by: Datadog! This episode is sponsored by Datadog, a monitoring platform for cloud-scale infrastructure and applications. Built by engineers, for engineers, Datadog provides visibility into more than 200 technologies, including AWS, Chef, and Docker, with built-in metric dashboards and automated alerts. With end-to-end request tracing, Datadog provides visibility into your applications and their underlying infrastructure—all in one place. Sign up for a free trial (https://www.datadoghq.com/lpgs/?utm_source=Advertisement&utm_medium=Advertisement&utm_campaign=SoftwareDefinedTalk-Tshirt-Native). Datadog also offers Forecast Alerts (https://www.datadoghq.com/blog/forecasts-datadog/), which makes it easy to get notified of potential problems before they cause outages. Read more at: https://www.datadoghq.com/blog/forecasts-datadog/ Compliance, Monitoring, Mesh & Design, Oh my! Chef Software bids to automate compliance with new InSpec 2.0 release (https://siliconangle.com/blog/2018/02/20/chef-software-bids-automate-compliance-new-inspec-2-0-release/) Networking ruins everything! History of Service Mesh. (https://thenewstack.io/history-service-mesh/) The RED Method: A New Approach to Monitoring Microservices (https://thenewstack.io/monitoring-microservices-red-method/) A print button? Mmkay. Let's explore WHY you need me to add that (https://www.theregister.co.uk/2018/02/20/design_in_the_age_of_devops/) xMatters snares $40 million Series D led by Goldman Sachs Private Capital Investing (https://techcrunch.com/2018/02/20/xmatters-snares-40-million-series-d-led-by-goldman-sachs-private-capital-investing/) The Road To 400G Ethernet Is Paved With Bechtolsheim’s Intentions (https://www.nextplatform.com/2018/02/21/road-400g-ethernet-paved-bechtolsheims-intentions/) Conferences, et. al. May 15th to 18th, 2018 - Coté talking EA at Continuous Lifecycle London (https://continuouslifecycle.london/sessions/the-death-of-enterprise-architecture-defeating-the-devops-microservices-and-cloud-native-assassins/). DevOpsDays Jakarta (http://devopsdays.org/events/2018-jakarta/) - April 26-27 Matt will be there Derek Mazzone (https://kexp.org/djs/darek-mazzone/) from KEXP ChefConf 2018 (https://chefconf.chef.io/) - May 22-25 in Chicago SxSW (https://www.sxsw.com/) — Brandon in Austin giving out stickers SDT news & hype Check out Software Defined Interviews (http://www.softwaredefinedinterviews.com/), our new podcast. Pretty self-descriptive, plus the #exegesis podcast we’ve been doing, all in one, for free. Keep up with the weekly newsletter (https://us1.campaign-archive.com/home/?u=ce6149b4008d62a08093a4fa6&id=5877922e21). Join us in Slack (http://www.softwaredefinedtalk.com/slack). Buy some t-shirts (https://fsgprints.myshopify.com/collections/software-defined-talk)! DISCOUNT CODE: SDTFSG (20% off) Send your name and address to stickers@softwaredefinedtalk.com (mailto:stickers@softwaredefinedtalk.com) and we will send you a sticker. Recommendations Matt: Oceanic (https://www.amazon.com/gp/product/B00E84GOZY/) by Greg Egan Brandon: (https://www.netflix.com/title/80176878)How to Turn Down a Billion Dollars: The Snapchat Story (https://www.audible.com/pd/Science-Technology/How-to-Turn-Down-a-Billion-Dollars-Audiobook/B079DTHLCJ) related Snap stock plummets after Kylie Jenner declares Snapchat dead (https://www.theverge.com/2018/2/22/17040332/snap-stock-price-kylie-jenner-tweet-snapchat-1-billion-market-loss) Cover Art Credit (https://www.flickr.com/photos/polycart/6899465702/)
Abbas Haider Ali, CTO of xMatters, joins Nice Work! to talk with Lacey about how the challengers to solving incidents at scale. It takes a team of people to solve incidents. Do you know who the right people are or do you just require everyone to join the call? Queue the endless conference call joining meme. Lacey also talks to Abbas about his work mentoring women in tech through the mentor matching service, Everwise.
Many of my guests on the show are Tech leaders that appear on my radar that not only resonate but also inspire or intrigue me. Abbas Haider is the CTO of xMatters, and after seeing a video talking about how internal collaboration is crucial to digital transformation I wanted to find out more. xMatters aim to boost IT Operations, DevOps and Major Incident Management processes with intelligent communications. "Turn Insights into Action During Business Disruptions From IT major incidents to manufacturing line slowdowns, virtually every business process generates insights that need to reach the people required to take action" As most of you listening know by now, I love my tech, but it's how it brings people together and increases collaboration that really excites me. When I also heard him speak my IT language about the major incident and change management along with how automated chatbots can help in the communication process, I had to get him to on the show. Guest Info www.xmatters.com @xMatters_inc