Transforming NetOps Workflows From Manual to Automated

Staying ahead of problems is crucial in today’s fast-paced IT environment, and integrating automation and AI into Network Operations (NetOps) workflows plays a vital role in achieving this. By adopting “Shift Left” methodologies, organizations can proactively tackle challenges in troubleshooting, change management, and observability. This approach enables AI-driven ticketing systems to initiate auto-diagnosis, facilitates auto-remediation through pre-built runbooks, and ensures comprehensive visibility into past outages with real-time alerting dashboards. Embracing these innovations empowers teams to streamline operations and enhance network resilience.

Shift Left Network Operations Workflows

• Troubleshooting - AI-driven ticket triggered auto-diagnosis execution and result generation
• Change – Auto-remediation for change with pre-built runbooks
• Observability – Enforce network with past outages observability dashboard with alerting

Machine

NetBrain Automation + AI

  • Ticket creation
  • Map creation
  • Auto-diagnose known incidents
  • Auto-close tickets
  • Auto-prioritize tickets
  • Dashboard creation
Operator

NetBrain Automation + AI

  • Use AI to leverage self-service troubleshooting automation
  • Post learnings in a collaboration portal for escalations
  • Transfer knowledge efficiently between shifts
L1 Engineer

NetBrain Automation + AI

  • Use automation and maps to troubleshoot initial steps of basic problems
  • Use a collaboration portal for escalations
  • Leverage Incident dashboards
  • Expedite daily pre-checks, checks, & post-checks
L2 Engineer

NetBrain Automation + AI

  • Leverage Incident dashboards for incident record
  • Use collaboration portal
  • Leverage and build no-code automation to troubleshoot complex problems

Systematically Shift Left Four Workflows

Fusion of AI and Automation

By integrating Automation and AI into your NetOps workflows, you can seamlessly ‘Shift Left’ to enhance efficiency and responsiveness. Explore the capabilities of NetBrain AI Co-Pilot to discover the specific automation tasks it can perform, streamlining your operations and allowing your team to focus on higher-level strategic initiatives.

NetBrain’s Co-Pilot with Agentic AI uses a LLM to understand the context from your plain language input to trigger the right agents. These agents are functions that call the correct internal APIs to get each task done. This makes troubleshooting and automation much more efficient.

What automation can you ask NetBrain AI Co-Pilot to perform?

Advise
Digital Twin
A live mathematical model of your network’s intent, traffic forwarding, topology and device data.
Action Plans
Give the bot network automation steps to execute.
Orchestrate Automation
Natural Language
Query, interpret and summarize results in natural language
AI Co-Pilot Tools
Intelligent CLI
Execute CLI commands with AI intelligence
Automation
Trigger Intent based automation from the bot and return results to the bot and the map
Action Plan
Company diagnosis guidebook as bot knowledge
Database
Query database with natural language, such as IP, DNS, neighbor pair, digital twin, ADT

Use Cases for NetBrain Co-Pilot

Inventory & Data Lookup

IP Lookup
1. Translate IP to device and interface if matched
2. Translate to 1st hop L3 Gateway and L2 Gateway (One-IP Table Lookup)
3. Same Subnet Device

DNS Lookup
Translate a DNS name to an IP

Neighbor Lookup
Lookup a devices neighbor(L2, L3, IPv6)

Digital Twin Database Lookup
Query NetBrain database for data fields with reasoning

Automation Data Table Lookup
Lookup data stored in an ADT

Troubleshooting

Issue CLI Command and analyze results

Execute Network Intent for scheduled or immediate action

Run steps to troubleshoot BGP on a core router

Check the uptime of all devices and draw on mapped devices

Check all devices on map for logs with the word “error” and summarize results

Check Config Drift by Device on devices on the map and summarize results in a table form by device and result

Security

Investigate for software vulnerabilities in my network. Take the devices on the map and list the software versions

Assess the Network Observability for CVE Security on devices present on the map -> based on Automation/Observability/ Observability

For CVE Security Output = NI results
Now run $intent on map devices and display results in a dashboard with the dashboard group “check CVE vulnerabilities

Change

Before a change, capture the routing summary state for devices on the map using the “show IP route summary” CLI command. Print the results in a table and record the state for comparison with post-change data omitting “Replicates”, “Overhead”, “Memory”, “NHRP” and “internal” from the output.

Assess

Issue command “show version” on devices on a map, find uptime, version, reboot reason, then draw result output on a map. Also, print results in a tabular format and highlight any device that has rebooted in the last 24 hours.

Mapping

Get device details from devices on the map

Check and summarize QoS drops on the devices on the map

Draw device, note, arrow on map with the output data

Benefits of AI-Driven Network Automation with NetBrain AI Co-Pilot

  • Natural Language Automation Integration: Query, interpret, and summarize results in natural language.
  • Repeatable Workflows: Orchestrates pre-built playbooks, called action plans, based on previous experiences and past outages and reuses them consistently to generate repeatable, predictable results.
  • Built-In Intelligent CLI: Execute CLI Commands with AI Intelligence.
  • Accurate Data Retrieval: Dynamically get inventory from the live Digital Twin and stored device data.
  • Full Visibility: Draw devices, neighbor relationships, features, and application paths on any map.
  • User-Friendly Troubleshooting: Reduce MTTI/MTTR and shift troubleshooting left.
  • Safer Change Management: Perform pre and post-checks and monitor critical systems while implementing the change.
  • Actionable Observability: Export results of intent-driven automation to observability dashboards for diagnostic assessment sharing and archival.
  • Predictive Analysis: Analyze historical data and network behavior to predict the potential impact of changes. By identifying risks early, you can help prevent outages and performance issues.
  • Enhanced Compliance and Reporting: Ensure all changes adhere to policy and compliance requirements.