Observability

Bản phát hành ứng viên

A version of software or integration package intended for final testing before production release; considered stable and potentially ready for deployment if no major issues are found.

View term

Bảng điều khiển giám sát

A centralized, configurable visual interface presenting real-time and historical operational data, including metrics, logs, alerts, and system status for observability and decision support.

View term

Chính sách cảnh báo

A defined set of rules specifying when, how, and to whom alerts are sent for monitored resources or systems, supporting consistent incident response.

View term

Chính sách leo thang

A documented protocol specifying how, when, and to whom alerts or incidents are elevated if initial response targets are unmet, ensuring timely mitigation and accountability.

View term

Chỉ số vận hành

A quantifiable measurement used to assess, monitor, and optimize the performance, availability, or reliability of IT systems, integrations, and operational processes.

View term

Cảnh báo hiệu suất

An automated system notification triggered when monitored performance metrics (e.g., CPU, latency, error rates) exceed pre-defined operational thresholds, indicating potential system degradation.

View term

Cảnh báo vận hành

A system-generated notification or escalation indicating a potential or actual deviation from expected operational thresholds, requiring investigation or response.

View term

Cầu nối sự cố

A real-time virtual or phone conference line used by IT operations and engineering teams to coordinate during critical incidents, outages, or high-severity service disruptions.

View term

Dòng thời gian sự cố

A chronological record of key events, actions, alerts, and communications during an incident, enabling postmortem analysis and process improvement.

View term

Giao hàng liên tục

A DevOps practice ensuring that code is always in a deployable state and can be released to production at any time with minimal manual intervention.

View term

Giám sát thời gian hoạt động

A monitoring system or tool that continuously checks the availability and operational status of services or infrastructure endpoints, typically reporting outages or downtime.

View term

Giám sát tổng hợp

A proactive monitoring technique that simulates user interactions or transactions using automated scripts to test system availability, performance, and functionality from various locations.

View term

Gửi log

The automated process of transmitting log files or events from source systems to centralized storage, analytics, or monitoring platforms for retention and analysis.

View term

Hoàn tác dịch vụ

The process of reverting a deployed service, application, or configuration to a previous stable state following a failed release or incident.

View term

Hành trình người dùng

A mapped sequence of steps or interactions a user undertakes within an application or service, tracked in monitoring and analytics to assess experience, performance, and incident impact.

View term

Khung thời gian phát hành

A predefined, scheduled time period during which changes, deployments, or updates are allowed to be released to production systems, minimizing operational risk and ensuring oversight.

View term

Khám phá dịch vụ

The automated identification and registration of network services and endpoints in dynamic environments, enabling real-time inventory and routing for integration and monitoring.

View term

Khả năng quan sát vận hành

The real-time ability to observe, track, and understand the health, performance, and dependencies of services and integrations across an IT environment.

View term

Khắc phục trôi

The process of correcting or reverting configuration, infrastructure, or policy drift to realign with the approved baseline or desired state.

View term

Kiểm soát thay đổi

A formalized process and policy for requesting, reviewing, approving, and implementing changes to systems, services, or infrastructure to minimize operational risk and ensure traceability.

View term

Kiểm tra dịch vụ

A scheduled or automated probe that verifies the operational health or expected behavior of a monitored service, endpoint, or integration.

View term

Kiểm tra hiệu năng

A formal process of simulating user loads or transactions to measure application, service, or infrastructure responsiveness, stability, and scalability under various conditions.

View term

Kiểm tra trạng thái

An automated operation or request that determines whether a service, application, or integration endpoint is available and performing within defined operational thresholds. Commonly used in continuous monitoring, platform health dashboards, and orchestration systems.

View term

Kênh cảnh báo

A configured communication path (such as email, SMS, chat, or incident management platform) through which monitoring or observability systems send automated notifications or alerts to responsible teams.

View term

Leo thang cảnh báo

A predefined process or rule set that determines how alerts are prioritized, routed, and escalated to higher-level teams or stakeholders when initial response targets are not met.

View term

Liên kết sự kiện

The process of analyzing and linking related monitoring or log events to identify patterns, root causes, and reduce alert noise in incident management.

View term

Luồng sự kiện

A continuous, real-time flow of structured events emitted by systems, services, or integrations for monitoring, analytics, and automation in observability pipelines.

View term

Lưu giữ dữ liệu

The operational policy and timeframe for preserving observability, monitoring, and system data in logs, traces, or metrics, as mandated by business or regulatory requirements.

View term

Lấy mẫu trace

A method for selectively collecting and storing a subset of distributed traces from monitored systems to balance observability with storage and cost efficiency.

View term

Miền lỗi

A distinct section of infrastructure, service, or application whose failure is isolated and does not impact other domains, enabling targeted fault tolerance and risk management.

View term

Môi trường kiểm thử

A controlled non-production environment used to execute tests on new code, configurations, or integrations before deploying to production.

View term

Mức độ nghiêm trọng của sự cố

A standardized classification that rates the impact and urgency of an incident, guiding triage, prioritization, and escalation procedures in incident management.

View term

Nguyên nhân gốc rễ

The fundamental reason or underlying issue that leads to a service incident, failure, or recurring anomaly, determined through root cause analysis (RCA) processes in IT operations and monitoring.

View term

Ngân sách lỗi

A pre-defined, quantifiable allowance for acceptable error or downtime in a service, calculated as the difference between 100% and the target Service Level Objective (SLO). Used in SRE to balance reliability and release velocity.

View term

Ngưỡng chỉ số

A defined numerical limit or value set for a monitored metric which, when breached, triggers alerts or automated remediation actions.

View term

Ngưỡng cảnh báo

A predefined value or condition which, when exceeded by a monitored metric, triggers an automated alert to notify teams of a potential incident or risk.

View term

Nhận thức vận hành

Actionable, data-driven understanding of system health, performance, and risk, derived from the continuous analysis of metrics, logs, traces, and events across all monitored environments.

View term

Nhật ký thay đổi

A systematically maintained record of all changes, updates, and modifications made to systems, integrations, or monitored environments, typically used for audit and troubleshooting.

View term

Phiếu sự cố

A formal record created in ITSM or monitoring platforms that logs details of an operational incident, tracks status, and supports investigation, communication, and resolution.

View term

Phát hiện bất thường

The automated identification of abnormal patterns, outliers, or deviations from expected behavior in monitored metrics, logs, or events, indicating potential incidents or performance issues.

View term

Phát hiện trôi

The identification of unintended changes or deviations in configuration, state, or baseline that can lead to non-compliance, vulnerabilities, or operational risk.

View term

Phân tích trace

The examination of distributed transaction traces across services and systems to diagnose latency, errors, or bottlenecks, providing end-to-end observability in complex integration environments.

View term

Phê duyệt thay đổi

A formal process in which proposed changes to systems or integrations are reviewed and authorized by designated stakeholders before implementation.

View term

Phản hồi liên tục

An ongoing loop of automated or manual input gathered from users, systems, or integrations, used to improve reliability, performance, and service delivery.

View term

Phản hồi sự cố

A structured process for detecting, analyzing, containing, and resolving service incidents, following predefined protocols for communication, escalation, and recovery.

View term

Quy tắc cảnh báo

A specific, configurable criterion or threshold in monitoring tools that, when met or exceeded, triggers an automated alert.

View term

Quản lý trạng thái

The set of practices, tooling, and monitoring that track, synchronize, and manage the current operational status or configuration of systems and integrations.

View term

Slot triển khai

An isolated environment or instance in a deployment platform (e.g., Azure App Service) allowing multiple app versions to be staged, tested, or swapped without impacting production.

View term

Sử dụng tài nguyên

The measurement and analysis of how computing resources such as CPU, memory, storage, or network bandwidth are consumed by systems, services, or workloads.

View term

Sự kiện ngừng hoạt động

An occurrence—planned or unplanned—where a system, service, or application becomes unavailable, recorded for compliance, SLA, and incident review purposes.

View term

Thu thập chỉ số

The continuous process of gathering quantitative measurements (metrics) from applications, services, or infrastructure components to enable performance analysis, alerting, and operational decision-making.

View term

Thông báo ngừng hoạt động

An automated or manual alert that informs users, stakeholders, or integrated systems about planned or unplanned service outages, including expected impact and resolution updates.

View term

Thời gian hoạt động hệ thống

The percentage or absolute amount of time a monitored system, service, or application is available and operational, typically over a specific reporting period.

View term

Thời gian phản hồi

The duration between a service request and its corresponding response, tracked as a critical performance and SLO metric.

View term

Thời gian thực

An operational mode or system capability in which data is processed, analyzed, and acted upon instantly or with minimal delay, supporting immediate decision-making and alerting.

View term

Tiện ích bảng điều khiển

A modular, configurable interface component that visualizes real-time metrics, logs, alerts, or status data on a monitoring dashboard for at-a-glance operational insight.

View term

Trang trạng thái

A public or internal web page that displays real-time and historical status information, incident notifications, and availability metrics for monitored systems, applications, or integrations.

View term

Triển khai liên tục

A DevOps practice in which code changes are automatically built, tested, and deployed to production environments without manual intervention.

View term

Trôi cấu hình

A condition where system, application, or infrastructure configurations deviate from the defined baseline or intended state, often due to manual changes or failed automation.

View term

Trôi cấu hình

The deviation of a system's actual configuration from the defined baseline or intended state, often due to manual changes, misapplied automation, or update failures.

View term

Tích hợp cảnh báo

The connection of alerting systems with other tools or platforms (e.g., chat, incident management, ticketing) to enable automated, end-to-end incident response workflows.

View term

Tạo phẩm build

A file, binary, container image, or package produced as the output of a build process, which is versioned and deployed in integration pipelines.

View term

Tổng hợp log

The process of collecting, centralizing, and storing log data from multiple sources (applications, services, infrastructure) into a unified repository for analysis, troubleshooting, and compliance.

View term

Tự động mở rộng

Automated adjustment of computing resources such as server instances or containers, in real time, based on system load or traffic metrics.

View term

Tỷ lệ lỗi

The proportion of system or application errors, faults, or exceptions encountered during operation, often used as a service health metric.

View term

Tỷ lệ thất bại

The frequency or proportion of unsuccessful transactions, jobs, or service requests compared to the total, measured over a defined period in monitoring systems.

View term

Vùng khả dụng

A logically isolated location within a cloud region designed with independent power, networking, and cooling to increase service resilience and fault tolerance.

View term

Điều phối dịch vụ

The automated coordination and management of multiple services, microservices, or workflows to achieve complex integration or business processes.

View term

Điều phối phát hành

The coordination and automation of deployment steps, approvals, and testing in a release process to ensure smooth, controlled, and auditable production rollouts.

View term

Đo từ xa dịch vụ

The automated collection, transmission, and analysis of health, performance, and usage data from services or integrations for observability, monitoring, and optimization.

View term

Đánh giá sau sự cố

A formal retrospective analysis conducted after a major incident or outage to document causes, impacts, and corrective actions for operational improvement.

View term

Đóng băng phát hành

A scheduled or emergency halt in production releases, updates, or deployments to preserve system stability during critical periods.

View term

Đăng ký dịch vụ

A centralized database or directory where details about available services, their instances, and network locations are stored for discovery and health monitoring.

View term

Đường chuẩn hiệu suất

A set of standard, historical performance measurements used as a reference to identify anomalies, regressions, or improvements in system and service behavior.

View term

Đường chuẩn hệ thống

A documented set of expected system configurations, performance levels, and operational characteristics against which deviations, drifts, or anomalies can be measured.

View term

Đường cơ sở chỉ số

A reference value or set of typical ranges established for key operational metrics, used to detect anomalies and trigger alerts if deviations occur.

View term

Đường ống dữ liệu

A managed sequence of data processing steps that moves, transforms, and analyzes data from source systems to destinations, supporting real-time analytics, monitoring, and integration.

View term

Độ bao phủ kiểm thử

A quantitative measure of how much of a codebase or system is exercised by automated or manual tests, typically expressed as a percentage.

View term

Độ trễ dịch vụ

The measurable delay between a service request and its corresponding response, typically tracked as a key performance indicator in monitoring and SLO compliance.

View term

Đột biến độ trễ

A sudden, often short-lived increase in response time or processing delay in services, networks, or integrations, potentially signaling overload, bottlenecks, or emerging incidents.

View term

Languages

Bản phát hành ứng viên

Bảng điều khiển giám sát

Chính sách cảnh báo

Chính sách leo thang

Chỉ số vận hành

Cảnh báo hiệu suất

Cảnh báo vận hành

Cầu nối sự cố

Dòng thời gian sự cố

Giao hàng liên tục

Giám sát thời gian hoạt động

Giám sát tổng hợp

Gửi log

Hoàn tác dịch vụ

Hành trình người dùng

Khung thời gian phát hành

Khám phá dịch vụ

Khả năng quan sát vận hành

Khắc phục trôi

Kiểm soát thay đổi

Kiểm tra dịch vụ

Kiểm tra hiệu năng

Kiểm tra trạng thái

Kênh cảnh báo

Leo thang cảnh báo

Liên kết sự kiện

Luồng sự kiện

Lưu giữ dữ liệu

Lấy mẫu trace

Miền lỗi

Môi trường kiểm thử

Mức độ nghiêm trọng của sự cố

Nguyên nhân gốc rễ

Ngân sách lỗi

Ngưỡng chỉ số

Ngưỡng cảnh báo

Nhận thức vận hành

Nhật ký thay đổi

Phiếu sự cố

Phát hiện bất thường

Phát hiện trôi

Phân tích trace

Phê duyệt thay đổi

Phản hồi liên tục

Phản hồi sự cố

Quy tắc cảnh báo

Quản lý trạng thái

Slot triển khai

Sử dụng tài nguyên

Sự kiện ngừng hoạt động

Thu thập chỉ số

Thông báo ngừng hoạt động

Thời gian hoạt động hệ thống

Thời gian phản hồi

Thời gian thực

Tiện ích bảng điều khiển

Trang trạng thái

Triển khai liên tục

Trôi cấu hình

Trôi cấu hình

Tích hợp cảnh báo

Tạo phẩm build

Tổng hợp log

Tự động mở rộng

Tỷ lệ lỗi

Tỷ lệ thất bại

Vùng khả dụng

Điều phối dịch vụ

Điều phối phát hành

Đo từ xa dịch vụ

Đánh giá sau sự cố

Đóng băng phát hành

Đăng ký dịch vụ

Đường chuẩn hiệu suất

Đường chuẩn hệ thống

Đường cơ sở chỉ số

Đường ống dữ liệu

Độ bao phủ kiểm thử