Glossary

A repository of acronyms, jargon, and useful words for product and customer teams

  • A
  • B
  • C
  • D
  • E
  • F
  • G
  • H
  • I
  • J
  • K
  • L
  • M
  • N
  • O
  • P
  • Q
  • R
  • S
  • T
  • U
  • V
  • W
  • X
  • Y
  • Z
A

Anomaly Detection

Anomaly detection is the process of finding data points that are outliers from the rest of a data set.

Read full description

Apache Arrow

Apache Arrow a language-agnostic software framework for developing data analytics applications that process columnar data. It contains a standardized column-oriented memory format that is able to represent flat and hierarchical data for efficient analytic operations.

Read full description

Apache Arrow Flight SQL

Apache Arrow Flight SQL provides a high performance SQL interface for working with databases over a network

Read full description

Apache DataFusion

DataFusion is an in-memory query planning, optimization, and execution framework. DataFusion was created in 2017 and donated to the Apache Arrow project in 2019.

Read full description

Apache Parquet

Apache Parquet is an open source columnar data file format that supports different encoding and compression schemes to optimize it for efficient data storage and retrieval in bulk.

Read full description

ARIMA

An Autoregressive Integrated Moving Average (ARIMA) model is a widely used time series forecasting technique.

Read full description
C

CAP Theorem

CAP theorem is a computer science theory related to the tradeoffs involved with designing distributed databases.

Read full description

Cardinality

In the context of databases cardinality is the number of unique sets of data stored in a database. Specifically, it refers to the total number of unique values possible within a table column or database equivalent.

Read full description

Column Database

Column databases are a type of DBMS that store data formatted in columns rather than rows and are optimized for analytics workloads.

Read full description
D

Data Warehouse

A data warehouse is a data management system that supports business intelligence, such as data analysis. Data warehouses help you make more insightful decisions about your business.

Read full description

Database as a Service (DBaaS)

Database-as-a-service (DBaaS) is a cloud computing service that provides access to a cloud database system without needing to set up, configure, or manage software or physical infrastructure.

Read full description

Distributed Tracing

Tracing is a method for understanding how interconnected components of a distributed system interact with each other.

Read full description
E

Edge Computing

Edge computing is a type of computing that happens near a data source. It allows you to perform computing tasks as close to an IoT device or end user as possible instead of using a data center or the cloud.

Read full description

ETL (Extract, Transform, Load)

ETL stands for Extract, Transform, Load and is the process of moving and manipulating data from different sources before storing it in another database.

Read full description
I

IoT Devices

The Internet of Things refers to the network of interconnected "things" with sensors, software, processing ability, and other technologies that connect and exchange data with other internet-connected devices. IoT devices include smartphones, medical sensors, fitness trackers, smart security systems, and other technologies.

Read full description
O

Online Analytical Processing (OLAP)

Online Analytical Processing (OLAP) is an approach to working with typically multidimensional data for analytics use cases.

Read full description

OPC Unified Architecture (OPC UA)

OPC UA is a cross-platform standard for moving data between sensors and cloud applications.

Read full description
P

Prometheus metrics

Prometheus stores four metric types for monitoring needs: counters, gauges, histograms, and summaries.

Read full description
R

Real User Monitoring

Real User Monitoring(RUM) is the process of collecting user data to gain insights into your application's performance and how it is being used.

Read full description
S

SCADA (Supervisory Control And Data Acquisition)

SCADA stands for Supervisory Control and Data Acquisition. A SCADA system is usually a collection of both software and hardware components that allow supervision and control of industrial plants.

Read full description

Seasonality

Seasonality is the presence of regular and predictable change in time series data.

Read full description

Stationarity

Stationarity refers to a time series where the statistical properties of that series don’t depend on the time when observing it.

Read full description

SQL

SQL is a domain specific language used in programming and designed for managing data held in a relational database management system.

Read full description