Use data engineering

Insights desktop
Note:

Data engineering is available in Insights desktop. All Insights in ArcGIS Online and Insights in ArcGIS Enterprise users have access to Insights desktop. For more information, see Introduction to ArcGIS Insights.

Data engineering is currently in Preview.

Data engineering is a process that includes exploring, visualizing, cleaning, and preparing data for analysis. Data engineering is typically performed before you start an analysis workflow.

Data engineering can be completed in Insights using a data model. The data model is created by running data engineering tools on a dataset or a sample of a dataset. Running the data model applies the tools to the full dataset and creates a new output dataset that is ready to use for analysis.

Example

A GIS analyst is preparing air quality data for analysis in Insights. The analyst loads the data into a data workbook in Insights desktop, which automatically trims all extra spaces from the beginning and end of strings. The analyst uses the Show column summary tool to explore the columns in the dataset and discovers that 9999 is used for missing values. The analyst then uses the Find and replace tool to search for 9999 values and replace them with null values. The analyst also uses the Advanced filter tool to filter the dataset to the desired study area.

Once the analyst is satisfied that the dataset has been prepared for analysis, they can run the data model to create a new output dataset. The analyst decides to save the output as a local dataset in Insights desktop. They can also export the local dataset to another format, such as a compressed shapefile, to share with members of their organization or use in Insights in ArcGIS Online or Insights in ArcGIS Enterprise.

Perform data engineering

Every data engineering workflow will differ slightly based on requirements of the individual datasets. The following workflow can be used as a general guideline for how to perform data engineering in Insights:

  1. Create a data workbook in Insights desktop.
  2. Add data and apply import options if necessary.
  3. Apply dataset and column tools to clean and prepare the data.

    The tools are added to the data model automatically.

  4. Run the data model to create an output dataset.