Label | Explanation | Data Type |
Input raster bands | The input raster bands. They can be integer or floating point type. | Raster Layer |
Output signature file | The output signature file. A .gsg extension must be specified. | File |
Number of classes | Number of classes into which to group the cells. | Long |
Number of iterations (Optional) | Number of iterations of the clustering process to run. The default is 20. | Long |
Minimum class size (Optional) | Minimum number of cells in a valid class. The default is 20. | Long |
Sample interval (Optional) | The interval to be used for sampling. The default is 10. | Long |
Available with Spatial Analyst license.
Summary
Uses an isodata clustering algorithm to determine the characteristics of the natural groupings of cells in multidimensional attribute space and stores the results in an output ASCII signature file.
Usage
Iso Cluster performs clustering of the multivariate data combined in a list of input bands. The resulting signature file can be used as the input for a classification tool, such as Maximum Likelihood Classification, that produces an unsupervised classification raster.
When a multiband raster is specified as one of the Input raster bands (in_raster_bands in Python), all the bands will be used.
To process a selection of bands from a multiband raster, you can first create a new raster dataset composed of those particular bands with the Composite Bands tool, and use the result in the list of the Input raster bands (in_raster_bands in Python).
The minimum valid value for the number of classes is two. There is no maximum number of clusters. In general, more clusters require more iterations.
To provide the sufficient statistics necessary to generate a signature file for a future classification, each cluster should contain enough cells to accurately represent the cluster. The value entered for the minimum class size should be approximately 10 times larger than the number of layers in the input raster bands.
The value entered for the sample interval indicates one cell out of every n-by-n block of cells is used in the cluster calculations.
You shouldn't merge or remove classes or change any of the statistics of the ASCII signature file.
Generally, the more cells contained in the extent of the intersection of the input bands, the larger the values for minimum class size and sample interval should be specified. Values entered for the sample interval should be small enough that the smallest desirable categories existing in the input data will be appropriately sampled.
The class ID values on the output signature file start at one and sequentially increase to the number of input classes. The assignment of the class numbers is arbitrary.
Better results will be obtained if all input bands have the same data ranges. If the bands have vastly different data ranges, the data ranges can be transformed to the same range using Map Algebra to perform the equation.
where: Z is the output raster with new data ranges. X is the input raster. oldmin is the minimum value of the input raster. oldmax is the maximum value of the input raster. newmin is the desired minimum value for the output raster. newmax is the desired maximum value for the output raster.
See Analysis environments and Spatial Analyst for additional details on the geoprocessing environments that apply to this tool.
Parameters
IsoCluster(in_raster_bands, out_signature_file, number_classes, {number_iterations}, {min_class_size}, {sample_interval})
Name | Explanation | Data Type |
in_raster_bands [in_raster_band,...] | The input raster bands. They can be integer or floating point type. | Raster Layer |
out_signature_file | The output signature file. A .gsg extension must be specified. | File |
number_classes | Number of classes into which to group the cells. | Long |
number_iterations (Optional) | Number of iterations of the clustering process to run. The default is 20. | Long |
min_class_size (Optional) | Minimum number of cells in a valid class. The default is 20. | Long |
sample_interval (Optional) | The interval to be used for sampling. The default is 10. | Long |
Code sample
This example creates a signature file for classifying the input multiband raster into five classes.
import arcpy
from arcpy import env
from arcpy.sa import *
env.workspace = "C:/sapyexamples/data"
IsoCluster("redlands", "c:/sapyexamples/output/isosig.gsg", 5, 20, 50, 15)
This example creates a signature file for classifying the input multiband raster into five classes.
# Name: IsoCluster_Ex_02.py
# Description: Uses an isodata clustering algorithm to determine the
# characteristics of the natural groupings of cells in multidimensional
# attribute space and stores the results in an output ASCII signature file.
# Requirements: Spatial Analyst Extension
# Import system modules
import arcpy
from arcpy import env
from arcpy.sa import *
# Set environment settings
env.workspace = "C:/sapyexamples/data"
# Set local variables
inRaster = "redlands"
outSig = "redlndiso.gsg"
classes = 5
cycles = 20
minMembers = 50
sampInterval = 15
# Execute IsoCluster
IsoCluster(inRaster, outSig, classes, cycles, minMembers, sampInterval)