The Summarize Nearby tool finds features that are within a specified distance of features in the input layer using a line distance or travel mode. Statistics are then calculated for the nearby features.
Note:
Summarize Nearby requires the following directions and routing services to run with travel modes:
- Service Area (Asynchronous)
- Routing Utilities
For more information, see Configure utility services.
Workflow diagram
Examples
A school board is building a new elementary school and needs to decide between several potential locations. They decide that the school should be in the area with the most children aged 5 to 12. The population of children aged 5 to 12 can be calculated with the Summarize Nearby tool using a one-mile walking distance around each potential location and census blocks that include population data.
Terminology
Term | Description |
---|---|
Geodesic | Refers to a line drawn on a sphere. A geodesic line drawn on the globe represents the curvature of the earth's geoid. |
Euclidean distance | A straight-line distance as measured on a flat surface (that is, a Cartesian plane). |
Usage notes
The Summarize Nearby tool requires two input layers: a layer from which distances will be measured and a layer to summarize. Both layers may be points, lines, or areas. The first input layer can contain up to 1,000 features.
The nearby distance can be based on a line distance or several different travel modes using time or distance as the measure. Travel modes are only enabled when the first input layer contains point features. You can create multiple output areas by inputting each time or distance separated by a space.
Some travel modes have an option for using traffic conditions. When traffic conditions are used, they may be based on live conditions or typical conditions for a specified day of the week and time. Live traffic conditions can be offset up to 4 hours from the current time. Times for typical conditions can be set by 15-minute intervals for the entire day and night.
Travel modes can be configured by the administrator of your organization. The administrator can also add new travel modes or remove travel modes that are not necessary for your organization. The default distance measure will be in Miles or Kilometers, depending on the Units setting in your profile.
Travel modes | Description | Defaults | Options |
---|---|---|---|
Walking Time | Follows paths and roads that allow pedestrian traffic and finds solutions that optimize travel time. The walking speed is set to 5 kilometers per hour. | Walking speed is set at 5 kilometers (3.1 miles) per hour. The walking speed can be configured by the administrator of your organization. The default Walking Time is 5 Minutes. | Durations can be in Seconds, Minutes, or Hours. A duration of up to 9 hours can be used as input. |
Rural Driving Distance | Models the movement of cars and other similar small automobiles, such as pickup trucks, and finds solutions that optimize travel distance. Travel obeys one-way roads, avoids illegal turns, and follows other rules that are specific to cars but does not discourage travel on unpaved roads. | The default Rural Driving Distance is 5 Miles (US Standard setting) or 5 Kilometers (Metric setting). | Distances can be in Miles, Yards, Feet, Kilometers, or Meters. A distance of up to 300 miles (482.80 kilometers) can be used as input. |
Driving Time | Models the movement of cars and other similar small automobiles, such as pickup trucks, and finds solutions that optimize travel time. Travel obeys one-way roads, avoids illegal turns, and follows other rules that are specific to cars. When you specify a start time, dynamic travel speeds based on traffic are used where it is available. Verify whether traffic data is available in your region by clicking the See availability link in the tool pane. | The default Driving Time is 5 Minutes. Use traffic is unchecked by default. The driving speed will be based on historical and live traffic data. | Durations can be in Seconds, Minutes, or Hours. A duration of up to 5 hours can be used as input. Traffic can be based on live conditions or typical conditions for a specified day of the week and time. |
Driving Distance | Models the movement of cars and other similar small automobiles, such as pickup trucks, and finds solutions that optimize travel distance. Travel obeys one-way roads, avoids illegal turns, and follows other rules that are specific to cars. | The default Driving Distance is 5 Miles (US Standard setting) or 5 Kilometers (Metric setting). | Distances can be in Miles, Yards, Feet, Kilometers, or Meters. A distance of up to 300 miles (482.80 kilometers) can be used as input. |
Walking Distance | Follows paths and roads that allow pedestrian traffic and finds solutions that optimize travel distance. | The default Walking Distance is 5 Miles (US Standard setting) or 5 Kilometers (Metric setting). | Distances can be in Miles, Yards, Feet, Kilometers, or Meters. A distance of up to 27 miles (43.45 kilometers) can be used as input. |
Rural Driving Time | Models the movement of cars and other similar small automobiles, such as pickup trucks, and finds solutions that optimize travel time. Travel obeys one-way roads, avoids illegal turns, and follows other rules that are specific to cars but does not discourage travel on unpaved roads. When you specify a start time, dynamic travel speeds based on traffic are used where it is available. Verify whether traffic data is available in your region by clicking the See availability link in the tool pane. | The default Rural Driving Time is 5 Minutes. Use traffic is unchecked by default. The rural driving speed will be based on historical and live traffic data. | Durations can be in Seconds, Minutes, or Hours. A duration of up to 5 hours can be used as input. Traffic can be based on live conditions or typical conditions for a specified day of the week and time. |
Trucking Time | Models basic truck travel by preferring designated truck routes and finds solutions that optimize travel time. Routes must obey one-way roads, avoid illegal turns, and so on. The trucking speed will be based on either historical average speeds for automobiles or the posted speed limits for trucks, whichever is smaller. Follows rules applicable to heavy trucks. | The default Trucking Time is 5 Minutes. Use traffic is unchecked by default. The trucking speed will be based on historical and live traffic data but will not exceed the posted trucking speed. | Durations can be in Seconds, Minutes, or Hours. A duration of up to 5 hours can be used as input. Traffic can be based on live conditions or typical conditions for a specified day of the week and time. |
Trucking Distance | Models basic truck travel by preferring designated truck routes and finds solutions that optimize travel distance. Routes must obey one-way roads, avoid illegal turns, and so on. Follows rules applicable to heavy trucks. | The default Trucking Distance is 5 Miles (US Standard setting) or 5 Kilometers (Metric setting). | Distances can be in Miles, Yards, Feet, Kilometers, or Meters. A distance of up to 300 miles (482.80 kilometers) can be used as input. |
The Return bounding areas box is checked by default. The result layer includes the area or areas created using the specified distance measurement around the input layer as well as all of the requested summarized data. When unchecked, the result layer is the same as the input layer but includes the summarized data.
A Count of Points, Total Length, or Total Area box appears depending on the type of features to summarize in your layer. The box is checked by default and can only be unchecked if statistics are being calculated.
Calculation | Input features | Default | Options |
---|---|---|---|
Count of Points | Points | None | None |
Total Length | Lines | Miles (US Standard setting) or Kilometers (Metric setting) |
|
Total Area | Areas | Square Miles (US Standard setting) or Square Kilometers (Metric setting) |
|
There are five options for statistics that you can calculate on numeric fields in the layer to be summarized: sum, minimum, maximum, average, and standard deviation. Each time a Field and Statistic are entered, a new row is added to the tool pane so that more than one statistic can be calculated at once. You can view the summarized data in the result layer's table or pop-ups.
Optionally, you can select a group by field so that statistics are calculated separately for each unique attribute value. When a group by field is selected, the pop-up for each feature in the output layer contains charts showing each summary count or total and statistic by field value. A summary table listing each feature and statistic by group by field value is also created.
The Add minority, majority and Add percentages check boxes are enabled when a group by field is entered. The minority and majority values are the least and most dominant values, respectively, from the group by field, where dominance is determined using the count of points, total length, or total area of each value. When Add minority, majority is checked, two new fields are added to the result layer. The fields list the values from the group by field that are the minority and majority for each result feature. When Add percentages is checked, a new field is added to the result table listing the percentage of the count of points, total length, or total area within each feature that belongs to each value from the group by field. If Add minority, majority is also checked, two additional fields are added to the result layer listing the percentage of the count of points, total length, or total area that belongs to the minority and majority values for each feature.
If Use current map extent is checked, only the features in the input layer and the layer to be summarized that are visible within the current map extent will be analyzed. If unchecked, all features in both the input layer and the layer to be summarized will be analyzed, even if they are outside the current map extent.
Limitations
- The first input layer from which distances are measured has a limit of 1,000 features.
- Travel modes are only enabled when point features are used as the input layer from which distances are measured.
- Travel times cannot exceed 9 hours (540 minutes) when walking or 5 hours (300 minutes) for all other travel times.
- Travel distances cannot exceed 27 miles (43.45 kilometers) when walking or 300 miles (482.8 kilometers) for all other travel distances.
- Lines and areas are summarized using proportions; it is best to summarize absolute data (such as population) rather than relative data (such as average income) when lines or areas are being summarized.
- An error occurs if the tool takes more than 60 minutes to execute when using travel modes. If this error occurs, try rerunning the analysis with fewer input features.
How Summarize Nearby works
The sections below describe the functionality of the Summarize Nearby tool.
Equations
Average and Std Deviation are calculated using weighted mean and weighted standard deviation for line and area features. None of the statistics for point features are weighted. The following table shows the equations used to calculate standard deviation, weighted mean, and weighted standard deviation:
Statistic | Equation | Variables | Features |
---|---|---|---|
Standard Deviation | Points | ||
Weighted Mean | Lines and Areas | ||
Weighted Standard Deviation | Lines and Areas |
Note:
Null values are excluded from all statistical calculations. For example, the mean of 10, 5, and a null value is 7.5 ((10+5)/2).
Points
Point layers are summarized using only the point features within the boundary drawn by the Summarize Nearby tool. If Return bounding areas is checked, the results layer is displayed using graduated symbols.
The figure and table below show the statistical calculations of a point layer. In the hypothetical analysis in the figure, a point layer is being summarized using a line distance from an input point. The Classes field was used to calculate the statistics (Sum, Minimum, Maximum, Average, and Std Deviation) for the layer.
Statistic | Result |
---|---|
Sum |
|
Minimum | Minimum of :
|
Maximum | Maximum of :
|
Average |
|
Std Deviation |
|
A real-life scenario in which you can use this analysis is determining the availability of yoga classes within a two-mile radius of your house. Each point represents a gym or studio that offers yoga classes. The Classes field contains the number of unique classes offered at each point. From the results, you can see that there are seven facilities offering classes within two miles of your house and that the maximum number of unique classes at a single facility is three.
Lines
Line layers are summarized using only the proportions of the line features that are within the boundary drawn by the Summarize Nearby tool. When summarizing lines, use fields with absolute numbers so proportional calculations make logical sense in your analysis. If Return bounding areas is checked, the results layer is displayed using graduated symbols.
The figure and table below show the statistical calculations of a line layer. In the hypothetical analysis in the figure, a line layer is being summarized using a line distance from an input point. The Houses field was used to calculate the statistics (Sum, Minimum, Maximum, Average, and Std Deviation) for the layer. The statistics are calculated using only the proportion of the line that is within the specified distance of the input point.
Statistic | Result |
---|---|
Sum |
|
Minimum | Minimum of :
|
Maximum | Maximum of :
|
Average |
|
Std Deviation |
|
A real-life scenario in which you can use this analysis is determining how many houses are within five miles of a remote forest fire. Each line segment represents a street or lane with houses built along it. The Houses field contains the number of houses on each segment. From the results, you can see that there are seven houses within five miles of the fire.
Areas
Area layers are summarized using only the proportions of the area features that are within the boundary drawn by the Summarize Nearby tool. When summarizing areas, use fields with absolute numbers so proportional calculations make logical sense in your analysis. If Return bounding areas is checked, the results layer is displayed using graduated colors.
The figure and table below show the statistical calculations of an area layer. In the hypothetical analysis in the figure, an area layer is being summarized using a line distance from an input point. The Tables field was used to calculate the statistics (Sum, Minimum, Maximum, Average, and Std Deviation) for the layer. The statistics are calculated using only the proportion of the area that is within the specified distance of the input point.
Statistic | Result |
---|---|
Sum |
|
Minimum | Minimum of:
|
Maximum | Maximum of:
|
Average |
|
Std Deviation |
|
A real-life scenario in which you can use this analysis is determining how many picnic tables there are within a 100-foot radius of different campsites. Each area represents a picnic park and the Tables field contains the number of picnic tables in each park. From the results, you can see that there are seven tables near this specific campsite and the most picnic tables at a single picnic park is three.
Line distance
The Summarize Nearby tool uses a geodesic method when finding features with a straight-line distance, rather than a Euclidean method. Geodesic lines account for the actual shape of the earth (an ellipsoid, or more properly, a geoid). Distances are calculated between two points on a curved surface (the geoid) as opposed to two points on a flat surface (the Cartesian plane).
Travel modes
Service Areas use Dijkstra's algorithm to traverse a network. The service returns a subset of connected edge features that are within a specified travel distance or travel time. Travel areas are created by inputting the edges into a triangulated irregular network (TIN) data structure. The network distance along the lines serves as the height of the locations inside the TIN. Locations not traversed by the service area are put in with a much larger height value. An area-generation routine is used with this TIN to carve out regions encompassing areas in between the specified break values (travel time or travel distance).
The Live traffic option uses the current time as the departure. The traffic speed is predicted for the trip using live speeds, historical speeds, and current events, such as weather.
When typical conditions for a day of the week and time are being used, the travel speeds are based on historical speeds averaged across 5-minute intervals for the entire week. The selected time corresponds to local time in the time zone that your data is in.
When either traffic condition is used, the Summarize Nearby tool takes into consideration the changing traffic conditions based on elapsed time from departure.
Similar tools
Use Summarize Nearby to calculate statistics on point, line, or area features within close proximity of another input layer. Other tools may be useful in solving similar but slightly different problems.
Map Viewer analysis tools
If you have predefined boundaries in which you want to summarize a point, line, or area layer, use the Summarize Within tool.
If you have predefined boundaries in which you want to summarize a point layer and you want to keep only the areas containing one or more points, use the Aggregate Points tool.
ArcGIS Desktop analysis tools
Summarize Nearby is also available in ArcGIS Pro. To run the tool from ArcGIS Pro, your project's active portal must be running Portal for ArcGIS 10.5 or later. You must also sign in to the portal using an account that has privileges to perform standard feature analysis in the portal.