-
Notifications
You must be signed in to change notification settings - Fork 5
/
Copy pathm102-lance-modis-nrt-global-flood.qmd
566 lines (396 loc) · 28.5 KB
/
m102-lance-modis-nrt-global-flood.qmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
---
title: "MODIS NRT Global Flood Product"
author:
- "SCHOOL Development and Open Science Teams"
date: "April 14, 2024"
bibliography: "references/nrt-flood-references.bib"
---
# LANCE MODIS Near Real Time (NRT) Global Flood Product
![MODIS NRT FLOOD](https://www.earthdata.nasa.gov/sites/default/files/imported/Flood_mekong.png)[^1]
[^1]: Photo Credit, NASA OESDIS.
## Overview
In this lesson, you will use R to take a closer look at the data from the **LANCE MODIS Near Real Time (NRT) Global Flood Product**, including learning about what are LANCE and MODIS, and the NRT Flood products available. You will then learn to select, download, and visualize one of the NRT Flood layers available. Lastly, we will use the NRT Flood data to calculate the area of each cropland type found in the flood zone, and plot the top ten results as a bar graph.
::: column-margin
::: {.callout-tip style="color: #5a7a2b;"}
## Coding Review
This lesson uses the [R](https://www.r-project.org/about.html) language and environment. R is a popular language used for statistical computing and graphics.
:::
:::
## Learning Objectives
After completing this lesson, you should be able to:
- Determine what NRT raster data is available by navigating the LANCE website.
- Read a tile map and select a raster tile to download based on a point of interest.
- Download near-real-time raster data using the application programming interfaces (APIs).
- Preview the downloaded raster data.
- Classify and place on a map the NRT flood spatial data to determine areas with unusual flooding.
- Subset data, perform zonal statistics using spatial data, and graph analysis results.
## Introduction
Atmospheric circulation, water evaporation, and their interactions with land surfaces can impact a region’s rainfall variability. For example, California’s winter is correlated with ocean evaporation near the West Coast and eastern North Pacific, and ocean evaporation is a strong factor in increased flooding in the region [@wei2016; @ucmp2022]. Additionally, drought in the region is associated with high-pressure systems off the U.S. West Coast, with studies showing that high-pressure system are linked to the Pacific sea surface temperature anomalies, and exacerbated by high evaporation over land due to high temperatures [@wang2014; @hartmann2015; @seager2015].
It is critical to understand the water cycle and how flooding events develop, particularly as climate change intensifies extreme weather events, the impacts of flooding can be a risk to human life and can disrupt infrastructure, agriculture, and natural habitats.
![](docs/images/FL_Flood_Aerials-8641.jpg)[^2]
[^2]: Photo Credit, California Department of Water Resources.
## What are MODIS and LANCE?
### MODIS
The **Moderate Resolution Imaging Spectroradiometer (MODIS)** is part of a NASA Earth Observing System (EOS). The satellite-based sensor system creates data products including land surface temperatures, land surface reflectance, radiances, clouds, aerosols, water vapor, active fire, snow cover, sea ice measurements, and other factor information . The MODIS NRT data includes the Flood product which we will explore in this lesson. The NRT Flood product is a daily \~250-meter resolution product showing flood and surface water detected from the twice-daily overpass of the MODIS optical sensors.
The satellite data is readily available shortly after it is acquired by the MODIS instrument on board the Terra and Aqua satellites. This space-based instrument distinguishes 36 spectral bands and groups of wavelengths in three different resolutions at 250 m, 500 m, and 1 km, all which helps map the extent of snow and ice caused by winter storms and frigid temperatures [@atmosphe2023]. Initially, a water-detecting algorithm is applied to both MODIS observations (Terra and Aqua), however, cloud and terrain shadows are known to create false positives in the data.
<img src="https://modis.gsfc.nasa.gov/about/images/modisComponents.jpg" alt="MODIS Components"/>[^3]
[^3]: Photo Credit, NASA GSFC.
MODIS is one of the most advanced systems that can produce earth-observing real time satellite data "designed to provide the maximum operational versatility and performance combined with long life. Most operations, including calibration, are performed under internal processor control in response to time-tagged commands from the spacecraft. The electronics are highly redundant, such that no single-point of failure can disable the data from more than one of the 490 MODIS detectors [@frazier2022]."
The adoption of MODIS aimed to surpass barriers related to satellite data, such as cost, delivery timelines, limited formats, and the need for technical expertise. This transition led to the establishment of the Fire Information for Resource Management System (FIRMS) at the United Nations Food and Agriculture Organization (UN FAO), ensuring continuity in meeting NASA data-user needs [@lin2019; @davies2010] .
### LANCE
The **Land, Atmosphere Near real-time Capability for EOS (LANCE)** is a NASA initiative that provides near real-time access to satellite data, including MODIS. It allows users to access the latest data within a few hours of satellite overpass, enabling rapid responses to environmental events such as floods. Systems like LANCE highlight the importance of readily available data that can be easily consumed and analyzed through data analysis and data visualization for timely response. LANCE is particularly valuable for emergency response teams and researchers who require up-to-date information for monitoring and assessing natural disasters [@LANCE2024; @murphy2015].
LANCE reduces processing time, allowing for timely computation. Users access the data through platforms like Web Map Service (WMS) and Web Coverage Service (WCS), enabling visualization and analysis for informed decision-making. This NRT approach enhances the speed and accessibility of critical information on vegetation conditions [@zhang2022].
### MODIS NRT Flood MCDWD Data Products
The **MODIS/Aqua+Terra Global Flood Product L3 Near Real Time (NRT) 250m Global Flood Product (MCDWD_L3_NRT) (Release 1)** provides daily maps of flooding globally. The product is provided over 3 [compositing periods](https://vip.arizona.edu/documents/dataviewer/Compositing.pdf) (1-day, 2-day, and 3-day) to minimize the impact of clouds and more rigorously identify flood water. The best composite will depend on the cloudiness of a particular event [@lin2019].
::: column-margin
::: {.callout-tip style="color: #5a7a2b;"}
## Data Information
The main landing pages for the MODIS NRT Global Flood Product:
[NASA EARTHDATA](https://www.earthdata.nasa.gov/learn/find-data/near-real-time/modis-nrt-global-flood-product)
[NASA CRM SEARCH](https://cmr.earthdata.nasa.gov/search/concepts/C2018599131-LANCEMODIS.html)
[MODIS NRT Global Flood Product User Guide](https://www.earthdata.nasa.gov/s3fs-public/2023-01/MCDWD_UserGuide_RevC.pdf)
:::
:::
The ***MODIS/Terra+Aqua Combined MODIS Water Detection (MCDWD)*** algorithm is tailor-made for detecting water bodies using MODIS data obtained from both the Terra and Aqua satellites. This algorithm employs various bands and spectral information to effectively identify and categorize water bodies. This enhances the accuracy and reliability of the flood product generated [@slayback2023]. To minimize errors, the product is generated with three different ***compositing periods (1-day, 2-day, and 3-day)*** to compare results and decide which product has better coverage for the event. Further, then product differentiates floods from expected surface water through the use of the MODIS Land Water Mask (MOD44W), which uses a decision tree classifier trained with MODIS data to produce a global water mask [@carroll2016].
The MODIS Near Real-Time (NRT) Flood dataset offers multiple products, each accompanied by corresponding layers. The specific layers depend on the temporal aggregation:
***MCDWD_F1_L3_NRT (1-Day product)*** This product type is the most basic level and provides binary information about water occurrence. Pixels are classified as either containing water or not, offering a simple way to identify flooded areas.
***MCDWD_F1CS_L3_NRT (1-Day CS)***: F1CS has a cloud shadow mask applied on the version of the MCDWD_F1_L3_NRT product. A cloud shadow mask identifies areas that are cloud or shadow of cloud and categorizes them separately. This layer is produced because sometimes cloud-shadows can produce false positives in the data product. Thus, the cloud shadow mask can improve some analysis, however, the cloud mask can also produce errors and remove real water [@rayne2024]. See section 3.4.3 in the [MODIS User Guide](https://www.earthdata.nasa.gov/s3fs-public/2023-01/MCDWD_UserGuide_RevC.pdf).
***MCDWD_F2_L3_NRT (2-Day)***: F2 provides additional information by categorizing water occurrences into three categories: no water, low-confidence water, and high-confidence water. This classification allows for a more nuanced understanding of the extent of the flood and its associated confidence levels.
***MCDWD_F3_L3_NRT (3-Day)***: The F3 product, based on the MCDWD (MODIS/Terra+Aqua Combined MODIS Water Detection) is an algorithm that further refines flood mapping by adding additional spectral information. These results create a more accurate representation of water bodies and flooded areas [@slayback2023].
## Exploring the Data
For this exercise, we will be using the MCDWD L3 F3 product: [LANCE NRT Flood](https://nrt3.modaps.eosdis.nasa.gov/archive/allData/61/MCDWD_L3_F3_NRT/)
If in **2i2c**, clear objects from the current workspace:
```{r}
rm(list = ls())
```
First, install and load the R packages required for this exercise:
```{r}
#List the necessary packages and then check if they are installed
packages_to_check <- c("stars", "httr", "jsonlite", "tmap", "basemaps", "sp", "sf", "ggplot2")
# Check and install packages
for (package_name in packages_to_check) {
if (!package_name %in% rownames(installed.packages())) {
install.packages(package_name)
cat(paste("Package", package_name, "installed.\n"))
} else {
cat(paste("Package", package_name, "is already installed.\n"))
}
library(package_name, character.only = TRUE)
}
#in case tmap does not install
#remotes::install_github('r-tmap/tmap')
```
::: column-margin
::: {.callout-tip style="color: #5a7a2b;"}
## Coding Review
This lesson uses the [stars](https://r-spatial.github.io/stars/), [httr](https://httr.r-lib.org/), [jsonlite](https://cran.r-project.org/package=jsonlite), [tmap](https://cran.r-project.org/package=tmap), [basemaps](https://cran.r-project.org/package=basemaps), and [ggplot2](https://ggplot2.tidyverse.org/) packages. If you'd like to learn more about the functions used in this lesson you can use the help guides on their package websites.
:::
:::
#### Step 2, check what days are available for the MCDWD L3 F3 product by going to this link: [LANCE NRT Primary Server](https://nrt3.modaps.eosdis.nasa.gov/archive/allData/61/MCDWD_L3_F3_NRT) or [Secondary](https://nrt4.modaps.eosdis.nasa.gov/archive/allData/61/MCDWD_L3_F3_NRT). Note: you must have a NASA Earthdata Account.
Based on availability, edit the year_day variable YYYY-DD. Example: '2022-01'
```{r}
#add the year and date you want to search for (YYYY-DD, 2022-01)
year_day <- '2024-163'
```
#### Determine tiles of interest:
[MODIS NRT Tile Map](https://www.earthdata.nasa.gov/s3fs-public/2023-01/MCDWD_GlobalTileMapHoriz_Website_865x2250.jpg?VersionId=lOQ_j47U8T.j7UUqibD7SM63FVHjM_V5)
<img src="https://www.earthdata.nasa.gov/s3fs-public/2023-01/MCDWD_GlobalTileMapHoriz_Website_865x2250.jpg?VersionId=lOQ_j47U8T.j7UUqibD7SM63FVHjM_V5" alt="A global map with labeled tiles" width="100%"/>
Based on availability, edit the tile_code variable:
```{r}
#add tile code from the map above (written as h00v00)
tile_code <- 'h05v05'
```
This is the NRT Flood F3 (MCDWD_L3_F3) API URL:
```{r}
# Primary Server
API_URL <- paste0('https://nrt3.modaps.eosdis.nasa.gov/api/v2/content/details?products=MCDWD_L3_F3_NRT&archiveSets=61&temporalRanges=')
# if the primary server is down, the secondary server may be available:
#API_URL <- paste0('https://nrt4.modaps.eosdis.nasa.gov/api/v2/content/details?products=MCDWD_L3_F3_NRT&archiveSets=61&temporalRanges=')
```
We can combine the API URL above with the year_day provided and print the available datasets:
```{r}
#pasting together URL and year_day
url <- paste0(API_URL, year_day)
print(url)
```
## Loading the Data
**NASA Earthdata** requires a NASA username and token. Go here to learn [How to generate a User Token](https://wiki.earthdata.nasa.gov/display/EL/How+to+Generate+a+User+Token).
Once you have generated a User Token, you can access NASA Earthdata using the GET function [@wilsjame]. Replace `USER_TOKEN` with the token you generated:
```{r}
token <- "USER_TOKEN"
```
Combine the token with `"Bearer"` into a single text string.
```{r}
api_key <- paste("Bearer", token)
```
This code is for use on a local computer, not 2i2. Use to protect your token locally.
```{r, echo=FALSE}
## For use on a local computer, not 2i2c
#api_key <- readLines("lance-modis-nrt-key.txt")
## Convert the API key to a string
#api_key <- toString(api_key)
##"Bearer USER_TOKEN"
#api_key <- paste("Bearer", api_key)
```
```{r}
# Define headers (include your token)
headers <- c(Authorization = api_key)
# Make GET request
response_content <- GET(url, add_headers(.headers=headers))
```
Check the response status from the GET function:
```{r}
response_content
```
Out of the response from the server, we'll check if the response was a success with `if (http_status(response)$category == "Success")`. If this statement is true, then the content will be assigned to the variable `data` in JSON format, which is then parsed to a data frame using `data_parsed <- jsonlite::fromJSON(data)`. The data frame contains `data_parsed$content`, a column with content. We filter the content by tile code using the command `content_items <- data_parsed$content[grepl(tile_code, data_parsed$content$name, ignore.case = TRUE), ]` and add the results to a data frame.
```{r}
if (http_status(response_content)$category == "Success") {
# Read the JSON content into a data frame
response_content <- content(response_content, as = "parsed", simplifyVector = TRUE)
response_content <- response_content$content
} else {
print("Request failed with status code", http_status(response_content)$status_code)
}
names(response_content)
```
Search the Data Frame and subset the rows that contain `tile_code` in the `downloadsLink` column.
```{r}
# Subset rows where the downloadsLink column contains the tile code
response_content <- response_content[grepl(tile_code, response_content$downloadsLink), ]
response_content
```
If there is only 1 row, select the string 'downloadsLink' column:
```{r}
response_content <- response_content$downloadsLink
print(response_content)
```
## Read the Data
Use the `read_stars()` function from the `stars` R Library to read the geoTiff raster. The raster is assigned to the `raster_df` variable. This is a web-derived layer that is in the Geographic Coordinate System (GCS) WGS84 - World Geodetic System 1984 EPSG:4326 (EPSG number 4326). We can assign the Coordinate Reference System (CRS) to the stars object:
```{r}
#read the raster data from the download_link
raster_df <- read_stars(response_content)
#Set a CRS for the stars object based on the data documentation
st_crs(raster_df) <- st_crs(4326)
```
### Visualizing NRT Flood Data
Plot the raster to quickly view it. `downsample` allows faster rendering for viewing:
```{r}
plot(raster_df, downsample=30)
```
### Create NRT Flood Plot with Classification
Refer to the [MODIS NRT Global Flood Product User Guide](https://www.earthdata.nasa.gov/s3fs-public/2023-01/MCDWD_UserGuide_RevC.pdf) for more information.
NRT Flood data has 5 classifications:
| Code | Definition |
|:----:|:--------------------|
| 0 | No Water |
| 1 | Surface Water |
| 2 | Recurring flood[^4] |
| 3 | Flood (unusual) |
| 255 | Insufficient data |
[^4]: Value 2 (Recurring flood) is not populated in the beta release.
To view the data in this classification, we’ll need to create a classified legend; however, the NRT Flood data is stored in decimal numbers (known as floating-point). Class breaks allow us to group together data depending on the values. Create classification breaks that divide the data using the breaks listed below and their corresponding colors and labels. Finally, add a title for the plot that includes the year and day, `year_day`, and the `tile_code`:
```{r}
#class_breaks for the table plot categorization
class_breaks <- c( 0, 1, 2, 3, Inf)
#colors for the plot breaks
colors <- c( "gray", "blue", "yellow", "red")
#labels for the plot breaks
labels = c("0: No Water", "1: Surface Water", "2: Recurring flood", "3: Flood (unusual)")
title = paste("Near Real-Time Flood F3", year_day, tile_code)
```
### Generate a basemap from Esri Imagery
The `basemap_stars()` function from the `stars` package allows us to access Esri imagery layers. To generate a basemap that shows the location of our raster, we must first define the bounding box with the `raster_df`. Choose “world_imagery” as our background and assign it to the object `bm_m`. The `st_rgb` function combines the basemap RGB image into a single file. Finally, `st_transform` transforms the CRS into WGS84.
```{r}
#define basemap using a the raster as a bounding box
bm_m <- basemaps::basemap_stars(raster_df, map_service = "esri", map_type = "world_imagery")
#The `st_rgb` function lets us turn the RGB stars item into a single image
bm_m <- st_rgb(bm_m)
#transform the CRS of the basemap to match the flood raster
bm_m <- st_transform(bm_m, 4326)
```
### Plot basemap and NRT Flood data
Generate a plot from the tmap library using the `tm_shape()` function. We will plot the basemap and the raster_df items in one plot.
```{r}
## tmap mode set to "plot"
tmap_mode("plot")
## tmap mode can also be set to "view"
#tmap_mode("view")
#create an object that plots the basemap and the NRT flood raster
#with the tmap library, call the tm_shape() function for the basemap
tm_plot <- tmap_options(max.raster = c(plot = 50000, view = 50000))+
tm_shape(bm_m)+
#the basemap bm_m is treated as a raster
tm_raster()+
#create a new tmap shape for the NRT flood raster with a style as "cat," meaning categorical.
tm_shape(raster_df, style="cat")+
#add the classification styling to the raster including colors, title, class breaks, and labels
tm_raster( palette = c(colors),
title = title,
breaks = class_breaks,
labels = labels )+
#style the plot
tm_layout(legend.outside = TRUE) +
tm_graticules(lines=FALSE)
#view Plot
tm_plot
```
To observe a location more closely, we can create a new bounding box using latitude and longitude values, clip the data, and replot on a map. First, select from the map four corners in degrees that include North/South and East/West information. We use these coordinates to create a matrix of points representing four corners; the four corners are then used to create a bounding box. The corners are first placed in the GCS WGS84 to match our raster.
```{r}
# Define the NWES coordinates
north <- 40 #negative values are used for South
south <- 36 #negative values are used for South
west <- -124 #negative numbers are used for West
east <- -120 #negative numbers are used for West
# Create a bounding box
bbox_subset <- st_bbox(c(xmin = west, ymin = south, xmax = east, ymax = north), crs = 4326)
```
We redo the same process as above to plot the data with a basemap but with a different bounding box:
```{r}
#generate a new basemap
bm_m <- basemaps::basemap_stars(bbox_subset, map_service = "esri", map_type = "world_imagery")
#combine RGB bands of the bsemap
bm_m <- stars::st_rgb(bm_m)
#transform to CRS 4326
bm_m <- sf::st_transform(bm_m, 4326)
```
```{r}
## tmap mode set to "plot"
tmap_mode("plot")
## tmap mode can also be set to "view"
#tmap_mode("view")
#create an object that plots the basemap and the NRT flood raster
#with the tmap library, call the tm_shape() function for the basemap
tm_plot <- tmap_options(max.raster = c(plot = 50000, view = 50000))+
tm_shape(bm_m, raster.downsample=TRUE)+
tm_raster()+
#create a new tmap shape for the NRT flood raster with style as "cat," meaning categorical.
tm_shape(raster_df, style="cat", raster.downsample=TRUE)+
#add the classification styling to the raster
tm_raster( palette = c(colors),
title = title,
breaks = class_breaks,
labels = labels )+
#style the plot
tm_layout(legend.outside = TRUE) +
tm_graticules(lines=FALSE)
#View the plot:
tm_plot
```
## Zonal Statistics of Flood Zones
The NRT flood data can be compared to the Cropland Data Layer (CDL), which provides agricultural categories based on the Farm Service Agency (FSA) Common Land Unit (CLU) Program and is produced by the U.S. Department of Agriculture National Agricultural Statistics Service (NASS)[@nass2023].
To quantify how many acres of each cropland may be affected by the flooding identified in the NRT Flood data, we can explore the dataset by using the [Cropland Collaborative Research Outcomes System (CroplandCROS)](https://croplandcros.scinet.usda.gov/), and we can download the data from the [NASS website](https://www.nass.usda.gov/Research_and_Science/Cropland/Release/index.php)
Download the latest CDL year of data available from the NASS website.
For this part of the exercise, we will use the package `stars`, discussed in the previous lesson, and the function `read_stars()` to read the CDL raster as a stars object.
```{r}
#Read CDL GeoTIFF
cdl<- read_stars("data/2022_30m_cdls.tif")
#plot the raster and downsample by 100 as the resolution of the raster is high
plot(cdl, downsample=100)
```
This raster is provided with a 30-meter resolution for the entire contiguous U.S.! Additionally, the coordinate reference system(CRS) is the North American Datum of 1983 (NAD83) / Albers Conical Equal Area (AEA) EPSG:5070 projection. We can subset a stars objects with other stars objects if they have the same CRS. We will first subset the NRT Flood raster and project the data into the same CRS as the CDL raster. We can subset this raster with the NRT flood raster, but we must first crop it with the bounding box `bbox_subset`we created previously. We will then use the `warp()` function to change the raster from the CRS WGS84 EPSG:4326 to NAD83 EPSG:5070:
```{r}
#subset the NRT Floor raster with the bounding box
raster_df <- raster_df[bbox_subset]
#warp the raster NRT flood raster to match the CDL raster CRS.
raster_df <- st_warp(raster_df, crs=5070, use_gdal=FALSE,method = "near")
#subset CDL raster with the subset NRT flood raster
cdl <- cdl[raster_df]
#because of the high resolution, a stars proxy object was made
#make a stars object from a stars proxy object to directly read the data
cdl <- st_as_stars(cdl)
#plot the resulting CDL subset
plot(cdl, downsample=100)
```
We are only interested in the areas that are showing as category 3, Flooded (unusual); therefore, we want to exclude other variables. We can do this by setting variables that are not 3 as `NA`. Once we have only the flooded areas from the NRT Flood data, we can subset the CDL data.
```{r}
#set variables as NA that do NOT equal 3
raster_df[raster_df != 3] <- NA
#subset the CDL raster where unusual flooding occurred
cdl <- cdl[raster_df==3]
```
The [CDL FAQ Webpage](https://www.nass.usda.gov/Research_and_Science/Cropland/sarsfaqs2.php#common.7) lists the categorization, including non-agricultural categories which we can remove from the analysis:
Non-Agricultural Categories
| Categorization Code | Land cover |
|:-------------------:|:-------------------------|
| "111" | Open water |
| "112" | Perennial Ice/snow |
| "121" | Developed/Open space |
| "122" | Developed/Low Intensity |
| "123" | Developed/Med Intensity |
| "124" | Developed/High Intensity |
| "131" | Barren |
| "141" | Deciduous Forest |
| "142" | Evergreen Forest |
| "143" | Mixed Forest |
| "152" | Shrubland |
| "176" | Grassland/Pasture |
| "190" | Woody Wetlands |
| "195" | Herbaceous Wetlands |
\`
```{r}
#list of non-agricultural Categories
nonag_cat <- list("Open Water", "Perennial Ice/Snow", "Developed/Open Space", "Developed/Low Intensity", "Developed/Med Intensity","Developed/High Intensity", "Barren", "Deciduous Forest", "Evergreen Forest","Mixed Forest","Shrubland","Grassland/Pasture","Woody Wetlands", "Herbaceous Wetlands")
#exclude non-agricultural categories
for (category in nonag_cat){
cdl[cdl == category ] <- NA
}
```
Next, we can create a table out of the remaining CDL raster data, but we first have to extract it from the stars object and convert it to a matrix. The matrix can then be converted to a table and sorted.
```{r}
# Convert the stars object to a matrix
cdl <- as.matrix(cdl)
# Count the number of pixels for each category
cdl <- table(cdl)
#Sort the `cdl_counts` table by descending order.
cdl<- sort(cdl, decreasing = TRUE)
```
Select the top 10 CDL categories counted.
```{r}
top_10 <- cdl[1:10]
#print top 10
print(top_10)
```
Turn the top 10 table into a dataframe. This will allow us to more easily handle the dataframe table and data.
```{r}
dataplot <- as.data.frame(top_10)
```
According to the NASS CDL data documentation, each of the CDL's pixels represents 30 square meters. Additionally, a hectare is 10,000 meters. We can calculate the hectares of each CDL category by first multiplying the count of each category by 30 and dividing by 10,000.
```{r}
# Calculate the area for each row
dataplot$area <- dataplot$Freq * 30 / 10000
```
Once the area is calculated, we can plot a bar graph to compare the top 10 CDL Categories found in the NRT Flood zone.
```{r}
#create new title for plot
title = paste("Hectares of NASS CDL Cropland Category under flood, \nday", year_day, "in tile", tile_code)
# Find the max value in the table for a dynamic axis plot
max_value <- max(dataplot$area) +3
# use ggplot to plot the dataplot table with area
# in x axis and category name in the y axis
plt <- ggplot(dataplot, aes(x = area, y = cdl)) +
geom_bar(stat = "identity", fill = "skyblue") + # adjust the height and color
geom_vline(xintercept = seq(0, max_value, by = 1),
color = "gray", linetype = "dashed") + #create guide lines
labs(title = title, x = "Hectares (ha)", y = "Categories") + #add labels to plot
theme_minimal() + #set theme
theme(axis.line = element_line(size = 1), # Adjust size of axis lines
axis.ticks = element_line(size = 1),# Adjust size of axis ticks
panel.grid.major = element_blank(), # Remove major gridlines
panel.grid.minor = element_blank(),# Remove minor gridlines
plot.background = element_rect(fill = "white"),# Set plot background color
panel.background = element_rect(fill = "white"),# Set panel background color
panel.border = element_rect(color = "black", fill = NA)) + # Set panel border
geom_text(aes(label = round(area,1)), color = "black") +
xlim(0, max_value) # Adjust x-axis limits # Adjust size of text labels
plt
```
The graph above shows the hectares (ha) of land that were calculated using the NRT Flood data and the CDL cropland data within the bounding box we generated. We did this by selecting the area with abounding box, then subsetting the CDL data to areas that are labeled as "Unusually Flooded" by the NRT Flood data on a particular day, and counting the number of pixels by CDL category. With the count of pixels and the resolution of the CDL dataset, we calculated the total area flooded for each category in ha.
Congratulations! We were able to download and explore NASA NRT Flood data to pick an area of interest in the contiguous U.S. and determine how much cropland may be affect by floods. Now you should be able to:
- Navigate the LANCE data website and determine what data is available.
- Select a tile and date to download NRT data.
- Create a GET HTTP request to download near-real-time data.
- Plot on a map and classify raster data to determine areas with unusual flooding.
- Use NRT Flood data to subset cropland classification data, perform zonal statistics, and graph results.
## Lesson 3
In this lesson, we explored the LANCE MODIS Near-Rear-Time (NRT) Flood dataset. In our next lesson, we will think of water at the local level and focus on New York State School Drinking Water data.
[Lesson 3: New York State School Water Quality: Exposure to Lead](https://topstschool.shinyapps.io/nyc-lead/){.btn .btn-primary .btn role="button"}