THE EMERGENCE OF WEATHER-RELATED TEST BEDS LINKING RESEARCH AND FORECASTING OPERATIONS [Bulletin of the American Meteorological Society]
| By Weiss, Steven | |
| Proquest LLC |
Test beds have become an integral part of the weather enterprise, bridging research and forecast services by transitioning innovative tools and tested methods that impact forecasts and forecast users.
Over roughly the last decade, a variety of "test beds" have come into existence focused on high-impact weather and the core tools of meteorology-observations, models, and fundamental understanding of the underlying physical processes. They have entered the proverbial "valley of death" between research and forecast operations (NAS 2000), and have survived. This paper provides a brief background on how this happened; summarizes test bed origins, methods, and selected accomplishments; and provides a perspective on the future of test beds in our field. Dabbert et al. (2005) provides a useful description of test beds from early in their development and Fig. 1 summarizes the role of test beds.
Many trace their origins to the U.S. Weather Research Program (USWRP)'s goals of linking weather research and forecasting operations more effectively. Although USWRP leadership initially envisioned that the associated gaps in capabilities and funding could be filled through major new federal appropriations (on the order of
Test bed accomplishments cover a wide range of applications and techniques, from new scientific understanding to better modeling and predictive tools, greater awareness of how weather information is used, and improved outcomes for society. These are achieved through a diverse set of technical and organizational approaches that have emerged organically to meet the needs represented by individual gaps in existing predictive or scientific capabilities. In spite of this diversity in approaches, there are some interesting symmetries between test beds. They often include a core research laboratory upon which scientific staffand tools can be leveraged and administrative infrastructure used. There is usually a specific
As it became clear, by 2008, that several test beds had been created and were producing important results (publications, demonstrations of new tools/ methods, transitions into forecasting operations, etc.), it was decided to hold a "
The test bed summaries herein were prepared by their respective leadership and are presented roughly in the order each was created (Table 1 and Table ES1 of the online supplement contain a listing and brief descriptions of each test bed). Each section includes information regarding the primary focus, objectives, tools used, organizational approach, selected accomplishments, and links to further information. The report concludes with a brief synopsis and description of potential future directions.
JOINT HURRICANE TESTBEDS (JHT). The USWRP formed the JHT in late 2000 in response to the need articulated by the
The JHT is located at the
A biennial Announcement of Opportunity (AO) inviting projects is the initiation for JHT proposaldriven transitions, which includes the program objectives and priorities, and contains a list of NHC,
After 1-2 years of testing, the conclusion of a JHT project is followed by the submission of a final JHT report to NHC's director and/or other operational center(s) if applicable. This report comes from the JHT staffand is based on its evaluation and input from the project scientist(s) and NHC's points of contact. NHC's director makes the decision on whether to begin the process of operational implementation of the techniques resulting from the project-decisions on model changes are made at
Since the JHT's inception, NHC and other operational centers (e.g., CPHC and JTWC) have interacted with scientists on 74 projects, with over half of them implemented into operations. Rappaport et al. (2012) examined the first 10 years of the JHT, its impact on operations, and JHT's contributions to NHC's forecast operations. One project of note developed a way to describe the probability of tropical cyclone wind speed thresholds DeMaria et al. 2009), which is now a routine operational product (Fig. 3 shows an example of two hurricanes). Improvements in tropical cyclone monitoring and prediction in recent years can be credited to the successful implementation of JHT projects.
NOAA'S HYDROMETEOROLOGY TESTBED (HMT). Extreme precipitation and the related hydrometeorological "forcings" that contribute to flooding, such as soil moisture and snowpack, are the focus of HMT (Ralph et al. 2005). Flooding has triggered more presidential disaster declarations than any other single natural hazard and has contributed on average to more than
In spite of its crucial role in both extreme and dayto- day events, quantitative precipitation forecasting (QPF) has remained one of the great challenges in meteorology, especially for extreme events (e.g.,
To address these gaps, HMT conducts research on precipitation and weather conditions that can lead to flooding, fosters transition of scientific advances and new tools into forecasting operations, and supports the broad needs for twentyfirst- century precipitation information for flood control, water management, and other applications. Guided by NWS operational requirements, emerging scientific questions, and new technologies, HMT directly engages forecasters and scientists in research and development. New ideas, technologies, and predictive models are developed, demonstrated, evaluated, and refined through the test bed before being transitioned to operations. HMT will provide prototypes for state-of-the-art forcings for hydrologic prediction systems at NOAA's National Water Center.
A key driver of HMT was the desire expressed by the NWS forecast community and NOAA stakeholders for more continuous engagement with researchers following two field experiments-"CalJet" and "PacJet"-associated with extreme precipitation in
HMT is led by ESRL/PSD, the core sponsor, and includes the following key partners: ESRL/Global System Division (GSD); NCEP/HPC;
Extreme precipitation and flooding have diverse origins meteorologically and vary greatly by region, from land-falling extratropical cyclones on the
In summary, HMT-West has fostered innovative research to improve understanding, monitoring, and prediction of extreme precipitation (evidenced by >60 peer-reviewed publications), and is now active in several regions outside of
JOINT CENTER FOR SATELLITE DATA ASSIMILATION (JCSDA) . The JCSDA was established in 2001 to improve and accelerate the use of research and operational satellite data in numerical weather, ocean, climate, and environmental analysis and prediction. NOAA and
The day-to-day activities of the JCSDA are managed by an executive team composed of the director, the deputy director, and associate directors representing all the JCSDA partner agencies [NOAA/ NWS,
The JCSDA supports scientific development work in priority areas including radiative transfer, clouds and precipitation, advanced instruments, land data assimilation, ocean data assimi lat ion, and atmospheric chemistry and aerosols (Fig. 6). Examples of success include advances in formulating the Community Radiative Transfer Model (CRTM), assessing the impact of assimilation of Advanced Infrared Sounder (AIRS) and Moderate Resolution Imaging Spectrometer (MODIS) data, and provision of AIRS data to operational centers worldwide after the data have been "thinned" appropriately (
The JCSDA research and preoperational implementation experiments are conducted by JCSDA-affiliated scientists with proposal-based funds (internal research) or through external grants and contracts awarded via a competitive process open to the broader scientific community (external research). There are also core projects that are regulated by an agreement between the funding agency and the project principal investigators (directed research). In addition, the JCSDA partners conduct their own internal projects, some of which are directly related to the JCSDA activities. These projects are considered by the JCSDA as in-kind support of JCSDA objectives.
JCSDA activities center on improving the assimilation of satellite data from research and operational sensors on national and foreign satellites and leveraging the efforts of all JCSDA partners. All kinds of satellite data are considered: direct measurements of radiances and brightness temperatures and derived products; observations from both polar-orbiting and geostationary satellites; measurements of instruments sensing in the ultraviolet, visible, infrared, and microwave spectral regions; and data from passive and active sensors, including radio occultation measurements. Recent achievements are listed in Table 3.
The JCSDA organizes annual scientific workshops on satellite data assimilation that are crucial for the technical coordination of the efforts between the different JCSDA partners. It also organizes a data assimilation summer colloquium, every 2-3 years, engaging graduate students and researchers with early postdoctoral appointments in the science of satellite data assimilation for the atmosphere, land, and oceans. The program includes lectures by international experts in data assimilation, and allows students to interact with the lecturers in an informal setting. The objective of the program is to foster the development of the next generation of data assimilation scientists to support environmental modeling. The JCSDA also publishes a quarterly newsletter highlighting recent research and implementation accomplishments, and conducts a monthly seminar series that is webcast nationally and internationally.
HAZARDOUS WEATHER TESTBED (HWT). The HWT has its roots in a culture of collaboration established decades ago among severe weather enthusiasts with a commitment to excellence in both forecasting and research. This collaboration can be traced back to the 1950s when forecasters from the Severe Local Storms Warning Service (SELS) and research scientists with the
Proximity and passion for severe weather were key ingredients in these partnerships. One element of the collaboration revolved around development and field testing of Doppler weather radar and dual polarization improvements (Scharfenberg et al. 2005). NSSL researchers made significant efforts to transition this science and technology to forecasting operations. Specifically, they engaged in month-long visits to more than a dozen WFOs nationwide to provide training, solicit direct feedback from a wide variety of operational forecasters, and facilitate operational implementation (e.g., Lakshmanan et al. 2007). The
When the blueprint for NCEP was presented in the early 1990s (
Concentrating on these themes, the first "spring program" was conducted in the spring of 2000 and became the basis for similar initiatives each spring thereafter. The focus on springtime ensured that compelling real-time convective weather forecasts would be presented nearly every day. The experiments were designed to challenge both model developers and forecasters. About half of each day was devoted to preparing and issuing severe weather forecasts and the other half on critical interrogation of experimental numerical-model guidance. Activities were conducted by small groups containing at least one representative from forecast operations and one model developer or researcher, allowing model developers to gain a broader understanding of how frontline forecasters use model output and the forecasters to develop insight that helped dramatically with interpretation of model guidance for severe weather. The process laid the foundation for new long-term working relationships.
This paradigm-challenging forecasters and researchers to work side by side in small groups to tackle difficult meteorological problems in real time-proved to be very effective (Fig. 7). It galvanized collaborative activities in the
The original HWT framework included two programs: 1) the Experimental Forecast Program (EFP), anchored by SPC-related forecasting research (Kain et al. 2006); and 2) the Experimental Warning Program (EWP), focusing on the development and testing of new science, applications, and remote sensing tools to assist the short-term (0-2 hours) nowcasting and warning decision-making process. In recent years the
SHORT-TERM PREDICTION RESEARCH AND TRANSITION (SPoRT). The SPoRT program transitions unique
Since its establishment, SPoRT has expanded its collaborations to WFOs in all six NWS regions and to several national centers. SPoRT focuses on problems such as the timing and location of severe weather; changing weather conditions influenced by terrain and other local features; reduced surface visibility due to smoke, fog, and low clouds; predicting weather variations due to land-sea breeze circulations; and monitoring weather conditions in data-void regions. SPoRT involves forecasters in the entire process- matching forecast problems to data and research capabilities, testing solutions in a quasi-operational environment, and then transitioning proven solutions into the forecaster's decision support system. SPoRT also develops product training and involves forecasters in the assessment of the utility of the products on the relevant forecast challenges. The suite of SPoRT and collaborative partner products transitioned to the operational weather community is presented in the online supplement (Tables ES2).
A suite of real-time high-resolution MODIS imagery has been successfully used to improve situational awareness for a variety of nowcasting applications. A notable impact on hydrologic forecasting in the upper plains states has been documented by Loss et al. (2009). Atmospheric information from AIRS has been assimilated into weather forecast models and shown to improve the initial conditions and subsequent forecasts of sensible weather elements with the Weather and Research Forecasting (WRF) model (Zavodsky et al. 2012; Chou et al. 2009; McCarty et al. 2009;
SPoRT scientists work collaboratively on forecast problems and product transitions with several other NOAA test beds. A high-resolution enhanced MODIS/Advanced Microwave Scanning Radiometer for EOS (AMSR-E) sea surface temperature (SST) composite product (e.g., Jedlovec et al. 2009; Haines et al. 2007), land surface information from the
SPoRT is extending its transition activities to include new satellite observations integrated into advanced decision support systems in WFOs around the country over the next few years. Data from the Visible/Infrared Imager Radiometer Suite (VIIRS) imaging and Cross-Track Infrared Sounding (CrIS) sounding instruments on the Joint Polar Satellite System (JPSS) will provide follow-on capabilities to those of the NASA MODIS and AIRS instruments. The existing and new data streams from
DEVELOPMENTAL TESTBED CENTER (DTC). The mission of the DTC is to facilitate research-to-operations (R2O) transition in numerical weather prediction (Bernardet et al. 2008). To accomplish this objective, the DTC supports operational systems, performs testing and evaluation of promising NWP techniques, organizes workshops on important NWP areas, and hosts a DTC visitor program. The DTC was officially established in
Mesoscale modeling (MM) . The MM team has focused on testing and evaluation of potential R2O code transitions. In addition to direct model-tomodel intercomparisons, the MM team has provided baseline configuration results to the NWP community (both operational and research) as designated WRF reference configurations (www.dtcenter.org /config). These carefully controlled, rigorous tests and accompanying verification statistics provide the research community with baselines against which the impacts of new techniques can be evaluated and the operational community guidance for selecting configurations with potential value for operational implementation. In addition, the MM team has helped NOAA's
Hurricanes. The focus of the hurricane team is the transfer of new research and development to operations to improve tropical cyclone NWP. The work currently focuses on the
Data assimilation (DA). The DA team bridges the data assimilation research and operational communities by providing the current operational Gridpoint Statistical Interpolation (GSI) capability to researchers [operations to research (O2R)] by enabling the research community to contribute to operational GSI development (R2O), and by facilitating collaboration between distributed GSI developers through the GSI review committee and the community GSI repository. The DA team provides the research community with an annual GSI release containing the latest GSI capabilities, as well as updated documentation. In addition, the DA team actively works with community researchers to help them merge their new DA innovations with GSI software and provides assistance with the process of committing innovations to the GSI repository. Significant R2O activities have included the assimilation of surface observations (air pollutants with diameter of 2.5 mm or less) for the Community Multiscale Air Quality (CMAQ) regional model and the WRF with Chemistry (WRF-Chem) model, the addition of control and state variables for cloud analysis, and GSI enhancements for Rapid Refresh model applications
Ensembles. The DTC Ensembles Team (DET) brings the latest ensemble developments from the community into operations. These developments often come from experimental real-time ensemble forecast systems. Because they are usually run at a horizontal resolution higher than those available to operations, evaluation of these systems provides an opportunity to influence future operational ensembles. To build on this opportunity for enhanced R2O potential, the DET collaborates with the
Verification. Statistical verification of numerical forecasts is beneficial to both forecasters and end users because it can supply objective data about the quality or accuracy of their forecasts. These findings can feed back into decision processes, including those involved with R2O decisions about model elements to be transitioned to operations. Furthermore, routine, continuing verification of operational observations, models, analyses, and forecasts helps NOAA meet its obligations for information quality under the Information Quality Act. The DTC verification team primarily develops, tests, and demonstrates tools and methods for verification, including the Model Evaluation Tools (MET) (www.dtcenter.org/met /users/). Although the primary application for MET is the WRF model, the tools can also be applied to most other forecast models. In addition to providing MET to the community, the software package has become instrumental in collaborative efforts between the DTC and other test beds, including HMT, HWT, and HFIP (e.g., the development of atmospheric river- focused verification methods with HMT that have been implemented in MET). Most recently, focus has been on implementation of new tools and methods for verification of hurricane forecasts.
CLIMATE TEST BED (CTB). NOAA's NWS/ NCEP is the lead agency with responsibility for improving our nation's operational climate predictions on time scales from weeks to years. These predictions enhance our collective ability to understand and predict the state and evolution of the climate system, including linkages between climate and weather (including extremes) on all time scales. In 2004, NCEP and the OAR/Climate Program Office (CPO) jointly established a Climate Test bed facility. The mission of the CTB is to accelerate the transition of research and development into improved NOAA operational climate forecasts, products, and applications. The CTB objectives are
* to accelerate implementation of advances in model improvements, multimodel techniques, forecaster tools, datasets, and observing systems into NOAA climate forecast operations;
* to provide the climate research community with access to operational models, forecast tools, and datasets to enable collaborative research that accelerates additional improvements of NOAA climate forecast products; and
* to develop new and improved operational climate forecast products for use in planning and decision making.
The CTB facility is located at NCEP/Climate Prediction Center (CPC) in
The CTB has made significant progress toward its objectives and major contributions to the NCEP operational forecasts and products, including a multimodel ensemble (MME) climate prediction system, improvements to the Climate Forecast System (CFS), and development of climate forecast products.
MME climate prediction system. The CTB and the broader community have done extensive experimental multimodel prediction research and provided evidence that MME prediction approach yields superior forecasts compared to any single model. CTB developed a prototype the MME prediction system as a proof of concept to demonstrate the potential benefits of a MME system using a NCAR model and NCEP CFS (Kirtman and Min 2009; Paolino et al. 2012). CTB scientists also explored recalibration and consolidation methodologies in multimodel ensembling (Tippett et al. 2008; DelSole and Tippett 2008).
In 2011, CTB organized a team effort to develop a national multimodel ensemble (NMME) strategy (Kirtman 2011) and implemented the experimental NMME prediction system to produce real-time forecasts for the CPC operational monthly/seasonal forecasts. The current NMME system contributors include
NCEP Climate Forecast System improvements. The CTB strategy to improve CFS involves joint team efforts with participation from the external community and NCEP scientists and to use the NCEP operational model as a research tool. For example, scientists from NOAA/ESRL and NCEP identified polar vortex issues and improved the troposphere-stratosphere coupling in the current version, CFSv2 (
CTB has made progress improving two-way communication between NCEP and the external community. The CFSv3 planning workshop provided a more cooperative, multilateral environment for identifying the needs for CFS improvement and future development strategies. CTB is currently working with NCEP and the external community to develop a NCEP climate modeling strategy.
Climate forecast products. To improve the skill of NCEP operational climate forecasts and thus the quality of climate forecasts, CTB works with the user community to improve access to and understanding of climate forecast products. A CTB team from CPC and the Regional Integrated Sciences and Assessments/Climate Assessment for the Southwest (
CTB also funded focused research to develop and improve drought monitoring and prediction products in support of the National Integrated Drought Information System. A CTB team with scientists from the
In the future, CTB will continue to focus on transition of research to NCEP climate operations and enhancing collaborations between NCEP, other test beds, and the external community. CTB will continue to improve the NMME capability and facilitate the planning and implementation of the NCEP climate modeling strategy. CTB will work directly with the
GOES-R PROVING GROUND .
Prototypes of the future GOES-R capabilities can be emulated from current satellite and terrestrial observing systems having higher spatial, spectral, or temporal resolution than the current operational GOES imager, or through synthetic cloud and moisture imagery that can be derived from weather forecast models such as the WRF model. Products being demonstrated in the Proving Ground include (Fig. 10) improved volcanic ash detection, lightning detection, 1-min-interval rapid-scan imagery, dust and aerosol detection, and synthetic cloud and moisture imagery (Grasso et al. 2008; Otkin and Greenwald 2008). These new or enhanced product capabilities will be made possible by the ABI, a 16-channel imager with two visible channels, 4 near-infrared channels, and 10 infrared channels that will provide three times more spectral information, four times the spatial coverage, and an increase in temporal resolution that is more than five times the current imager (Schmit et al. 2005). Other advancements over current GOES capabilities include total lightning detection and mapping of in-cloud and cloud-to-ground f lashes never before available to forecasters from the GLM (
A key component of the
The next-generation GOES will continue providing valuable data to support high-impact weather warnings as well as key inputs for global and regional NWP models. The large quantities of GOES-R data will present new challenges and opportunities that require more intelligent integration of information derived from blended satellite products (e.g., geostationary and polar satellite observations); multidimensional classification of severe storm potential by combining satellite, radar, in situ data, and models; and new ways of visualizing GOES-R data within the AWIPS-II forecaster workstation. Algorithm developers at NESDIS, NASA SPoRT, and the
AVIATION WEATHER TESTBED (AWT). The AWT, located at the Aviation Weather Center (AWC) in
Prior to the AWT's reorganization in 2009, the AWT existed primarily to transfer research concepts from the Aviation Weather Research Program into AWC operations, and was composed of a small area on the AWC forecast floor. Now, the AWT is housed in a new state-of-the-art room (completed in 2010) with computer workstations that replicate the operational workstations used by AWC meteorologists, as well as advanced video teleconferencing capability that allows for broadcasting output from one workstation to one of several large overhead flat-panel monitors (Fig. 11). This room was designed to foster maximum interaction between teams located at different areas, so evaluations could be achieved in a team-oriented environment. The test bed reorganization also launched new collaborations between the AWC and other research groups, such as NCAR, AFWA, ESRL/GSD, GOES-R satellite program,
The AWT was used extensively during the 2011 Summer Experiment from 27 June to
Numerous new and existing datasets were tested during the experiment and each were used to create the graphics, as already noted. High-resolution ensemble and deterministic numerical weather prediction models were tested for their ability to correctly resolve the timing, location, morphology, mode, and porosity of convection. The deterministic 3-km High Resolution Rapid Refresh (HRRR), and the Consolidated Storm Prediction for Aviation (Wolfson et al. 2008), along with a 4-km 12-member AFWA ensemble model and NCEP's SREF system, were used in combination with derived air-traffic-impact forecasts from NCAR to determine the forecast graphics. In addition, the GOES-R program supplied the "nearcast" forecasts (
Beyond the planned annual summer experiment, the test bed is also evaluating new interactive weather data display software-AWIPS-II is the next-generation data display system for the NWS. Also, the Interactive Calibration of Grids in Four Dimensions (IC4D; Petrescu and Hall 2009) software, an extension of the Graphical Forecast Editor in AWIPS, is undergoing evaluation by the AWC forecast staffwithin the AWT. The IC4D system can be used to combine observations, model data, and algorithms to create a gridded forecast-a concept for the "4-D Weather Cube" envisioned by
Many new concepts for the future forecast process and support of
OBSERVING SYSTEM SIMULATION EXPERIMENT TESTBED (OSSE). The most recent test bed effort is the Observing System Simulation Experiment Testbed. OSSEs are an important tool for evaluating the potential impact of proposed new observing systems, as well as for evaluating trade-offs in observing system design, and in developing and assessing improved methodology for assimilating new observations on numerical weather prediction (Atlas 1997). The test bed development is being led and managed through NOAA/ AOML for use by USWRP partners and academia in collaboration with NESDIS/
* determine the potential impact of proposed spacebased, suborbital, and in situ observing systems on analyses and forecasts;
* evaluate trade-offs in observing system design;
* assess proposed methodology for assimilating new observations in coordination with JCSDA; and
* define both the advantages and limitations of a hierarchy of OSSEs that includes rapid prototyping of instrument or data assimilation concepts, as well as the more rigorous "full" OSSEs.
Although only started in 2010 through seed funding by NOAA USWRP, the OSSE test bed has had several key accomplishments: provided expertise on OSSEs to NOAA and JCSDA partners and academia, and evaluated the global OSSE system and the experiments being performed; finalized regional OSSE nature runs at 3- and 1-km resolution, which required an exhaustive number of iterations of the WRF model embedded within an ECMWF global nature run; confirmed the validity (strong points and weaknesses) of both the 3- and 1-km nature runs over a 13-day period; completed the first phase of a global OSSE for the Unmanned AircraftSystem (UAS) and completed a report and one refereed article from this OSSE; and established an external advisory committee for the OSSE test bed.
During the next several years, test bed activities include a survey across NOAA line offices to take stock of existing Observing System Experiment (OSE) and OSSE capabilities. This will include capturing the capabilities and expertise of each organization and the ability of each organization to perform and/or analyze experiments. Through the
CONCLUSIONS AND FUTURE DIRECTIONS. Test beds have become an integral part of the meteorological community. They have helped foster new forecast innovations and their transition into operations. These developments have powered opportunities for businesses and agencies to improve their products and services. Along the way, a community of subject matter experts has been created that have in-depth experience with bridging research and operations. Not surprisingly, as key forecast challenges and gaps are identified, new regionally focused test bed ideas have been proposed. Lining up support, connecting key research and NWS center "champions," establishing other-agency partners, and identifying resources are all part of developing new test bed concepts.
A major risk for test beds is based on their inherent nature as a "bridging" entity. In other words, they tend to be "outsiders" relative to either the core mission of forecasting or the core mission of research. In spite of this, they enable more rapid improvements in forecast services and demonstrate tangible relevance of research centers to forecast services while not being entirely beholden to them.
For NWS, implementation into operations to meet service requirements includes successful demonstration of key criteria (defined for the specific model/ phenomena/capability), such as objective performance (e.g., model accuracy or sensor accuracy), subjective performance (e.g., utility of capability and impact on workflow/workforce), and production readiness (analogous to technology performance measures, but includes necessary IT infrastructure and backups, maintenance procedures, archiving, and in-place verification approach to ensure timely and reliable operational production). These are demonstrated in proving grounds; in some cases test beds also perform these functions-for example, for tools that are implemented directly in NHC systems, JHT can perform this function. Given that the level of effort to carry out these "transition oriented" steps could rapidly consume test bed investments in innovation and demonstration at stages prior to transition, it is vital that management and oversight for these key steps are primarily the responsibility of the operations, rather than the research, organization. The sidebar "GPRA measures" describes issues and perspectives on measuring performance of test beds and forecasting. Possible approaches for measuring performance that are adoptable by test beds and forecast centers include
* internal measures suitable for state-of-the-art science and technology development (i.e., measure the innovation that underpins future breakthrough advances-the S&T "push");
* "infusion"-oriented measures, including test bed demonstration performance measures (DPM);
* internal measures in "forecast service" programs tracking implementation of infusion (i.e., measure the services' "pull" for S&T);
* internal measures tracking the rate at which innovation is assimilated into forecast operations and the rate at which outdated forecast tools are discontinued; and
* use of technical readiness levels to help define the status of key transition activities.
Carrying this out requires adequate capacity and investment in the test beds and a commitment from forecast centers and laboratories. The recent creation by NWS of the "
Carrying this out requires adequate capacity and investment in the test beds and a commitment from forecast centers and laboratories. The recent creation by NWS of the "
In closing, test beds have become an integral part of the weather enterprise. They have developed, tested, and transitioned innovative tools and methods that are impacting forecasts and forecast users. A key direction is to identify commonalities in major gaps identified across multiple test beds (i.e., observations, modeling, and physical understanding) and coordinate requests for agencies to fill these gaps. The need to bridge research and forecast services represents a grand challenge to meteorology-a challenge that test beds have emerged over the last 10 years to address.
ACKNOWLEDGMENTS. The U.S. Weather Research Program in
DEFINITION AND OBJECTIVES FOR NOAA TEST BEDS AND PROVING GROUNDS
TEST BEDS
i) D efinition and purpose: A NOAA test bed is a working relationship for developmental testing in a quasi-operational framework among researchers and operational scientists/experts (such as measurement specialists, forecasters, and IT specialists) including partners in academia, the private sector, and government agencies, aimed at solving operational problems or enhancing operations in the context of user needs. A successful test bed involves physical assets as well as substantial commitments and partnerships.
ii) What is tested: Advances to be considered include candidates for more effective observing systems, better use of data in forecasts, improved forecast models, and applications for improved services and information with demonstrated economic/public safety benefits.
iii) Objectives: Test beds accelerate the translation of research and development (R&D) findings into better operations, services, and decision making. Outcomes from a test bed are capabilities that have been shown to work with operational systems and could include more effective observing systems, better use of data in forecasts, improved forecast models, and applications for improved services and information with demonstrated economic/public safety benefits. Successfully demonstrated test bed capabilities are ready for advanced predeployment testing, in a full simulation of real-time operational conditions, leading to "go/no go" deployment decisions.
OPERATIONS AND SERVICES PROVING GROUNDS
i) Definition and purpose: Operations and services proving grounds are a framework for NOAA /NWS to conduct testing of advanced operations, services, and science and technology capabilities that address the needs of both internal and external users. Successful testing demonstrates readiness to implement into operations.
ii) What is tested: Capabilities to be tested in operational proving grounds have already passed developmental testing. Such capabilities include advanced observing systems, better use of data in forecasts, improved forecast models, and applications for improved services and information with demonstrated economic/public safety benefits.
iii) Objectives: Testing in real time, in an operations-like setting to demonstrate achievement of performance metrics, including testing any workflow changes, needed for implementing in operations as well as end-to-end delivery of services. Performance metrics are defined for each candidate capability in categories of objective performance (e.g., accuracy/skill), subjective evaluations of utility (e.g., user feedback on balance positive), and production/engineering readiness (e.g., systems and communications reliability/security/backup, data retention). Performance criteria for objective and subjective evaluations by users internal to NWS include expected impacts to workflow and workload, except when advanced capabilities have no impact on workflow/workload (e.g., in the case of improvements to numeric quality of current operational guidance and tools). Successful predeployment testing is necessary for approval to implement into operations. (Excerpted from
A FRAMEWORK FOR PERFORMANCE MEASURES FOR TEST BEDS
With the advent of the Government Performance Requirements Act (GPRA ), agencies are held highly accountable for performance. For NOAA , several of its "GPRA measures" represent forecasting skill (e.g., hurricane track forecast error, flash flood warning lead time, quantitative precipitation forecast skill, and tornado warning lead time). These measures have become a major focus of current forecasting and their improvements that represent the "requirements pull" of today's services. They are calculated by NOAA /NWS and NOAA reports them to the
While quite useful, these GPRA measures are difficult to change, and it is difficult to add new ones, even when well justified by forecast user needs. Understandably, it is risky for NOAA to promise too rapid an improvement in these challenging forecast topics. This inhibits setting ambitious goals that can drive innovation in the research community. Analogously, the science and technology communities have well-established measures of research and development performance (e.g., publications, citations, patents). Such measures tend not to reward focusing on the implementation of the new findings beyond the research community, thus inhibiting efforts to "take the next step" beyond publications and grants (NAS 2000). While NOAA laboratories help fill some of this gap, the differences between the standard measures used for science and those used for forecasting represent part of the divide between research and operations.
Several constraints have inhibited progress both in innovation and in transition to daily forecast operations. Here are some key examples:
* science and technology (S&T) advances are a foundation of NOAA 's service improvements, yet are often not initially measurable in the "service" GPRA scores;
* improving the service GPRA scores requires service programs to adopt new methods, yet this may have a cost and require services to let go of existing methods; and
* while research suggests fast improvements in GPRA scores may be possible, operational goals must be reasonably achievable or the risk of "failure" is increased.</p>
Because the GPRA measures focus on products issued by NWS, and improvements in these products are often the result of a combination of many inseparable individual advances, a traceable connection between specific S&T advances and formal NWS service improvements is often not very tangible. This creates an underlying issue for the research community and for related test beds-that is, how to measure research and test bed performance in ways that reasonably represent both the underlying advances needed in S&T to enable transformative improvement in forecast services, as well as the near-term incremental improvements that typically build on existing operational tools.
Test beds have the potential to help by developing and monitoring what could be called DPM s, which would be used internally to the agency and test bed. These could be "stretch" versions of current measures (i.e., faster rate of improvement) or entirely new measures that address major societal needs [e.g., rapid hurricane intensity change; QPF for extreme precipitation; river flood warning lead time; snow-level aloft(White et al. 2010)]. The concept, illustrated in Fig. SB1, conveys the following:
* goals for GPRA -like DPM scores can be set higher in test beds than in full operations;
* adoption of new methods for full operations requires proof of concept;
* proof of concept can be demonstrated by limiting tests to small areas, times, and tools;
* by limiting the scope of tests, the costs can be kept within reasonable bounds;
* researchers and forecasters jointly define strategies to demonstrate impacts on the suitable DPM goal during the tests; and
* if regional testing demonstrates improvement, extend results nationally (as appropriate) with follow-up testing.
This demonstration concept has been the de facto approach to date, but has not been codified and adopted in a transparent manner useable by test beds. NCEP uses it to evaluate whether model changes should be adopted operationally. JHT uses this approach extensively, and is a model of how to apply to a specific welldefined forecast problem with one NCEP center. Warning decision support tools turn new data into forecast usable information.
1 Each of the test beds described here were represented at a
REFERENCES
Anderson, M. C.,
Atlas, R., 1997: Atmospheric observations and experiments to assess their usefulness in data assimilation. J. Meteor. Soc.
Auciello, E. P., and
Bernardet, L., and Coauthors, 2008: The Developmental Testbed Center and its Winter Forecasting Experiment. Bull.
Brooks, H. E.,
Brotzge, J.,
Case, J. L.,
Chou, S.-H.,
Chung, D., and
Clark, A. J., and Coauthors, 2012: An overview of the 2010 Hazardous Weather Testbed Experimental Forecast Program Spring Experiment. Bull.
Corfidi, S. F., 1999: The birth and early years of the Storm Prediction Center. Wea. Forecasting, 14, 507-525.
Dabbert, W. F., and Coauthors, 2005: Multifunctional mesoscale observation networks. Bull.
DelSole, T., and
DeMaria, M.,
DOC/NOAA/NESDIS, 2004: Third GOES users conference. DOC/NOAA/NESDIS Conf. Rep., 90 pp. [Available online at www.goes-r.gov.]
Doswell, C. A., and
_____, and Coauthors, 2013: The GOES-R Geostationary Lightning Mapper (GLM).
Grasso, L. D.,
Haines, S. L.,
Heinselman, P. L.,
Hillger, D.,
Jedlovec, G. J.,
Junker, N. W.,
Kain, J. S., 2004: The Kain-Fritsch convective parameterization: An update.
_____,
_____,
_____, S. R. Dembek,
Kirtman, B. P., cited 2011: Toward a National Multi- Model Ensemble (NMME) system for operational intra-seasonal to interannual (ISI) climate forecasts. [Available online at www.cpc.ncep.noaa.gov /products/ctb/MMEWhitePaperCPO_revised.pdf.]
_____, and
Lakshmanan, V., T. Smith,
Lazo, J. K.,
Levit, J. J.,
Loss, G.,
McCarty, W., G. Jedlovec, and
McPherson, R. D., 1994:
Morss, R. E., and
NAS, 2000: From research to operations in weather satellites and numerical weather prediction: Crossing the valley of death. The National Academies Board on Atmospheric Sciences and Climate Rep., 80 pp. [Available online at http://dels.nas.edu/Report /From-Research-Operations-Weather/9948.]
Neiman, P. J.,
NOAA Science Workshop Program Committee, cited 2010: Strengthening NOAA Science: Findings from the
Otkin, J. A., and
Paolino, D. A.,
Petersen, R., and
Petrescu, E., and T. Hall, 2009: IC4D-A new tool for producing four dimensional aviation forecasts. Preprints, Aviation, Range, and Aerospace Meteorology Special Symp. on Weather-Air Traffic Management Integration,
Ralph, F. M., and
_____, and _____, 2012: Historical and national perspectives on extreme
_____, and Coauthors, 2005: Improving short-term (0-48 h) cool-season quantitative precipitation forecasting: Recommendations from a USWRP workshop. Bull.
_____,
_____,
_____, _____,
Rappaport, E. N., J.-G. Jiing,
Rozumalski, R. A., 2007: WRF Environmental Modeling System User's Guide: Demystifying the process of installing, configuring, and running the
Scharfenberg, K. A., and Coauthors, 2005: The Joint Polarization Experiment: Polarimetric radar in forecasting and warning decision making. Wea. Forecasting, 20, 775-788.
Schmit, T. J.,
-, -, and
Stano, G. T., K. K. Fuell, and
Suselj, K.,
Teixeira, J., and Coauthors, 2011: Tropical and subtropical cloud transitions in weather and climate prediction models: The GCSS/WGNE Pacific Cross-Section Intercomparison (GPCI). J. Climate, 24, 5223-5256.
Tippett, M. K., T. DelSole,
Tollerud, E. I., and Coauthors, 2013: The DTC ensembles task: A new testing and evaluation facility for mesoscale ensembles. Bull.
White, A. B.,
_____, and Coauthors, 2012: NOAA's rapid response to the
Wolff, J. K.,
Wolfson, M. M.,
Xiao, H., C.-
Zavodsky, B. T., S.-
AFFILIATIONS:
CORRESPONDING AUTHOR:
E-mail: [email protected]
The abstract for this article can be found in this issue, following the table of contents.
DOI :10.1175/BAMS -D-12-00080.1
A supplement to this article is available online (10.1175/BAMS -D-12-00080.2)
In final form
©2013
| Copyright: | (c) 2013 American Meteorological Society |
| Wordcount: | 13378 |



A CONVERSATION WITH MARIA COGNETTI [Central Penn Business Journal (PA)]
Advisor News
- Trump bets his tax cuts will please Las Vegas voters on his swing West
- Lifetime income is the missing link to global retirement security
- Don’t let caregiving derail your clients’ retirement
- The ‘magic number’ for retirement hits $1.45M
- OBBBA can give small-business clients opportunities for saving
More Advisor NewsAnnuity News
- Human connection still key in the new annuity era
- Lifetime income is the missing link to global retirement security
- ‘All-weather’ annuity portfolios aim to sharply limit rainy days
- Annuity income: The new 401(k) standard?
- Smart annuity planning can benefit long-term tax planning
More Annuity NewsHealth/Employee Benefits News
- Elevance shares slip after insurer discloses Medicare warning
- County leaders look at ways to cut costs
- TENNESSEE SENATE PASSES BIOMARKER TESTING COVERAGE BILL, SENDING TO GOVERNOR'S DESK
- Federal judge sides with Oregon Right to Life in abortion insurance coverage case
- HSAs: Saving for health care or investing for retirement?
More Health/Employee Benefits NewsLife Insurance News
- AI and life insurance: Fast today, unpredictable tomorrow
- Judge allows PHL policyholders to intervene, denies ‘premium holiday’
- eHealth expands into final expense insurance
- CID hosts info session for PHL Variable policyholders
- ‘Seismic changes’ cloud global economy, analyst says
More Life Insurance News