paint-brush
What Needs Improvement in DeepSee for Ocean Research? Experts Weigh Inby@oceanography
171 reads

What Needs Improvement in DeepSee for Ocean Research? Experts Weigh In

tldt arrow

Too Long; Didn't Read

Experts discussed several limitations of DeepSee, including challenges with data formatting, accuracy of interpolation methods, and the impracticality of real-time data input during expeditions. Future enhancements, like real-time data integration, are anticipated as the system evolves.
featured image - What Needs Improvement in DeepSee for Ocean Research? Experts Weigh In
Oceanography: Everything You Need to Study the Ocean HackerNoon profile picture
0-item

Abstract and 1 Introduction

2 Related Work

3 Methodology

4 Studying Deep Ocean Ecosystem and 4.1 Deep Ocean Research Goals

4.2 Workflow and Data

4.3 Design Challenges and User Tasks

5 The DeepSea System

  • 5.1 Map View
  • 5.2 Core View

5.3 Interpolation View and 5.4 Implementation

6 Usage Scenarios and 6.1 Scenario: Pre-Cruise Planning

  • 6.2 Scenario: On-the-Fly Decision-Making

7 Evaluation and 7.1 Cruise Deployment

7.2 Expert Interviews

7.3 Limitations

7.4 Lessons Learned

8 Conclusions and Future Work, Acknowledgments, and References

7.3 Limitations

Several data wrangling and analysis limitations were expressed during the interviews. A common issue discussed was prerequisite knowledge needed to format the data; e.g., P5 faced challenges “inputting DNA sequence information. Knowing how to input data and the centralization of that data is challenging. It’s often that one person ends up inputting data.” While combining multiple data types in a single interface was a design goal, P1 lamented a reduction in dimensionality: “Certain data types might have different data descriptions and having multiple columns and reshaping tables is a challenge... Do I want to try to fit existing data into a certain tool like DeepSee?” P1 also wanted taxonomic data stratified hierarchically, a feature common to many biological analysis systems: “There could be very rare individual units that are technically different taxa. This creates very sparse data where abundance is very small for most of the columns. We needed a way to pick and find different subsets of taxa based on those hierarchies.” Finally, P4 wanted to see “a photo or snapshot of what a core actually looked like. An in-situ photo would allow a person to see what the sea floor looked like, what other cores were around it.”


The Interpolation View provided a contextualized way to view the data, as most other similar programs visualize gradients in 2D without spatial context between the sampled cores. However, there was a lack of confidence in the accuracy of the interpolation methods, as natural neighbors and linear gradients are unlikely to reflect spatially heterogeneous in-situ environmental processes. Similarly, distance to the nearest sample as a proxy for uncertainty was not accurate enough to use for in-situ decision making. Still, P2 suggested potential applications of Value-Suppressing Uncertainty Palettes (VSUPs) [11] in the future as the team develops more accurate uncertainty quantification: “We can see where the data is not strong and potentially sample to strengthen our predictions.” Specifically, the researchers noted that the modular nature of DeepSee will make incorporating more sophisticated interpolation methods (e.g., a Gaussian process such as Kriging [36]) and uncertainty measures (e.g., variance in the posterior predictive distribution of the Gaussian process) in the future straightforward.


One deployment goal that was not validated during DeepSee’s maiden expedition was whether data could be input in real time to visualize, as many of the relevant geochemical and biological parameters could not be measured while on the cruise, requiring further analysis on land. P3 reflected on this challenge, saying: “We talked about using DeepSee on the fly during the research expedition to load data right away. So far, that’s been impractical. We don’t have geochem or bio data to put in DeepSee while at sea. We also haven’t had a cruise to test whether these challenges were due to personnel on board or based on limitations with the DeepSee interface. Everyone was overextended on the expedition and no one had time to do additional data entry on the computer.” However, the team felt this helped them understand the time and energy allotment needed to use DeepSee for real-time data integration and visualization. P3 further discussed how they envision DeepSee being used on future cruises: “In the future, we’ll have a full team of scientists, and we’ll also have a manned submersible taking cores... We’ll have time to load the day’s cores into DeepSee to help plan for the next day... The important objective is having more people to do the same number of tasks... I could see this becoming one of the evening tasks: take the day’s coordinates and load them into DeepSee.” We synthesize design guidance for supporting real-time decision making shipboard in the next section.


This paper is available on arxiv under CC BY 4.0 DEED license.


Authors:

(1) Adam Coscia, Georgia Institute of Technology, Atlanta, Georgia, USA (acoscia6@gatech.edu);

(2) Haley M. Sapers, Division of Geological and Planetary Sciences, California Institute of Technology, Pasadena, California, USA (hsapers@caltech.edu);

(3) Noah Deutsch, Harvard University Cambridge, Massachusetts, USA (ndeutsch@mde.harvard.edu);

(4) Malika Khurana, The New York Times Company, New York, New York, USA (malika.khurana@nytimes.com);

(5) John S. Magyar, Division of Geological and Planetary Sciences, California Institute of Technology Pasadena, California, USA (jmagyar@caltech.edu);

(6) Sergio A. Parra, Division of Geological and Planetary Sciences, California Institute of Technology Pasadena, California, USA (sparra@caltech.edu);

(7) Daniel R. Utter, rwipfler@caltech.edu Division of Geological and Planetary Sciences, California Institute of Technology Pasadena, California, USA (dutter@caltech.edu);

(8) John S. Magyar, Division of Geological and Planetary Sciences, California Institute of Technology Pasadena, California, USA (rwipfler@caltech.edu);

(9) David W. Caress, Monterey Bay Aquarium Research Institute, Moss Landing, California, USA (caress@mbari.org);

(10) Eric J. Martin Jennifer B. Paduan Monterey Bay Aquarium Research Institute, Moss Landing, California, USA (emartin@mbari.org);

(11) Jennifer B. Paduan, Monterey Bay Aquarium Research Institute, Moss Landing, California, USA (paje@mbari.org);

(12) Maggie Hendrie, ArtCenter College of Design, Pasadena, California, USA (maggie.hendrie@artcenter.edu);

(13) Santiago Lombeyda, California Institute of Technology, Pasadena, California, USA (santiago@caltech.edu);

(14) Hillary Mushkin, California Institute of Technology, Pasadena, California, USA (hmushkin@caltech.edu);

(15) Alex Endert, Georgia Institute of Technology, Atlanta, Georgia, USA (endert@gatech.edu);

(16) Scott Davidoff, Jet Propulsion Laboratory, California Institute of Technology, Pasadena, California, USA (scott.davidoff@jpl.nasa.gov);

(17) Victoria J. Orphan, Division of Geological and Planetary Sciences, California Institute of Technology, Pasadena, California, USA (vorphan@caltech.edu).