Staff Geospatial Systems Engineer
In this role, you will drive geospatial data sourcing, automation, and tooling in an AI/ML pipeline for new applications. You will contribute to and own data research and sourcing, automating data preprocessing, and building tooling to export the geospatial data outputs of an AI/ML pipeline for use in building interactive applications. In this role, you’ll research and gather data for specific regions from different sources, including imagery and other raster data, vector data, and point clouds. You will find and acquire data from free/public domain sources, as well as form and maintain partnerships with commercial data providers. You will be responsible for ensuring the coverage, freshness, and quality of the sourced data. To prepare this data for AI/ML use, you’ll build and use automated tools (incorporating GDAL and in-house libraries) to clean, resize, reformat, and align it so it’s ready for our team to use. You will also be responsible for tooling to automatically postprocess the output of a geospatial AI/ML pipeline for use by other teams. This role involves working closely with different teams in our organization. You’ll partner with the AI/ML teams to supply diverse data they need for building new machine learning models, and with the Pipeline and Product teams to ensure they have the necessary data required. Responsibilities include, but not limited to: Aggregate data for target regions and ensure that a complete collection of data is available. Research open, public domain, and commercial sensor data sources to determine the areas, quality, and freshness of available data. Utilize open source (e.g. gda1) and internally developed software tools to preprocess, align, crop, resample, and otherwise prepare data for use Contribute to our internal tools that process and source geospatial datasets. Partner with groups across the organization to understand needs for GIS data for AI/ML research. Coordinate with AI/ML team to provide diverse data for developing new machine learning algorithms, and the Pipeline and Product teams to provide data to roll out new regions. Partner closely with leadership to understand the high-level product vision