Data Engineer

<h1 style="line-height:1.38;margin-top:27px;margin-bottom:8px;"><span style="font-size:20pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;">Data Engineer</span></span></span></span></span></span></h1><p style="line-height:1.38;"><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;">This is a demanding role on a small, high-leverage team. You'll be one of a handful of people responsible for the data behind IPinfo's location and context products, working on ambiguous problems with messy inputs and owning your pipelines end to end - including understanding every line you ship.</span></span></span></span></span></span></p><h2 style="line-height:1.38;margin-top:24px;margin-bottom:8px;"><span style="font-size:16pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;">What you'll do</span></span></span></span></span></span></h2><ul><li style="list-style-type:disc;"><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:700;"><span style="font-style:normal;"><span style="text-decoration:none;">Make sense of large, unfamiliar datasets</span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;"> sourced from 
publicly contributed (and therefore inconsistent) sources like OpenStreetMap and Overture, as well as error-prone device datasets, sometimes with dozens of poorly documented columns.  Your job is to wade through these datasets, figure out what is going on, and extract a meaningful signal.</span></span></span></span></span></span></li><li style="list-style-type:disc;"><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:700;"><span style="font-style:normal;"><span style="text-decoration:none;">Maintain and extend BigQuery data pipelines</span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;">, writing efficient, transparent code that achieves complex data tasks while avoiding bloat and spaghetti.</span></span></span></span></span></span></li><li style="list-style-type:disc;"><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:700;"><span style="font-style:normal;"><span style="text-decoration:none;">Work with particular expertise on geospatial data</span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;">, knowing the suite of BigQuery geospatial tools like the back of your hand, while dealing with the particular headaches and challenges that geospatial data poses.  
You'll also occasionally work in Python.</span></span></span></span></span></span></li><li style="list-style-type:disc;"><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:700;"><span style="font-style:normal;"><span style="text-decoration:none;">Use AI tooling to move quickly while fully owning every line in your PRs</span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;">.</span></span></span></span></span></span></li><li style="list-style-type:disc;"><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:700;"><span style="font-style:normal;"><span style="text-decoration:none;">Communicate problems and solutions clearly</span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;"> using our internal issue-tracking platform, writing concise, reproducible records of the problem, the proposed solutions, and why you made the calls you did, so others can follow and build on them.</span></span></span></span></span></span></li><li style="list-style-type:disc;"><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:700;"><span style="font-style:normal;"><span style="text-decoration:none;">Work occasionally on web-based dashboards</span></span></span></span></span></span><span 
style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;"> to provide visibility to our data pipelines for data engineers as well as others at the company.</span></span></span></span></span></span></li></ul><h2 style="line-height:1.38;margin-top:24px;margin-bottom:8px;"><span style="font-size:16pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;">What we're looking for</span></span></span></span></span></span></h2><h3 style="line-height:1.38;margin-top:21px;margin-bottom:5px;"><span style="font-size:13.999999999999998pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#434343;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;">Must have</span></span></span></span></span></span></h3><ul><li style="list-style-type:disc;"><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:700;"><span style="font-style:normal;"><span style="text-decoration:none;">Advanced SQL</span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;"> - window functions, CTEs, query restructuring for performance, and an understanding of why a query is slow and how to fix it. 
BigQuery is a strong plus.</span></span></span></span></span></span></li><li style="list-style-type:disc;"><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:700;"><span style="font-style:normal;"><span style="text-decoration:none;">Strong communication skills</span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;"> - you know how to talk and write about complex problems and data pipelines productively.</span></span></span></span></span></span></li><li style="list-style-type:disc;"><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;">A </span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:700;"><span style="font-style:normal;"><span style="text-decoration:none;">track record of turning messy, ambiguous data into reliable, interpretable signals</span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;">, with the judgment to explain your calls.</span></span></span></span></span></span></li><li style="list-style-type:disc;"><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span 
style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;">An </span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:700;"><span style="font-style:normal;"><span style="text-decoration:none;">internet record of significant experience</span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;"> as a data scientist or engineer, on GitHub, Stack Overflow, in the academic literature or on a personal blog, or strong references to back up a track record on proprietary code bases.</span></span></span></span></span></span></li><li style="list-style-type:disc;"><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:700;"><span style="font-style:normal;"><span style="text-decoration:none;">Clean-code discipline</span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;">: you don't ship code without tests, code review, and readable abstractions.  
You prefer subtractive solutions to additive solutions.</span></span></span></span></span></span></li><li style="list-style-type:disc;"><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:700;"><span style="font-style:normal;"><span style="text-decoration:none;">Fast learning</span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;"> - comfort becoming productive in unfamiliar domains (internet measurement, geospatial reasoning, internal tooling) with little hand-holding.</span></span></span></span></span></span></li><li style="list-style-type:disc;"><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:700;"><span style="font-style:normal;"><span style="text-decoration:none;">AI-assisted development paired with full ownership</span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;"> - you can read, debug, and defend everything the tools produce.</span></span></span></span></span></span></li><li style="list-style-type:disc;"><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:700;"><span style="font-style:normal;"><span style="text-decoration:none;">Geospatial fundamentals</span></span></span></span></span></span><span 
style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;">: coordinate systems, spatial joins, containment, polygon operations.</span></span></span></span></span></span></li></ul><h3 style="line-height:1.38;margin-top:21px;margin-bottom:5px;"><span style="font-size:13.999999999999998pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#434343;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;">Nice to have</span></span></span></span></span></span></h3><ul><li style="list-style-type:disc;"><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;">Cloud tooling and workflow orchestration (</span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:700;"><span style="font-style:normal;"><span style="text-decoration:none;">CI/CD</span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;">, </span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:700;"><span style="font-style:normal;"><span style="text-decoration:none;">Docker</span></span></span></span></span></span><span 
style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;">, </span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:700;"><span style="font-style:normal;"><span style="text-decoration:none;">Airflow</span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;">, etc.).</span></span></span></span></span></span></li><li style="list-style-type:disc;"><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:700;"><span style="font-style:normal;"><span style="text-decoration:none;">JavaScript and web dashboards</span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;"> (e.g. 
Retool, Mapbox, internal validation and visualization tooling).</span></span></span></span></span></span></li><li style="list-style-type:disc;"><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;">Exposure to the </span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:700;"><span style="font-style:normal;"><span style="text-decoration:none;">science of internet measurement</span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;">: BGP/ASN, rDNS, RTT-based geolocation, CGNAT, mobile vs. fixed-line IP behavior, geofeeds.</span></span></span></span></span></span></li><li style="list-style-type:disc;"><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:700;"><span style="font-style:normal;"><span style="text-decoration:none;">Strong Python for geospatial data work</span></span></span></span></span></span><span style="font-size:11pt;font-variant:normal;white-space:pre-wrap;"><span style="font-family:Arial, sans-serif;"><span style="color:#000000;"><span style="font-weight:400;"><span style="font-style:normal;"><span style="text-decoration:none;"> - comfortable with the data and geospatial stack (pandas, geopandas, shapely) and writing code that holds up in production, not just in a notebook.</span></span></span></span></span></span></li></ul>
