Section A · Orient

The Roles, Decoded

SentiLink Staff DS (Full Stack) and Archetype AI Data Scientist side by side. What each team actually does, what each line of the JD really means, and where the two roles converge despite different seniorities.

Archetype AI — Data Scientist (a.k.a. AI Solutions Specialist)

Posting at a glance

Role: Data Scientist (posted under several titles incl. "AI Solutions Specialist") · Team: Go-To-Market / Solutions Engineering · Location: San Mateo, CA (on-site) · Seniority: 4+ years

What Archetype is

Archetype AI builds "Newton," a multimodal-LLM platform for the physical world — fusing video, sensor, time-series, and other real-world data into a model that can reason about physical environments. Series A startup, founded by former Google engineers. The customer pitch: build perception systems for industrial, mobility, retail, and IoT applications without having to train custom CV/sensor models per use case.

What this role actually is

Don't be misled by the "Data Scientist" title. The team is GTM / Solutions Engineering, not core research. The work is customer-facing applied AI: a customer brings raw sensor or video data and a business question, you turn it into a working POC on Newton.

Concretely:

  • Review incoming customer assets (video, sensor streams) for feasibility.
  • Prepare datasets in Python — cleaning, structuring, imputation, filtering, normalization, signal processing.
  • Support labeling and annotation workflows end-to-end (Encord-style tools).
  • Iterative preprocessing cycles with evaluation and refinement.
  • Design and refine prompts (including n-shot examples) for Newton's "lenses."
  • Configure lens parameters for POC runs.
  • Summarize results in customer-ready reports.

The defining sentence of the JD

Read this twice

"Startup-ready mindset with the ability to thrive in high-velocity, high-ambiguity environments. Self-starter with bias for action... driving them from raw data to testable results without waiting for step-by-step direction."

Decoded: they need someone who can take ambiguous customer input and produce a working POC in days, not weeks. The on-site requirement and GTM placement reinforce this — you're embedded with Solutions Architects and Sales, expected to move at the speed of customer conversations.

The stacks, decoded

SentiLink stack

JD phraseWhat it means
"Python 3"Production-grade Python — typed, tested, modular. Not "scripts."
"PostgreSQL"The system of record for application data and historical predictions. Expect SQL questions on a Postgres dialect (window functions, JSON ops, indexing intuition).
"AWS infrastructure (EC2, S3, RDS, Redshift)"Standard for fintech. EC2 for inference services, S3 for training data, RDS for app data, Redshift for analytics. Know what each is for.
"Real-time decision making"Sub-100ms inference. Implies model design constrained by latency — no heavy ensembles in the hot path; precompute features.
"Production code and tests"Pytest, type hints, modules. The bar is "engineer would review my code."

Archetype stack

JD phraseWhat it means
"Python proficiency for data cleaning, API calls, dataset preparation"NumPy, pandas, requests-style API calls to Newton. Less production-engineering, more applied scripting.
"Jupyter Notebooks"The primary work environment, not a deliverable. Iterate fast.
"Time-series and signal visualization"Matplotlib + spectrograms + correlograms. You'll be staring at sensor traces.
"Encord (bonus)"A specific labeling tool. They use it. Know what it does even if you haven't used it.
"Prompt engineering and fine-tuning workflows"You write prompts and configure lens parameters. Fine-tuning is more limited; the platform exposes lenses with hyperparameters.

The shared spine

Strip away the seniority and the domain, both roles share five core competencies:

  1. End-to-end ownership. From raw data to shipped artifact. No "ML engineer takes it from here" handoff.
  2. Domain insight drives modeling. Both JDs explicitly value depth in their respective domains (fraud / sensor data) over methodological novelty.
  3. Iterative preprocessing. The work is in the data, not the model — feature engineering for SentiLink, signal processing and prompt iteration for Archetype.
  4. Production discipline. Even Archetype's notebook-heavy workflow expects results to be reproducible and reliable.
  5. Cross-functional fluency. SentiLink: work with engineering, risk ops, data acquisitions. Archetype: support Solutions Architects, Sales Reps, customers.

The seniority delta

SentiLink is staff (6+ yrs PhD / 8+ yrs MS). Archetype is mid-level (4+ yrs). The delta shows up in:

  • Autonomy and scope. SentiLink expects you to own a fraud domain as the technical authority. Archetype expects you to own assigned POCs. Both are "ownership," at different scopes.
  • Architecture vs application. SentiLink staff DS shapes the modeling approach for an entire fraud product line. Archetype DS applies an existing platform to customer data.
  • Influence on the technical bar. Staff-level signal is setting standards for the team; mid-level signal is hitting them.
  • Comp. SentiLink $220–260k + equity; Archetype not posted but assume $150–200k base for the seniority and on-site SF Bay.

Soft signals to expect

  • "Deep domain understanding" (SentiLink) / "thrive in high-ambiguity" (Archetype). Both signal that the work isn't well-scoped. You'll be defining the work as much as doing it.
  • "Production code and tests" (SentiLink) / "bias for action" (Archetype). Different framings of the same expectation: ship things, don't deliberate.
  • Corporate values list (SentiLink): "Follow Through. Deep Understanding. Whatever It Takes. Do Something Smart." Pattern-match your stories to these in behavioral rounds.
  • "Startup-ready mindset" (Archetype). They want someone who's been in early-stage chaos before, not a first-time startup hire.

What to ask them

SentiLink-flavored

  • "How does the staff DS role on Application Fraud differ from the role on Identity? Where do their problems overlap?"
  • "What does the average week look like for someone two years into this role?"
  • "How do you handle the tradeoff between rigorous experimentation and shipping fast in production?"
  • "How does the DS team work with engineering on real-time inference performance?"

Archetype-flavored

  • "Walk me through a recent POC end-to-end — what came in from the customer, what came out, what took the most iteration?"
  • "Where does the boundary sit between platform engineering and the work this role does?"
  • "What's the typical cycle time from a customer asset arriving to a testable result?"
  • "How do you handle a POC that's looking infeasible — what's the kill or escalate path?"

Universal

  • "What does failure look like in this role at 90 days? At a year?"
  • "What's the trajectory from this role to the next?"
  • "What does the team feel under-resourced on right now?"