ENGINEERING-BASED SEO

Technical SEO

From crawl budget to Core Web Vitals, from schema architecture to hreflang clusters; a technical SEO program that ties rankings to engineering discipline rather than guesswork.

Ranking is not luck; it is the result of whether the system has been built correctly.

Modern SEO is a far deeper engineering problem than a keyword list. How Googlebot crawls your site, which URLs it spends budget on, how JavaScript weight breaks rendering, which template has bad Core Web Vitals, whether the schema graph is enough for entity linking, whether the internal link architecture carries authority to the right pages — all of these are measurable, auditable and systematizable topics. Roibase's technical SEO team rejects the 'produce content, the rest will come' approach: we first build, measure and improve every layer of site engineering; rankings follow naturally, without the need for panic-driven content production.

Roibase perspective

METHODOLOGY

Our operating framework

In technical SEO the 'do an audit, throw a list, move on' model no longer works. Roibase runs a six-layer engineering framework — each layer ships a measurable deliverable and a sustaining loop refreshes every month.

01

DISCOVER

Technical & business discovery

We merge Search Console, GA4, Bing Webmaster, log files and crawl data; we place the gap between business goals (revenue, lead, brand) and technical reality on a single slide.

02

ARCHITECT

Site engineering & template map

URL taxonomy, internal link architecture, schema graph, render strategy (SSR/SSG/ISR) and template-level CWV budget are tied to a single engineering document.

03

EXECUTE

Sprint-based execution & content refresh

An 8-12 item quarterly roadmap scored by impact x effort x difficulty; each sprint output passes QA and ships with developer tickets.

04

MEASURE

BigQuery + Looker dashboard

Impressions, CTR, average position, conversions, AI search share, schema validation rate and crawl distribution tracked daily on one screen; alerts fire automatically.

05

DEFEND

Algorithm & competitor monitoring

We monitor Google core update signals, SERP volatility and top-10 competitors' technical moves 24/7; when impact occurs, we share a root-cause report within 48 hours.

06

ITERATE

Monthly review & roadmap refresh

At each month-end, outcomes, hypotheses, deviations and new opportunities are compared; the next sprint's priority is updated on numerical evidence.

— COMPARISON

The difference between technical SEO approaches

Very different worlds can sit behind the same deliverable. There are three typical approaches in the market; we build the engineering-based model to close the gaps of the other two.

CriterionDIY / in-house juniorClassic SEO agencyRoibase engineering-based
Crawl budget managementNot doneMonthly manual checkWeekly log stream + automated alerts
Render auditLighthouse score is enoughMobile-friendly test screenshotTemplate-by-template render diff via Puppeteer + URL Inspection API
Schema architectureFAQ + breadcrumbJSON-LD on some templatesFull graph across 11+ types + entity disambiguation
Core Web VitalsGreen in PSI is enoughShown in monthly reportTemplate-level budget + live field-data alerts
Internal link strategySprinkled based on contentPillar-cluster suggestionsInternal PageRank simulation + authority flow optimization
Migration safetyFingers crossed301 list after the factPre-migration audit + canary launch + post-launch monitoring
Algorithm response timeUnclearReport after 1-2 weeksRoot cause + action list within 48 hours
ReportingGA + Search Console screenshotMonthly PDFLooker dashboard + alerted BigQuery + sprint review

PROOF

Outcomes, measured

3.2x
Organic traffic growth

Average at month 12 across engagements — data from 24 clients.

84%
Core Web Vitals green rate

Across accounts that started at 24%; mobile + desktop average.

-68%
Wasted crawl budget

Average drop after log cleanup + index hygiene.

18
Language/country scale

Hreflang cluster management on a single site architecture.

48 hours
Algorithm response time

Root cause + action list delivery time after a core update.

5-7 days
Brief -> publish time

With an optimized content ops pipeline (previously 2-3 weeks).

WHAT WE DO

Engagement scope

Every offering is an outcome-based work package. Roibase blends strategy and execution inside a single team — no hand-offs.

01 / 10

Log file analysis & crawl budget management

We parse server logs for Googlebot, Bingbot and LLM crawlers (GPTBot, ClaudeBot, PerplexityBot); how many requests each template receives, which URL groups eat budget for nothing, which 404/302 chains keep getting crawled — shown in a weekly report. We increase the share of crawls reaching critical pages by an average of 3-5x.

02 / 10

JS render audit & SSR/SSG architecture

We test how CSR pages are rendered by Googlebot using Puppeteer + URL Inspection API; we solve hydration bottlenecks in React/Vue/Next/Nuxt projects and, if needed, produce a migration roadmap to SSR/SSG. We prevent JS weight from capping your rankings.

03 / 10

Schema.org & entity graph architecture

We build a full JSON-LD tree for Organization, Product, Article, FAQ, HowTo, BreadcrumbList, Person, Service, Review, Event and Offer. Schema is not just about visibility; it is the foundation for entity disambiguation in generative search engines — rich results + GEO citations are targeted in parallel.

04 / 10

International SEO & hreflang cluster

We set up hreflang clusters that scale to 7 languages + 18 countries and define the x-default fallback correctly; the sub-folder vs sub-domain decision is made based on CTR, authority flow and operational cost. We fix local rankings damaged by wrong hreflang within 4-6 weeks.

05 / 10

Core Web Vitals & field data management

LCP < 2.5s, INP < 200ms, CLS < 0.1 — these numbers are not targets but starting points. We track CrUX and PSI field data by template and alert within 24 hours when regression appears. From image preloading to critical CSS, we own every lever in LCP optimization.

06 / 10

Internal linking & topical authority

We build the topical authority map from a query matrix; we analyze pillar/cluster architecture, anchor text distribution, breadcrumb depth and internal PageRank distribution. 30-40% of most sites' traffic is the result of authority routed to the wrong page — fixing this alone drives serious growth.

07 / 10

Index hygiene & cannibalization audit

We extract cannibalization signals from Search Console + log + GA4 and clean up 30+ pathologies like soft 404, indexed but not in sitemap, duplicate without canonical. We keep the pages that should stay in the index and manage the rest with the right signal via noindex/410/canonical.

08 / 10

Content ops & query matrix

We do optimization, not production: query intent clustering, SERP feature mapping, content gap analysis, brief production, internal link suggestions and a refresh loop. We set up the operational pipeline to work with your content team (in-house or freelance) and cut the brief-to-publish time from 2-3 weeks to 5-7 days.

09 / 10

Migration & replatform risk management

Before a CMS change, domain migration, design system transition or move to headless, we run a technical risk assessment, build a URL map + 301 chain plan + rollback strategy. A pre-migration audit + post-launch monitoring is mandatory to avoid the '40% traffic loss after migration' story.

10 / 10

Algorithm update response protocol

During big waves — core update, helpful content update, spam update or site reputation abuse update — we produce an impact analysis within 48 hours: which pages were affected, why, how long recovery takes and which fixes take priority. Systematic response, not panic.

— OUTCOMES

What the technical SEO program delivers for you

Beyond page rankings we produce commercial and operational outcomes. Here are 6 concrete results our typical client sees within 90-180 days.

+45% annual organic revenue

Predictable organic revenue

We forecast impressions, CTR and conversions on a quarterly basis; you gain an SEO model that speaks the same language as your finance team.

120+ technical debt items cleared

Escape technical debt

We systematically clean up old URLs, broken schemas, hydration anomalies and cannibalization stacks; the engineering team can focus on new features.

TOP 5 citation in 90 days

Visibility in AI search

Investment in schema + entity graph directly reflects into citations in ChatGPT, Perplexity and AI Overviews; we run the classic SEO + GEO program together.

5-7 day publish speed

Operational speed

Briefs, internal link suggestions, schema templates and QA automation speed up your content production 3-5x; your team keeps up with sprints.

<2% traffic loss

Migration safety

We drive the traffic-loss risk close to zero during a replatform, domain migration or CMS change; no 'we'll fix it later' stress.

30-45 day recovery

Algorithm resilience

We take a proactive, not reactive, approach to core updates; when impact occurs, we share an action list within 48 hours and typically recover within 30-45 days.

DELIVERABLES

What we ship inside the service scope

Concrete artifacts you receive at the end of every sprint — we ship files and systems rather than verbal reports.

  • Technical SEO audit document

    40-60 findings, impact x effort scores, owners assigned in a Notion page.

  • Crawl & log report

    Weekly Googlebot behavior report, anomalies, crawl frequency of critical URLs.

  • Render audit report

    Puppeteer screenshot diffs, hydration metrics, SSR/SSG recommendations.

  • Schema implementation package

    JSON-LD snippets for all templates + validator output + test links.

  • Internal link architecture map

    Pillar/cluster diagram, anchor text matrix, internal PageRank simulation.

  • CWV optimization plan

    Template-level action list for LCP, INP, CLS + developer tickets.

  • hreflang cluster configuration

    XML sitemap + HTML tag + HTTP header triple, x-default fallback included.

  • Content brief template

    Query intent, SERP features, internal link suggestions, schema notes, target keyword cluster.

  • BigQuery + Looker dashboard

    Sprint review, alert configuration, competitor monitoring — one dashboard.

  • Migration playbook

    Pre-migration checklist, 301 map, canary launch plan, post-launch monitoring.

  • Algorithm response protocol

    Format for a root cause + action list within 48 hours of a core update.

  • Monthly executive summary

    A one-pager for the C-level: traffic, revenue, sprint outputs, next quarter target.

— SCOPE

What's inside and what's outside the technical SEO program

Transparency is essential — telling clearly what we do is as important as telling clearly what we don't.

What this service covers

  • Technical SEO audit + a 40-60 item action list
  • Weekly log file analysis + crawl budget management
  • JS render audit + SSR/SSG architecture recommendations
  • Schema.org JSON-LD implementation for all templates
  • Core Web Vitals optimization plan + live field data tracking
  • Internal link architecture + topical authority map
  • International SEO + hreflang cluster management
  • Content brief production + editorial pipeline setup
  • Index hygiene + cannibalization cleanup
  • Migration risk management + post-launch monitoring
  • Algorithm update response protocol (48-hour SLA)
  • Monthly BigQuery + Looker dashboard review + roadmap refresh

What this service does not cover

  • Spam links / PBN / black-hat link building
  • Automated AI content generation (without QC)
  • Guaranteed #1 ranking promises (no one can deliver these on Google)
  • Website design & development (we collaborate with partner teams)
  • Social media management & influencer marketing
  • Performance marketing (Google/Meta Ads) — separate service scope
  • Direct press release / PR distribution
  • Post-sales technical support / hosting management

HOW WE WORK

Rankings by system-building, not guessing.

01

Week 1 — Technical & business discovery

Search Console, GA4, log files, crawl data and business goals are synced; a baseline report is produced.

02

Week 2 — Audit & priority matrix

A 40-60 finding technical audit scored by impact x effort x difficulty; the first-quarter roadmap is presented on one slide.

03

Week 3-4 — Quick win implementation

Index hygiene, robots/sitemap fixes, critical schema injection, obvious CWV regressions — the first concrete impact appears in this sprint.

04

Week 5-6 — Architectural refactor

URL taxonomy, internal link structure, render strategy and template CWV budgets are implemented with the engineering team.

05

Month 2 — Content ops pipeline

Query matrix, brief template, content refresh loop, editorial QA. Content production speed rises 3-5x; the pillar/cluster structure settles in.

06

Month 3 — Authority building & link earning

Digital PR, partner content, production of high-source-value assets (reports, calculators, datasets); a natural backlink flow is triggered.

07

Month 4 — SoV defense & competitor monitoring

A defensive sprint to protect market share: competitor technical moves, SERP feature losses and AI Overviews visibility tracked on one board.

08

Month 5+ — Monthly iteration & scaling

Sprint review, fixing deviations, new language/country/product launches, executive summaries. The program becomes a permanent capability.

— ECOSYSTEM

The platforms & tools we use

Pivotal tools per category — we integrate with the client's existing stack whenever possible and do not force new licenses.

CRAWL & INDEX

Screaming Frog SEO SpiderSitebulbOnCrawlJetOctopusGoogle Search Console APIBing Webmaster ToolsURL Inspection APILogz.io / ELK stack

RENDER & PERFORMANCE

PuppeteerWebPageTestLighthouse CIPageSpeed Insights APICrUX dashboardCalibre / SpeedCurve

CONTENT & QUERY

AhrefsSemrushSistrixSurfer SEOClearscopeAlsoAskedKeyword Insights

REPORTING & WORKFLOW

BigQueryLooker StudioGA4 enhancedNotion / ConfluenceLinear / JiraSlack alert bot

QUESTIONS

Frequently asked

Early signals from technical improvements (CWV, schema, index hygiene) appear in Search Console within 4-8 weeks. Meaningful ranking shifts from content + authority come within 3-6 months. Roibase ships the technical baseline and quick wins in the first 30 days; revenue impact becomes measurable from day 90-180.

— GLOSSARY

Technical SEO glossary

Engineering-based SEO carries its own vocabulary. Short definitions for the 12 concepts we use most, so we speak the same language.

01
Crawl budget
The total crawling capacity Googlebot allocates to your site. Determined dynamically by server speed, site authority and content freshness; when wasted, critical pages get indexed late.
02
Indexability
The set of attributes that decide whether a URL is eligible to enter Google's index: meta robots, canonical, hreflang, status code, content quality and crawl accessibility evaluated together.
03
Core Web Vitals (CWV)
The three-metric set Google uses to measure user experience: LCP (largest content paint), INP (interaction response time) and CLS (visual shifts). Directly factors into rankings.
LCPINPCLS
04
LCP / INP / CLS
LCP is the time for the largest content element to appear (<2.5s), INP is interaction response time (<200ms), CLS is visual shift during load (<0.1). All three must be 'good'; if one is bad, the template's CWV is considered failing.
05
Hydration
The process by which server-rendered HTML is 'brought to life' by JavaScript in the browser. Slow hydration breaks INP; if JS errors, content appears but becomes unclickable.
06
Schema.org JSON-LD
A structured-data format that makes page content machine-readable. The foundational signal for rich-result eligibility, entity disambiguation and LLM citations.
07
hreflang
The <link rel="alternate" hreflang="x"> directive that tells search engines about language/region variants of the same content. In a multilingual architecture it preserves canonical distribution, avoids ranking in the wrong language and self-cannibalisation; deployed reciprocally with an x-default.
08
Topical authority
The aggregate signal that shows your site's expertise depth on a particular topic cluster. A result of pillar/cluster architecture, internal link density and E-E-A-T components.
09
Internal PageRank
The classic PageRank algorithm simulated across your site's internal link structure. Measures how much authority each page accumulates from your own site; authority flow directly impacts ranking power.
10
E-E-A-T
Experience, Expertise, Authoritativeness and Trustworthiness. Google's quality evaluation framework; decisive for YMYL topics, supportive in other areas.
11
SERP volatility
A measure of daily fluctuation intensity in Google rankings. High volatility typically signals an algorithm update; tracked via tools like Semrush Sensor and Mozcast.
12
Cannibalization
A situation where two or more pages serving the same intent compete for the same query. Both pages rank lower and Google becomes uncertain which to show; resolved by consolidation or intent separation.
13
Hreflang
The <link rel="alternate" hreflang="x"> directive that tells search engines about language/region variants of the same content. In a multilingual architecture it preserves canonical distribution, avoids ranking in the wrong language and self-cannibalisation; deployed reciprocally with an x-default.
14
Canonical Tag
The <link rel="canonical"> meta that points search engines to the "primary" URL of identical or near-identical content. Consolidates signal across filtered categories, parameterised URLs, AMP/mobile alternates and syndicated copies; misuse can cause index hijacking.
15
Structured Data
Markup that makes page content machine-readable using the schema.org vocabulary. A prerequisite for rich-result, AI Overviews and voice eligibility; the most common carrier is JSON-LD, validated in Search Console's Rich Results report.
16
JSON-LD
The most common structured-data syntax — schema.org markup carried as JSON inside a <script type="application/ld+json"> block. Decoupled from the HTML, friendly to server-side rendering, and Google's recommended format.
17
XML Sitemap
An XML file listing the site's indexable URLs along with last-modified time and change frequency for search engines. Large sites combine sitemap index files with hreflang annotations; the file is referenced from robots.txt.
18
robots.txt
A plain-text file at the site root that tells crawlers which paths they may crawl. Carries per-user-agent Allow/Disallow directives, the Sitemap reference and — increasingly — AI-crawler hints like LLM-Content; a wrong Disallow can deindex pages.

— DECISION TREE

Is the technical SEO program right for you now?

Four short questions. In 30 seconds we tell you the right starting point — full program, content-first model, foundation before an architectural refactor, or strategic discovery.

01 / 04

Has your site lost organic traffic or revenue in the last 6-12 months?

Trend check via Search Console + GA4.

— LET'S BEGIN

How efficiently is your site architecture telling your story to Google?

We scan crawl, render, schema and ranking signals in 72 hours and show, in a single report, which stones are slowing you down and which gains are reachable in the next 90 days.