SerpHaus

Crawled But Not Indexed: What Google Actually Knows About Your Backlinks

An agency owner asks the hard questions about PBN links, authority networks, and whether indexation actually matters. The answers aren’t what the industry is telling you.

The following is an edited transcript of a conversation that started with a simple question — “do my PBN links still work if they’re not indexed?” — and turned into a deeper examination of what Google actually evaluates at crawl time, why the term “PBN” is misleading for authority-grade networks, and where the industry’s assumptions fall apart under scrutiny.

The PBN Anxiety

Cisco: I need to ask you something directly. I’ve been buying PBN links for years — including from your network. And I keep hearing from people in forums, in Slack groups, on Twitter, that PBN links are dead. That if the page isn’t indexed, the link has zero value. Some people are saying Google can identify PBN networks now and just discounts everything. Should I be worried?

Tom: Let’s start by pulling apart what you’re actually asking, because there are three different concerns buried in that question and they each have different answers. First — is the page indexed? Second — does Google know it’s a “PBN”? Third — does the link have value? The industry treats these as the same question. They’re not.

But before we get into any of that, I need to push back on the term itself. You said “PBN links.” I use that term in marketing because it’s what the industry searches for — it’s the lingo people know. But what we actually operate isn’t a PBN in the way most people understand it.

Cisco: What do you mean? What’s the difference?

Tom: The traditional PBN — the thing Google has been fighting for a decade — is a network of expired domains with thin, auto-generated content, no real editorial value, existing purely to pass link equity. Shared hosting, obvious footprints, no real audience, no topical depth. That model deserves every hit it’s taken. What we’ve built is fundamentally different. We call it an authority network because that’s what it actually is. These are sites with real editorial content, topical relevance, proper structure — built around Google’s own standards: E-E-A-T, helpful content, editorial value. That distinction isn’t marketing spin. It’s the difference between a site Google crawls and sees thin template content, versus a site Google crawls and sees topically relevant editorial content that happens to link out to relevant resources.


What Google Actually Evaluates at Crawl Time

Cisco: Okay, but even if the content is good — if the page isn’t indexed, does the link still do anything?

Tom: This is where the industry has created a false binary. “Indexed = valuable, not indexed = worthless.” It’s more nuanced than that.

Confirmed by Google

Crawling and indexing are separate processes. Google crawls far more pages than it indexes. Googlebot visits a page, processes the content, follows outbound links, encounters anchor text, registers the referring domain. All of that happens at crawl time, before any indexation decision is made. A page not being indexed means Google decided it wasn’t worth showing as a search result. That’s a content surfacing decision — not a signal processing decision.

Cisco: But doesn’t that mean the link doesn’t pass PageRank?

Tom: This is where I have to be honest with you, because the industry isn’t being honest about this. The prevailing expert view is that direct PageRank transfer likely requires the source page to be indexed — based on Google’s Nearest Seed PageRank patent. But here’s what nobody tells you: there is no empirical data confirming this. Google has never made a direct public statement saying links from crawled-but-not-indexed pages don’t pass value. The patent describes a mechanism, not a confirmed implementation. Anyone telling you definitively yes or definitively no is selling something.


Why Authority Sites Change the Equation

Cisco: So if PageRank transfer is uncertain, why does it matter whether the site is an authority network or a traditional PBN?

Tom: Because PageRank isn’t the only thing Google processes at crawl time. And this is where the quality of what’s on the page changes everything.

When Google crawls a traditional thin PBN page, here’s what it encounters: a template layout, filler content, and an outbound link. The contextual signals surrounding that link are essentially zero. There’s no topical relevance, no editorial depth, no E-E-A-T signals.

When Google crawls a page on an authority-grade site — real content, topical depth, editorial structure — the contextual signals are fundamentally different. The link sits within relevant content. The anchor text appears in a natural editorial context. Google encounters a page that looks like what it wants the web to be.

Build Philosophy

Google told us exactly what they’re looking for — E-E-A-T, helpful content, editorial value, topical authority. We built the network around those standards. Not because we’re trying to game an algorithm, but because sites that meet those standards are the sites Google trusts. That’s not a loophole. That’s alignment.

Cisco: But if Google’s getting smarter at identifying networks, doesn’t that still put you at risk?

Tom: Google identifies networks by looking for the things that make traditional PBNs obvious — shared hosting infrastructure, thin content, identical templates, unnatural link topologies. Those are the footprints. An authority network built with unique editorial content, topical depth, proper E-E-A-T signals, and natural link patterns doesn’t have those footprints. What matters to Google isn’t who owns the site. It’s what’s on the page.


Deindexed vs. Never Indexed

Cisco: One more thing — people use “deindexed” and “not indexed” interchangeably. Is there a difference?

Tom: A big one, and conflating them leads to bad conclusions.

Crawled But Not Indexed

Google visited the page, processed its content, and decided it wasn’t worth showing as a search result. A neutral editorial decision — Google makes this call on billions of pages across the web. It doesn’t imply anything negative about the site.

Deindexed

The page was in the index and Google actively removed it. A qualitative judgment changed — either the page degraded, a policy shifted, or an algorithm update reclassified it. Deindexation can carry negative connotations: manual actions, spam detection, quality reclassification.

When people say “my PBN links got deindexed,” they usually mean the pages were never indexed in the first place — which is expected for many page types across the web. But they’re using language that implies Google took action against the site, which is a much more serious claim.


Where’s the Data?

Cisco: I appreciate the honesty, but let me push you on this. Is there actual data backing up the idea that these links have value?

Tom: Here’s exactly what we have, separated into what’s verifiable and what’s inferred.

Verifiable
GSC Referring Domains

Links from crawled-but-not-indexed pages show up as referring domains in Google Search Console. This is measurable, repeatable, and not subject to debate. Google is recording the link relationship from these pages.

Verifiable
Anchor Text at Crawl Time

Googlebot processes page content at crawl time, including outbound links and their anchor text. This happens before any indexation decision. On authority-grade pages with editorial content, that anchor text exists within rich topical context.

Verifiable
Entity Signals

Google’s Knowledge Graph aggregates brand mentions from across the crawled web. Consistent brand presence across authority sites with real editorial content contributes to entity recognition independently of whether individual pages are indexed.

Inferred
PageRank Transfer

The prevailing expert view, based on the Nearest Seed PageRank patent, is that direct PageRank transfer likely requires the source page to be indexed. Logically sound but never empirically confirmed. Google has never made a direct statement on this.

So three of the four signal types are directly observable. The fourth — which happens to be the one the entire industry fixates on — is the one nobody can prove either way.


The Right Question

Cisco: So what should I actually be looking at when I evaluate my backlinks?

Tom: Stop asking “is my backlink indexed?” and start asking “what is Google encountering when it crawls this page?”

If the answer is a thin template with a link and nothing else — yeah, you should be concerned. Not because the link is necessarily worthless, but because there’s nothing on that page giving Google a reason to value the signals surrounding your link.

If the answer is a page with real editorial content, topical relevance, E-E-A-T signals, and your link sitting naturally within that context — you’re in a fundamentally different position.

The industry made two mistakes. First, treating all non-indexed links identically — a thin template page and an authority-grade editorial page are not the same thing just because neither is indexed. Second, calling everything a “PBN” when the range of what that term covers is enormous. A network of expired domains with spun content and a network of authority sites with genuine editorial value are as different as a strip mall and a university library.

Questions about this? Email me directly at tom@serphaus.com

The Principle

Google still crawls what it won’t index. What value that carries depends entirely on what’s on the page when Google gets there. We built the network around that principle — not around tricking Google, but around giving Google exactly what it says it wants to see.

← Read the companion piece: Profile Links in 2026 — Why Google Still Crawls What It Won’t Index

Scroll to Top