There are many ways to drive traffic to a website, but ranking well on Google remains the most durable source of qualified visitors. The problem is that most “ranking factors” articles recycle the same list of 200 items, treating confirmed signals, reasonable guesses, and long-debunked myths as if they carried equal weight.
This guide takes a different approach. For each factor, we tell you whether it is confirmed by Google, strongly supported by evidence (including the 2024 Content Warehouse API leak and documents from the DOJ antitrust trial), or debunked. That distinction matters because optimizing for myths wastes budget, and in regulated verticals it can actively hurt you.
A quick note on the famous “200 ranking factors” figure: Google used that number in marketing material years ago, but its representatives have since said the real picture is far more fluid. Modern ranking is driven by machine learning systems that evaluate hundreds of signals in combination, not a static checklist. Think of the factors below as inputs those systems weigh, not levers you pull one at a time.

Table of Contents
- How Google’s Ranking Systems Work
- Domain Factors
- On-Page and Content Factors
- Title tags
- Meta descriptions
- Headings and structure
- Keyword usage, frequency, and the LSI myth
- Content length and depth
- Originality and the deduplication problem
- Freshness and content updates
- Multimedia and image optimization
- Outbound and internal links
- Readability, spelling, and grammar
- Schema markup and structured data
- Technical and Site-Level Factors
- Backlink Factors
- User Interaction and Engagement Signals
- Brand Signals and E-E-A-T
- Local and Niche Relevance
- Spam Detection and Demotion Systems
- Debunked: What You Can Stop Worrying About
- Conclusion
- FAQs
How Google’s Ranking Systems Work
Before the individual factors, it helps to understand the systems doing the evaluating. Knowing which system responds to which signal tells you where to focus.
PageRank is the original algorithm: links act as votes, and votes from authoritative pages count for more. PageRank still exists in updated forms and remains foundational to how authority flows through the web.
Hummingbird (2013) rebuilt the core engine around meaning rather than keyword matching. It is why exact-match phrasing matters less than covering a topic well.
RankBrain (2015) was Google’s first deep learning ranking system. It helps Google interpret queries it has never seen before and adjust how signals are weighted for different query types.
BERT (2019) and MUM (2021) are language models that understand context, prepositions, and intent. BERT understands what a query means; MUM goes further, working across languages and formats. Neither is something you optimize for directly. You benefit from them by writing naturally and answering the actual question.
The Helpful Content system (2022, folded into the core algorithm in March 2024) evaluates whether a site as a whole produces content for people or for search engines. This is a sitewide classifier: enough unhelpful content can suppress your entire domain, which is why pruning thin pages often lifts everything else.
The Passage Ranking system lets Google rank a specific section of a page even when the page as a whole is not the best match. Clear headings and well-structured sections give your content more chances to surface.
Panda (content quality) and Penguin (link spam) were once separate updates; both have been absorbed into the core algorithm and run continuously. Penguin’s job has largely been taken over by SpamBrain, Google’s AI-based spam detection system, which identifies both spammy sites and unnatural links.
Navboost, revealed in detail during the DOJ antitrust trial, uses aggregated click behavior over a 13-month window to refine rankings. Its existence confirmed what SEOs long suspected: user interaction data influences results, even though Google spent years publicly downplaying it.
Google also runs query-level adjustments such as Query Deserves Freshness (recent content gets a boost for time-sensitive searches) and Query Deserves Diversity (ambiguous queries return a mix of interpretations). These explain why “best smoker” returns products, reviews, and recipes all at once.
Domain Factors
Domain-level signals are weaker than most people assume, but a few are worth understanding.
Domain age and history
Domain age by itself is not a ranking factor, and Google has said so repeatedly. What correlates with age is accumulated authority: older domains have had more time to earn links and build a track record. Domain history is the part that genuinely matters. If a domain previously hosted spam, ran link schemes, or sat in a bad neighborhood, that baggage can follow it. Always check a domain’s backlink profile and archive history before buying it.
The 2024 API leak confirmed Google stores registration data and tracks domain changes, consistent with its patent describing registration and renewal dates as signals for distinguishing legitimate domains from throwaway doorway domains. The practical takeaway is modest: register your domain for multiple years because it is cheap and removes any doubt, not because it will move rankings.
Exact-match and keyword domains
Exact-match domains (plumbersottawa.com) once carried real weight. The EMD update in 2012 stripped most of that advantage from low-quality sites. Today a keyword in your domain provides at most a small relevance hint, and its real value is click psychology: a searcher seeing their keyword in your URL may be more likely to click. Do not sacrifice brandability for it. A memorable brand name generates branded searches, and branded searches are a far stronger signal than a keyword in the domain.
Country-code TLDs
A ccTLD (.ca, .co.uk, .fr) is a strong geo-targeting signal. It helps you rank in that country and limits you elsewhere. Use a ccTLD when your business genuinely serves one country; use a gTLD (.com) with hreflang and localized content when you serve several. And keep language consistent with targeting: Russian content on a .fr domain sends Google contradictory signals.
Whois privacy and ownership
Private Whois data is not a penalty trigger on its own; millions of legitimate sites use it. But combined with other spam signals, hidden ownership fits the pattern of someone with something to hide. More important: if one site in your portfolio receives a manual action for egregious spam, Google can connect ownership across properties. Operators of site networks should treat every property to the same standard, because the weakest one can taint the rest.
On-Page and Content Factors
Page-level optimization is where you have the most direct control, and where the most outdated advice circulates.
The title tag remains one of the strongest on-page relevance signals Google has confirmed. Put your primary keyword near the front, keep titles unique across the site, and write for the click, since the title is also your headline in search results. Sitewide duplicate titles make it harder for Google to distinguish your pages and dilute relevance. Note that Google rewrites titles it considers poor, so a mismatch between your title and your content invites Google to substitute its own.
Meta descriptions
Meta descriptions are not a ranking factor, and Google has confirmed this. They are a click-through factor. Google rewrites the majority of them anyway, often pulling your opening paragraph, which means your first 150 words should be tight, specific, and compelling. Write descriptions for high-priority pages, but do not lose sleep over duplicates on long-tail pages.
Headings and structure
Your H1 should state plainly what the page is about, and it is a confirmed (if modest) relevance signal. Subheadings (H2 through H4) matter more than most realize because of passage ranking: each clearly labeled section is a candidate to rank for its own query. Use headings to organize for readers first; the SEO benefit follows from the clarity.
Keyword usage, frequency, and the LSI myth
Your target keyword should appear in the title, H1, URL, opening paragraph, and naturally throughout the body. Beyond that, stop counting. The old “3% keyword density” rule is a myth with no basis in how modern retrieval works, and mechanical repetition reads as stuffing to both users and SpamBrain.
“LSI keywords” deserve a special mention because the term refuses to die. Latent Semantic Indexing is a 1980s technique that Google has explicitly said it does not use. What people mean by the term, covering related concepts, subtopics, and natural variations, is genuinely valuable. Systems like BERT and MUM understand topics semantically, so a page about smokers that also discusses temperature control, wood types, and cook times demonstrates depth. Just understand you are writing comprehensively for a language model, not sprinkling magic synonyms.
Content length and depth
Two thousand words is not optimal. No specific word count is. Google’s John Mueller has said word count is not a ranking factor, full stop. Long-form content correlates with rankings because thorough coverage tends to require length, not because length is rewarded. The standard is coverage relative to intent: a “how to winterize a Kamado” query might need 1,500 words, while “Kamado Joe ash tool replacement” needs 200 and a product link. Padding the second to hit a word target makes it worse, and the Helpful Content system is designed to notice.
Originality and the deduplication problem
Original information, analysis, testing, and first-hand experience are what separate content Google wants to rank from content it filters out. Duplicate content does not trigger a penalty in the classic sense; instead, Google’s deduplication systems pick one canonical version and suppress the rest. If your version is the suppressed one, you simply do not appear. Use canonical tags to consolidate near-duplicates you control, and treat syndication carefully: syndicated content should point a canonical back to the original, or you risk being out-ranked by your own article on someone else’s site.
Boilerplate is a related issue. A 300-word call to action repeated on every page dilutes the unique content ratio of the entire site. Keep templated blocks lean.
Freshness and content updates
Freshness matters most where Query Deserves Freshness applies: news, prices, evolving best practices, anything with a date sensitivity. For evergreen topics, cosmetic updates do nothing. The leak documentation suggests Google evaluates the magnitude of a change, so rewriting a section with current information counts in a way that fixing typos and bumping the date does not. A genuine content refresh program, where you periodically re-verify facts, update examples, and expand thin sections, serves both rankings and readers. Faking freshness by changing dates is a pattern Google specifically watches for.
Multimedia and image optimization
Unique images, diagrams, and video signal original effort and improve engagement. Optimize them properly: descriptive file names (use hyphens, not underscores, since Google treats hyphens as word separators), accurate alt text, compressed file sizes, and lazy loading below the fold. Alt text should honestly describe the image; Google’s vision systems can tell what is in a picture, so keyword-stuffed or misleading alt tags hurt more than they help. Heavy uncompressed images remain one of the most common causes of failed Core Web Vitals.
Outbound and internal links
Outbound links to authoritative, relevant sources will not directly raise your rankings, but they help classify your content and are part of how quality raters assess whether claims are referenced. Google also analyzes outbound link themes to catch sites selling links. Link out when it serves the reader, use descriptive anchor text, and apply rel=”sponsored” or rel=”nofollow” to affiliate and paid links.
Internal links are underrated. They distribute authority through your site, signal which pages you consider important, and define topical relationships. Your most valuable pages should be reachable within a few clicks of the homepage and linked from relevant content with descriptive anchors. The leak confirmed Google measures internal link prominence, validating what site architects have practiced for years.
Readability, spelling, and grammar
Google has said spelling and grammar are not direct ranking factors, but the indirect path is short: sloppy writing erodes trust, trust affects engagement, and engagement feeds Navboost. Write at the reading level your audience expects. For consumer health content that means clear and accessible; for a developer audience, simplifying too far reads as thin.
Schema markup and structured data
Structured data is not a direct ranking factor, but it is one of the highest-ROI technical tasks available because it earns rich results: review stars, FAQ expansions, breadcrumbs, event listings. Rich results occupy more SERP real estate and lift click-through rate, and CTR feeds back into the user interaction signals discussed below. Implement Organization, LocalBusiness, Article, FAQ, and Breadcrumb schema where they genuinely apply, and never mark up content that is not visible on the page, which is a manual action waiting to happen.
Technical and Site-Level Factors
Page speed and Core Web Vitals
Page speed has been a confirmed ranking factor since 2010 on desktop and 2018 on mobile, formalized through Core Web Vitals: Largest Contentful Paint (loading), Interaction to Next Paint (responsiveness), and Cumulative Layout Shift (visual stability). The direct ranking weight is modest, functioning mostly as a tiebreaker among comparable pages. The indirect weight is large: slow pages bleed users before the content loads, and abandoned visits are exactly the behavior Navboost aggregates. Fix speed for the user data it produces, not just the checkbox.
Mobile optimization
Google has used mobile-first indexing for all sites since 2023, meaning the mobile version of your site is the version Google evaluates. A desktop site that hides content on mobile is hiding it from Google. Responsive design is the default answer. Test real templates on real devices; an “optimized” site with unreadable tap targets and intrusive interstitials fails where it counts.
HTTPS and security
HTTPS has been a confirmed lightweight ranking signal since 2014, and in practice it is table stakes: browsers flag non-HTTPS pages as not secure, which destroys trust before ranking even enters the picture. Ensure certificates are valid, renewals are automated, and every HTTP variant 301-redirects to a single canonical HTTPS version.
Site architecture and crawlability
A logical structure where related content clusters together helps Google understand your site and helps authority flow to the pages that earn it. Keep important pages shallow, use breadcrumb navigation (with BreadcrumbList schema for the rich result), and maintain an XML sitemap. The sitemap will not boost rankings, but it ensures nothing is missed and speeds discovery of new content. An HTML sitemap adds a secondary crawl path on large sites and is genuinely useful to some visitors.
Server reliability, uptime, and hosting changes
Persistent 5XX errors teach Google that pages are gone: it slows crawling, and pages that keep erroring eventually drop from the index. Short outages of a day or two are forgiven; downtime stretching into weeks gets your results suppressed until Google can re-verify the site. When you migrate hosting, expect Google to temporarily throttle crawl rate while it learns what load the new server tolerates, which is normal and recovers on its own. Server location is a minor geo-relevance hint, made mostly irrelevant by CDNs that serve content from edge nodes near the user.
Trust pages
Contact, About, Privacy Policy, and Terms pages are not algorithmic ranking factors in the direct sense, but they map straight onto how Google’s quality rater guidelines define trustworthiness, and the leak references signals consistent with sitewide trust evaluation. A site asking for money or handling YMYL topics with no verifiable contact information fails the most basic trust test. The duplicate-content worry about boilerplate legal pages is unfounded; Google expects privacy policies to look alike.
Backlink Factors
Links remain one of the strongest components of ranking, and Google has confirmed as much, but link evaluation has become dramatically more sophisticated.
What makes a link valuable
The authority of both the linking domain and the specific linking page matter, and the leak confirmed Google computes site-level authority alongside page-level metrics. Beyond raw authority, the signals that compound a link’s value are relevance (a link from an addiction medicine journal to a treatment center outweighs a generic directory listing), placement (contextual links inside main body content carry more weight than footer, sidebar, or author-bio links), the anchor text and surrounding sentiment, and freshness of the linking page. The leak also indicated Google weights links from pages that themselves receive traffic and clicks, which fits the broader pattern: links that real humans actually encounter count most.
Anchor text
Anchor text helps Google understand what the target page is about, and it is also the easiest link signal to over-optimize. A natural profile is dominated by branded anchors, naked URLs, and generic phrases, with exact-match anchors as a small minority. If your exact-match anchor percentage looks like a campaign, SpamBrain agrees with you.
Link diversity
A healthy profile shows diversity across linking domains, IP ranges, link types (editorial, citations, directories, resource pages), and target pages. One hundred links from a hundred relevant domains beats a thousand links from ten. Links from sites sharing an IP or hosting footprint get collapsed in value, which is the fundamental flaw in private blog networks: the infrastructure leaves fingerprints.
Link velocity
Link velocity, the rate at which you gain or lose links, is read in context. A natural spike from a press mention or viral piece is fine. A steady drip of identical-pattern links from low-quality sources looks manufactured. Sustained negative velocity, where links decay faster than you earn them, suggests fading relevance. The goal is steady, content-driven accumulation.
Nofollow, sponsored, and UGC links
Since 2019 Google treats nofollow as a hint rather than a directive, meaning a nofollow link from a major publication can still carry value. Beyond any direct effect, links from places like Wikipedia drive referral traffic, brand exposure, and follow-on citations from researchers and journalists. Mark paid links rel=”sponsored” and user-generated links rel=”ugc”. Referral traffic itself is an overlooked benefit: visitors arriving from a relevant link convert well and generate the engagement signals that compound everything else.
301 redirects
301 redirects pass PageRank, and Google has confirmed there is no inherent loss through a clean redirect. What kills migrations is sloppiness: redirect chains, mapping old pages to irrelevant new ones, or redirecting everything to the homepage. Map old URLs to their closest equivalents, one hop each.
Toxic links and the disavow tool
Links from spam networks, link farms, and irrelevant foreign-language directories are mostly just ignored by SpamBrain rather than penalized, which is why Google now says most sites never need the disavow tool. The exceptions: if you receive a manual action for unnatural links, or you have a documented history of link buying you are cleaning up, disavow is the right instrument. Buying and selling links that pass PageRank remains a guidelines violation on both sides of the transaction. Reciprocal links between genuinely related businesses are fine; reciprocal link schemes at scale are a pattern, and patterns get caught. The same logic applies to forum profile links and comment spam, which stopped working a decade ago but still get sold.
User Interaction and Engagement Signals
For years Google publicly minimized click data while SEOs observed otherwise. The antitrust trial settled the argument: Navboost uses aggregated click and interaction data as a meaningful ranking input. The leak added supporting detail, including the distinction between long clicks and short clicks.
Organic CTR. When your result earns more clicks than its position predicts, that is evidence searchers find it relevant. Titles, descriptions, rich results, and brand recognition all feed this. CTR gains compound: better snippets earn clicks, clicks support rankings, rankings produce impressions.
Long clicks versus pogo-sticking. A long click means the searcher clicked your result and ended their search, the strongest possible vote that you answered the question. Pogo-sticking, where they bounce back to the results and click a competitor, is the opposite. You influence this by matching search intent precisely and putting the answer where it is immediately findable.
Bounce rate, time on page, and pages per session. Google does not consume your Analytics data, and raw bounce rate is not a ranking factor; a user who gets their answer in ten seconds and leaves satisfied is a success. What matters is the search-behavior version of these metrics: did the visit end the search chain or continue it? Build pages that resolve intent, then offer a genuinely relevant next step.
Direct and returning traffic. People who navigate straight to your site or come back repeatedly are demonstrating that your site is a destination, behavior consistent with how Google identifies trusted brands. You cannot fake this; you earn it with content worth bookmarking.
Comments and community. An active, moderated comment section signals an engaged audience and adds fresh supplementary content. Unmoderated comment spam does the opposite. If you cannot moderate, close comments.
The unifying principle: backlinks get you into the top ten, and user behavior determines whether you stay. Navboost needs click data to evaluate you, and you only generate click data once you are ranking, which is why engagement functions as the incumbency test.
Brand Signals and E-E-A-T
Google’s systems show a clear preference for recognizable, trusted brands, partly because brands accumulate exactly the signals the algorithm rewards.
Branded search volume is the cleanest brand signal: people typing your name into Google is demand the algorithm can measure directly. Unlinked brand mentions in news coverage, forums, and industry publications function as implied endorsements; Google has patents describing exactly this. Brand-name anchor text dominating your link profile is what natural linking looks like. Reviews and reputation matter sitewide: the quality rater guidelines explicitly direct raters to research a site’s reputation, and a pattern of negative reviews on BBB, Trustpilot, or Google itself is reputational evidence. For local businesses, review volume, recency, and your responses feed directly into local rankings.
E-E-A-T (Experience, Expertise, Authoritativeness, Trustworthiness) is the framework tying this together. It is not a single score but the lens through which quality raters and, indirectly, the systems trained on their judgments evaluate sites. The stakes scale with the topic: for YMYL (Your Money or Your Life) content covering health, finance, and safety, Google’s systems give far more weight to authority and far less benefit of the doubt. In these verticals, author credentials, medical or expert review, citations to primary sources, and verifiable organizational identity are not optional extras; they are the cost of entry.
Round out the brand footprint with consistent presence on LinkedIn, Facebook, and a complete Google Business Profile. Social shares are not direct ranking factors, and Google has confirmed it does not use them as such, but social distribution gets content in front of the people who do link, cite, and search for you by name. Buying followers that evaporate in a week does nothing but skew your own data.
Local and Niche Relevance
Local rankings run on a partially separate system with three pillars Google states openly: relevance (does your listing match the query), distance (how close you are to the searcher), and prominence (how well-known and well-reviewed you are, online and off).
The practical levers: a fully built Google Business Profile with accurate categories, photos, and posts; NAP consistency, meaning your name, address, and phone number match exactly across your site, GBP, and every citation source, because mismatched NAP data fragments your entity in Google’s understanding; reviews with steady velocity and owner responses; localized content and landing pages for each service area; LocalBusiness schema; and links plus mentions from local sources such as news outlets, chambers of commerce, and community organizations. Google runs a dedicated local news system that surfaces community sources for geo-specific queries, which makes local press coverage doubly valuable.
Niche expertise is the organic-search cousin of local prominence. Google’s systems increasingly reward sites with deep, concentrated topical authority over generalists. A focused site built around pillar content and supporting clusters, what some call a hub keyword strategy, will routinely outrank a higher-authority generalist on specialty queries. Depth beats breadth.
Spam Detection and Demotion Systems
Understanding what Google demotes is half of understanding what it rewards.
SpamBrain is the AI system at the center of modern spam fighting, identifying gibberish and auto-generated content, hacked pages, sneaky redirects, and the link spam that Penguin used to chase. It neutralizes most purchased links silently, which is why bought links so often simply do nothing.
Manual actions are human-issued penalties for violations the algorithms flag for review: unnatural links, thin content, cloaking, structured data abuse. They arrive with a Search Console notification and require a cleanup plus reconsideration request to lift. A manual action on one property can invite scrutiny of everything else you own.
Demotion systems documented in the leak and Google’s own publications include penalties for exact-match-domain abuse, deceptive product review content, and pages with intrusive ads above the fold that push content down. The old Payday Loans update targeting notoriously spammy verticals lives on inside the core algorithm; competitive YMYL niches get the strictest algorithmic treatment, which anyone working in treatment, finance, or legal already knows.
Legal and policy removals also reshape results. Valid DMCA complaints demote repeat-infringer domains, and Google’s personal information removal processes can demote sites built on exposing personal data. Hidden content designed to manipulate (white text on white, CSS-hidden keyword blocks, cloaked pages showing Google different content than users) remains a classic violation. Content hidden in legitimate UI elements like tabs and accordions is fine and indexed normally.
The pattern across every spam system is the same: Google targets intent to manipulate. Techniques that mimic earned signals without earning them have a shelf life, and the cleanup always costs more than the shortcut saved.
Debunked: What You Can Stop Worrying About
Save your budget. The following are confirmed non-factors:
Keyword density. No optimal percentage exists. Write naturally.
LSI keywords. Google does not use latent semantic indexing. Cover topics thoroughly instead.
Word count. No target number. Match depth to intent.
Meta keywords tag. Ignored since 2009.
W3C validation. Not a factor. Fix code errors that break rendering or crawling; ignore the validator badge.
Social share counts. Not directly counted. Valuable only for the exposure that leads to real signals.
Bounce rate from Google Analytics. Google does not use your Analytics data for ranking.
Domain age by itself. Authority comes from what a domain has earned, not how long it has existed.
Spelling and grammar as a direct factor. Confirmed not a factor, though readers care, and reader behavior is.
Conclusion
Strip away the mythology and the durable picture looks like this. Technical health and crawlability get you indexed. Content that demonstrates real experience and covers intent in depth gets you considered. Backlinks from relevant, trafficked pages get you into the top ten. And user behavior, the clicks, long visits, and branded searches that Navboost aggregates, determines whether you stay there.
Every factor in this guide feeds one of those four stages. When you are deciding where to spend the next hour or dollar, ask which stage is your bottleneck, and work on that.
Backlinks will get you to the top 10 in Google. User engagement will keep you there.
FAQs
Published on: 2020-01-02
Updated on: 2026-06-12