🔎
Google Search for beginners
Home
  • Introduction
  • Google Search Essentials
    • Overview
    • Google Search Technical Requirements
    • Spam Policies
  • SEO Basics
    • SEO Beginner's Guide
    • How Google Search Works
    • Creating Helpful, Reliable Content
    • Do You Need an SEO Expert?
    • Maintaining Your Website’s SEO
    • Developer's Guide to Google Search
    • How to Get Your Website Listed on Google
  • crawling and indexing
    • Overview
    • File formats Google can index
    • URL structure
    • Links
    • Sitemaps
      • Create and submit a sitemap
      • Manage your sitemaps
      • Image-specific sitemaps
      • News-oriented sitemaps
      • Video sitemaps and alternatives
      • Combining different sitemap types
    • Managing Google Crawlers
      • Reducing the crawl rate of Googlebot
      • Verifying the Googlebot and other Google crawlers
      • Managing Crawl Budget for Large Sites
      • HTTP Status Codes, Network, and DNS Errors
      • Types of Google Crawlers
      • Googlebot Explained
      • Google Read Aloud Service
      • Google API
      • Understanding Feedfetcher
    • Robots.txt
      • Creating and Submitting Robots.txt
      • Updating Robots.txt
      • Google's Interpretation of Robots.txt
    • Canonicalization
      • Specifying Canonicals Using rel="canonical" and Other Methods
      • Resolving Canonicalization Issues
    • Canonicalization for Mobile Sites and Mobile-First Indexing
    • AMP (Accelerated Mobile Pages)
      • Understanding How AMP Works in Search Results
      • Enhancing Your AMP Content
      • Validating AMP Content
      • Removing AMP Content
    • JavaScript
      • Fixing Search-Related JavaScript Issues
      • Resolving Issues with Lazy-Loaded Content
      • Using Dynamic Rendering as a Workaround
    • Page and Content Metadata
      • Meta Tags
      • Using Robots Meta Tag, data-nosnippet, and X-Robots-Tag noindex
      • noindex Explained
      • rel Attributes
    • Removals
      • Removing Pages from Search Results
      • Removing Images from Search Results
      • Handling Redacted Information
    • Redirects and Google Search
      • Switching Website Hosting Services
      • Handling URL Changes During Site Moves
      • A/B Testing for Sites
      • Pause or Disable a Website
Powered by GitBook
On this page
  1. crawling and indexing

File formats Google can index

File Types Indexable by Google

Google Search aims to provide access to the widest possible range of online information. To that end, its indexing capabilities extend beyond simple web pages to encompass a diverse array of file formats.

Text-Based and Document Formats

Google can index the content of most text-based files, allowing users to search for information contained within them. This includes common document formats used for a variety of purposes, such as:

Portable Document Formats:

  • Adobe Portable Document Format (.pdf)

Word Processing Formats:

  • Hancom Hanword (.hwp)

  • Microsoft Word (.doc, .docx)

  • OpenOffice text (.odt)

  • Rich Text Format (.rtf)

Spreadsheet Formats:

  • Comma-Separated Values (.csv)

  • Microsoft Excel (.xls, .xlsx)

  • OpenOffice spreadsheet (.ods)

Presentation Formats:

  • Microsoft PowerPoint (.ppt, .pptx)

  • OpenOffice presentation (.odp)

Other Notable Formats:

  • Adobe PostScript (.ps)

  • Electronic Publication (.epub)

  • Google Earth (.kml, .kmz)

  • GPS eXchange Format (.gpx)

  • HTML (.htm, .html, other file extensions)

  • TeX/LaTeX (.tex)

  • Text (.txt, .text, other file extensions)

Source Code: Recognizing the importance of software development and the wealth of information contained within source code, Google indexing also covers common programming languages:

  • Basic source code (.bas)

  • C/C++ source code (.c, .cc, .cpp, .cxx, .h, .hpp)

  • C# source code (.cs)

  • Java source code (.java)

  • Perl source code (.pl)

  • Python source code (.py)

Mobile Formats: Content designed for mobile devices is also readily indexed:

  • Wireless Markup Language (.wml, .wap)

Data Formats: Structured data formats are crucial for information exchange, and Google indexing supports:

  • XML (.xml)

Media Formats

Beyond text-based content, Google indexing extends to various media formats, allowing users to discover images and videos relevant to their searches.

Image Formats:

  • BMP

  • GIF

  • JPEG

  • PNG

  • WebP

  • SVG

Video Formats:

  • 3GP

  • 3G2

  • ASF

  • AVI

  • DivX

  • M2V

  • M3U

  • M3U8

  • M4V

  • MKV

  • MOV

  • MP4

  • MPEG

  • OGV

  • QVT

  • RAM

  • RM

  • VOB

  • WebM

  • WMV

  • XAP

Search by File Type

To refine search results and focus on specific file types, users can leverage the filetype: operator within Google Search. For example:

  • Searching for architectural blueprints in PDF format: filetype:pdf blueprints sydney opera house

  • Finding research papers on climate change in DOCX format: filetype:docx climate change impact on coral reefs

  • Discovering datasets on population growth in CSV format: filetype:csv global population growth statistics

By specifying the desired file type, users can efficiently locate the information they need in their preferred format. This powerful feature enhances the precision and efficiency of information retrieval using Google Search.

PreviousOverviewNextURL structure

Last updated 10 months ago