> ## Documentation Index > Fetch the complete documentation index at: https://docs.sourcebot.dev/llms.txt > Use this file to discover all available pages before exploring further. # Linking code from GitHub Sourcebot can sync code from GitHub.com, GitHub Enterprise Server, and GitHub Enterprise Cloud. If you're not familiar with Sourcebot [connections](/docs/connections/overview), please read that overview first. ## Examples ```json theme={null} { "type": "github", "repos": [ "sourcebot-dev/sourcebot", "getsentry/sentry", "torvalds/linux" ] } ``` ```json theme={null} { "type": "github", "orgs": [ "sourcebot-dev", "getsentry", "vercel" ] } ``` ```json theme={null} { "type": "github", "users": [ "torvalds", "ggerganov" ] } ``` ```json theme={null} { "type": "github", // Sync all repos in `my-org` that have a topic that... "orgs": [ "my-org" ], // ...match one of these glob patterns. "topics": [ "test-*", "ci-*", "k8s" ] } ``` ```json theme={null} { "type": "github", // Include all repos in my-org... "orgs": [ "my-org" ], // ...except: "exclude": { // repos that are archived "archived": true, // repos that are forks "forks": true, // repos that match these glob patterns "repos": [ "my-org/repo1", "my-org/repo2", "my-org/sub-org-1/**", "my-org/sub-org-*/**" ], "size": { // repos that are less than 1MB (in bytes)... "min": 1048576, // or repos greater than 100MB (in bytes) "max": 104857600 }, // repos with topics that match these glob patterns "topics": [ "test-*", "ci" ] } } ``` ## Authenticating with GitHub In order to index private repositories, you'll need to authenticate with GitHub. Sourcebot supports the following mechanisms of authenticating a GitHub connection: This feature is only available with an active Enterprise license. Please add your [license key](/docs/license-key) to activate it. Register a new [GitHub App](https://docs.github.com/en/apps/creating-github-apps/registering-a-github-app/registering-a-github-app#registering-a-github-app) and provide it with the following permissions: * “Contents” repository permissions (read) * “Metadata” repository permissions (read) * “Members” organization permissions (read) * “Email addresses” account permissions (read) This can be the same GitHub App you've registered and configured as an [external identity provider](/docs/configuration/idp#github) Install the GitHub App into the GitHub orgs that you want Sourcebot to be aware of. **Sourcebot will only be able to index repos from orgs with the GitHub App installed.** Create a [private key](https://docs.github.com/en/apps/creating-github-apps/authenticating-with-a-github-app/managing-private-keys-for-github-apps) for the GitHub App. Create a new `apps` object in the Sourcebot [config file](/docs/configuration/config-file). The private key you created in the previous step must be passed in as a [token](/docs/configuration/config-file#tokens). ```json wrap icon="code" theme={null} "apps": [ { "type": "github", // must be github "id": "1234567", // Your GitHub App ID "privateKey": { "env": "GITHUB_APP_PRIVATE_KEY" // Token which contains your Github App private key } } ] ``` That's it! Sourcebot will now use this GitHub App to authenticate when pulling repos for this connection. Create a new fine-grained PAT [here](https://github.com/settings/personal-access-tokens/new). Select the resource owner and the repositories that you want Sourcebot to have access to. Next, under "Repository permissions", select permissions `Contents` and `Metadata` with access `Read-only`. The permissions should look like the following: GitHub PAT Scope [GitHub docs](https://docs.github.com/en/authentication/keeping-your-account-and-data-secure/managing-your-personal-access-tokens#fine-grained-personal-access-tokens) Next, provide the PAT via a [token](/docs/configuration/config-file#tokens) which is referenced in the `token` field in the [connection](/docs/connections/overview) config object. The most common mechanism of doing this is defining an environment variable that holds the PAT: ```json theme={null} { "type": "github", "token": { // note: this env var can be named anything. It // doesn't need to be `GITHUB_TOKEN`. "env": "GITHUB_TOKEN" } // .. rest of config .. } ``` That's it! Sourcebot will now use this PAT to authenticate when pulling repos for this connection. Create a new PAT [here](https://github.com/settings/tokens/new) and make sure you select the `repo` scope: GitHub PAT Scope [GitHub docs](https://docs.github.com/en/authentication/keeping-your-account-and-data-secure/managing-your-personal-access-tokens#personal-access-tokens-classic) Next, provide the PAT via a [token](/docs/configuration/config-file#tokens) which is referenced in the `token` field in the [connection](/docs/connections/overview) config object. The most common mechanism of doing this is defining an environment variable that holds the PAT: ```json theme={null} { "type": "github", "token": { // note: this env var can be named anything. It // doesn't need to be `GITHUB_TOKEN`. "env": "GITHUB_TOKEN" } // .. rest of config .. } ``` That's it! Sourcebot will now use this PAT to authenticate when pulling repos for this connection. ## Connecting to a custom GitHub host To connect to a GitHub host other than `github.com`, provide the `url` property to your config: ```json theme={null} { "type": "github", "url": "https://github.example.com" // .. rest of config .. } ``` ## Schema reference [schemas/v3/github.json](https://github.com/sourcebot-dev/sourcebot/blob/main/schemas/v3/github.json) ```json theme={null} { "$schema": "http://json-schema.org/draft-07/schema#", "type": "object", "title": "GithubConnectionConfig", "properties": { "type": { "const": "github", "description": "GitHub Configuration" }, "token": { "description": "A Personal Access Token (PAT).", "anyOf": [ { "type": "object", "properties": { "env": { "type": "string", "description": "The name of the environment variable that contains the token." } }, "required": [ "env" ], "additionalProperties": false }, { "type": "object", "properties": { "googleCloudSecret": { "type": "string", "description": "The resource name of a Google Cloud secret. Must be in the format `projects//secrets//versions/`. See https://cloud.google.com/secret-manager/docs/creating-and-accessing-secrets" } }, "required": [ "googleCloudSecret" ], "additionalProperties": false } ] }, "url": { "type": "string", "format": "url", "default": "https://github.com", "description": "The URL of the GitHub host. Defaults to https://github.com", "examples": [ "https://github.com", "https://github.example.com" ], "pattern": "^https?:\\/\\/[^\\s/$.?#].[^\\s]*$" }, "users": { "type": "array", "items": { "type": "string", "pattern": "^[\\w.-]+$" }, "default": [], "examples": [ [ "torvalds", "DHH" ] ], "description": "List of users to sync with. All repositories that the user owns will be synced, unless explicitly defined in the `exclude` property." }, "orgs": { "type": "array", "items": { "type": "string", "pattern": "^[\\w.-]+$" }, "default": [], "examples": [ [ "my-org-name" ], [ "sourcebot-dev", "commaai" ] ], "description": "List of organizations to sync with. All repositories in the organization visible to the provided `token` (if any) will be synced, unless explicitly defined in the `exclude` property." }, "repos": { "type": "array", "items": { "type": "string", "pattern": "^[\\w.-]+\\/[\\w.-]+$" }, "default": [], "description": "List of individual repositories to sync with. Expected to be formatted as '{orgName}/{repoName}' or '{userName}/{repoName}'." }, "topics": { "type": "array", "items": { "type": "string" }, "minItems": 1, "default": [], "description": "List of repository topics to include when syncing. Only repositories that match at least one of the provided `topics` will be synced. If not specified, all repositories will be synced, unless explicitly defined in the `exclude` property. Glob patterns are supported.", "examples": [ [ "docs", "core" ] ] }, "exclude": { "type": "object", "properties": { "forks": { "type": "boolean", "default": false, "description": "Exclude forked repositories from syncing." }, "archived": { "type": "boolean", "default": false, "description": "Exclude archived repositories from syncing." }, "repos": { "type": "array", "items": { "type": "string" }, "default": [], "description": "List of individual repositories to exclude from syncing. Glob patterns are supported." }, "topics": { "type": "array", "items": { "type": "string" }, "default": [], "description": "List of repository topics to exclude when syncing. Repositories that match one of the provided `topics` will be excluded from syncing. Glob patterns are supported.", "examples": [ [ "tests", "ci" ] ] }, "size": { "type": "object", "description": "Exclude repositories based on their disk usage. Note: the disk usage is calculated by GitHub and may not reflect the actual disk usage when cloned.", "properties": { "min": { "type": "integer", "description": "Minimum repository size (in bytes) to sync (inclusive). Repositories less than this size will be excluded from syncing." }, "max": { "type": "integer", "description": "Maximum repository size (in bytes) to sync (inclusive). Repositories greater than this size will be excluded from syncing." } }, "additionalProperties": false } }, "additionalProperties": false }, "revisions": { "type": "object", "description": "The revisions (branches, tags) that should be included when indexing. The default branch (HEAD) is always indexed. A maximum of 64 revisions can be indexed, with any additional revisions being ignored.", "properties": { "branches": { "type": "array", "description": "List of branches to include when indexing. For a given repo, only the branches that exist on the repo's remote *and* match at least one of the provided `branches` will be indexed. The default branch (HEAD) is always indexed. Glob patterns are supported. A maximum of 64 branches can be indexed, with any additional branches being ignored.", "items": { "type": "string" }, "examples": [ [ "main", "release/*" ], [ "**" ] ], "default": [] }, "tags": { "type": "array", "description": "List of tags to include when indexing. For a given repo, only the tags that exist on the repo's remote *and* match at least one of the provided `tags` will be indexed. Glob patterns are supported. A maximum of 64 tags can be indexed, with any additional tags being ignored.", "items": { "type": "string" }, "examples": [ [ "latest", "v2.*.*" ], [ "**" ] ], "default": [] } }, "additionalProperties": false } }, "required": [ "type" ], "additionalProperties": false } ``` ## See also * [Syncing GitHub Access permissions to Sourcebot](/docs/features/permission-syncing#github)