> ## Documentation Index
> Fetch the complete documentation index at: https://docs.tensorlake.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Get

Get the details of a specific dataset associated with your project.

This endpoint allows you to retrieve information about the dataset, including its ID, name, description, and any associated metadata. The dataset's settings can be modified using the [Update Dataset](./update) endpoint.

The properties of the dataset include:

* `name`: The name given to the dataset.
* `dataset_id`: The unique identifier for the dataset.
* `description`: A brief description of the dataset if provided.
* `status`: The current status of the dataset (e.g., `idle`, `processing`). If the dataset has at least one parse job in the
  `processing` or `pending` state, the dataset status will be `processing`.
* `created_at`: The timestamp when the dataset was created, formatted as a RFC 3339 string.
* `updated_at`: The timestamp when the dataset was last updated, formatted as a RFC 3339 string.


## OpenAPI

````yaml get /documents/v2/datasets/{dataset_id}
openapi: 3.1.0
info:
  title: Tensorlake API
  description: >-
    Tensorlake Cloud APIs for Sandboxes, Document Ingestion, and Serverless
    Workflows
  license:
    name: ''
  version: 0.1.0
servers:
  - url: https://api.tensorlake.ai/
security:
  - bearerAuth: []
tags:
  - name: Tensorlake Cloud API
    description: >-
      Tensorlake Cloud APIs for Sandboxes, Document Ingestion, and Serverless
      Workflows
paths:
  /documents/v2/datasets/{dataset_id}:
    get:
      tags:
        - datasets
      operationId: get_dataset_v2
      parameters:
        - name: dataset_id
          in: path
          description: The ID of the dataset to retrieve
          required: true
          schema:
            type: string
        - name: include_analytics
          in: query
          description: >-
            Retrieve the dataset analytics.


            When set to `true`, the response will include the dataset's
            analytics

            data. Including:

            - Number of running parsing jobs

            - Number of completed parsing jobs

            - Number of failed parsing jobs

            - Number of pending parsing jobs


            Defaults to `false`.
          required: false
          schema:
            type: boolean
      responses:
        '200':
          description: Dataset retrieved successfully
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/Dataset'
        '401':
          description: Unauthorized. Invalid or missing credentials
        '403':
          description: Forbidden. You do not have permission to access this resource
        '404':
          description: Dataset not found
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/ApiError'
        '500':
          description: Internal server error
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/ApiError'
components:
  schemas:
    Dataset:
      type: object
      required:
        - name
        - dataset_id
        - status
        - created_at
        - updated_at
      properties:
        name:
          type: string
          description: |-
            The name of the dataset.

            This is a human-readable name that identifies the dataset.
          example: Invoices Dataset
        dataset_id:
          type: string
          description: |-
            The unique identifier for the dataset.

            This identifier is used to refer to the dataset in API endpoints and
            operations.

            This value is automatically generated and is unique within the
            organization and project context.
          example: dataset_12345
        description:
          type:
            - string
            - 'null'
          description: >-
            An optional description of the dataset.


            This description is the one provided during dataset creation or
            update.
          example: This dataset contains invoices for the year 2023.
        status:
          $ref: '#/components/schemas/DatasetStatus'
          description: |-
            The current status of the dataset.

            This indicates whether the dataset is currently idle or processing.
        created_at:
          type: string
          description: |-
            The date and time when the dataset was created.

            The data is in RFC 3339 format (e.g., "2023-10-01T12:00:00Z").
          example: '2023-10-01T12:00:00Z'
        updated_at:
          type: string
          description: |-
            The date and time when the dataset was last updated.

            The data is in RFC 3339 format (e.g., "2023-10-01T12:00:00Z").
          example: '2023-10-01T12:00:00Z'
        analytics:
          oneOf:
            - type: 'null'
            - $ref: '#/components/schemas/DatasetParseJobAnalytics'
              description: >-
                Understand the status of the dataset and its parse jobs.


                This field provides insights into the dataset's processing
                state,

                including the number of parse jobs in various states
                (processing,

                pending, error, successful).


                To retrieve detailed analytics, you can pass the
                `include_analytics`

                query parameter


                This is useful for monitoring and analytics purposes.
    ApiError:
      type: object
      required:
        - message
        - code
        - timestamp
      properties:
        message:
          type: string
          description: A human-readable error message
        code:
          $ref: '#/components/schemas/ApiErrorCode'
          description: The error code, which can be used to programmatically handle errors
        timestamp:
          type: integer
          format: int64
          description: Millis since Unix epoch; easy to parse in every language
        trace_id:
          type:
            - string
            - 'null'
          description: Optional request correlation-id for distributed tracing
        details:
          description: Optional field-level validation errors, etc.
    DatasetStatus:
      type: string
      enum:
        - idle
        - processing
    DatasetParseJobAnalytics:
      type: object
      required:
        - total_processing_parse_jobs
        - total_pending_parse_jobs
        - total_error_parse_jobs
        - total_successful_parse_jobs
        - total_jobs
      properties:
        total_processing_parse_jobs:
          type: integer
          format: int64
          description: The total number of parse jobs that are on the `processing` state.
        total_pending_parse_jobs:
          type: integer
          format: int64
          description: >-
            The total number of parse jobs that are on the `pending` state.


            Pending parse jobs are those that have been created but not yet
            started

            processing.
        total_error_parse_jobs:
          type: integer
          format: int64
          description: >-
            The total number of parse jobs that have encountered an error during

            processing.


            These jobs have failed to complete successfully and require
            attention.
        total_successful_parse_jobs:
          type: integer
          format: int64
          description: >-
            The total number of parse jobs that have been successfully
            processed.


            These jobs have completed without errors and have produced results.
        total_jobs:
          type: integer
          format: int64
          description: >-
            The total number of parse jobs that have been created for the
            dataset.
    ApiErrorCode:
      oneOf:
        - type: string
          enum:
            - QUOTA_EXCEEDED
        - type: string
          enum:
            - INVALID_JSON_SCHEMA
        - type: string
          enum:
            - INVALID_CONFIGURATION
        - type: string
          enum:
            - INVALID_PAGE_CLASSIFICATION
        - type: string
          enum:
            - ENTITY_NOT_FOUND
        - type: string
          enum:
            - ENTITY_ALREADY_EXISTS
        - type: string
          enum:
            - INVALID_FILE
        - type: string
          enum:
            - INVALID_PAGE_RANGE
        - type: string
          enum:
            - INVALID_MIME_TYPE
        - type: string
          enum:
            - INVALID_DATASET_NAME
        - type: string
          enum:
            - INVALID_JOB_STATE
        - type: string
          enum:
            - INTERNAL_ERROR
        - type: string
          enum:
            - INVALID_MULTIPART
        - type: string
          enum:
            - MULTIPART_STREAM_END
        - type: string
          enum:
            - CLIENT_DISCONNECT
        - type: string
          enum:
            - INVALID_ID
        - type: object
          required:
            - INVALID_QUERY_PARAMS
          properties:
            INVALID_QUERY_PARAMS:
              type: object
              required:
                - property
              properties:
                property:
                  type: string
                message:
                  type:
                    - string
                    - 'null'
  securitySchemes:
    bearerAuth:
      type: http
      scheme: bearer

````