A document loader for loading data from PDFs.

Example

const loader = new WebPDFLoader(new Blob());
const docs = await loader.load();
console.log({ docs });

Hierarchy

Constructors

  • Parameters

    • blob: Blob
    • __namedParameters: {
          parsedItemSeparator: undefined | string;
          pdfjs: undefined | (() => Promise<{
              getDocument: ((src) => PDFDocumentLoadingTask);
              version: string;
          }>);
          splitPages: undefined | boolean;
      } = {}
      • parsedItemSeparator: undefined | string
      • pdfjs: undefined | (() => Promise<{
            getDocument: ((src) => PDFDocumentLoadingTask);
            version: string;
        }>)
      • splitPages: undefined | boolean

    Returns WebPDFLoader

Properties

blob: Blob
parsedItemSeparator: string
splitPages: boolean = true

Methods

  • Loads the contents of the PDF as documents.

    Returns Promise<Document<Record<string, any>>[]>

    An array of Documents representing the retrieved data.

  • Loads the documents and splits them using a specified text splitter.

    Parameters

    Returns Promise<Document<Record<string, any>>[]>

    A Promise that resolves with an array of Document instances, each split according to the provided TextSplitter.

Generated using TypeDoc