07. PowerPoint

Microsoft PowerPoint

Microsoft PowerPoint is a presentation program developed by Microsoft.

This section covers how to load a Microsoft PowerPoint document into a document format that can be used downstream.

Please refer to the official documentation for detailed Unstructured setup instructions.

# Installing the package
# !pip install -qU unstructured python-pptx
from langchain_community.document_loaders import UnstructuredPowerPointLoader

# Create the UnstructuredPowerPointLoader
loader = UnstructuredPowerPointLoader("./data/sample-ppt.pptx")

# Load the data
docs = loader.load()

# Print the number of loaded documents
print(len(docs))
1
Unstructured creates different "elements" for different chunks of text.

By default, these elements are combined into a single document, which is why the count above is 1. You can easily keep the elements separate by specifying mode="elements".
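
The following is a minimal sketch of elements mode, not a definitive recipe. It assumes the same sample file as above. The helper names `load_pptx_elements` and `count_categories` are hypothetical and exist only for illustration. It also assumes each returned document carries a "category" metadata key describing its element type, so the lookup falls back to "unknown" if the key is missing.

```python
from collections import Counter


def load_pptx_elements(path):
    """Load a .pptx file so that each Unstructured element becomes its own document."""
    # Imported inside the function so count_categories can be used on its own.
    from langchain_community.document_loaders import UnstructuredPowerPointLoader

    loader = UnstructuredPowerPointLoader(path, mode="elements")
    return loader.load()


def count_categories(docs):
    """Count documents per element type.

    Assumption: elements mode stores the element type under the
    "category" metadata key; missing keys are counted as "unknown".
    """
    return Counter(doc.metadata.get("category", "unknown") for doc in docs)


# Usage, assuming ./data/sample-ppt.pptx exists:
# docs = load_pptx_elements("./data/sample-ppt.pptx")
# print(len(docs))               # typically more than 1 in elements mode
# print(count_categories(docs))  # number of documents per element type
# print(docs[0].page_content)    # text of the first element
```

Keeping elements separate is useful when you want to filter or chunk by element type, for example by treating slide titles differently from body text.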
