nelscorrea.github.io



PyData Miami 2022

https://pydata.org/miami2022/

09/22/2022


Title: Enterprise Semantic Search with Python Large Language Models

Speaker: Nelson Correa, Ph.D.
Twitter: @nelscorrea
LinkedIn: https://linkedin.com/in/ncorrea

Abstract:

Enterprise search is a key use case in big data and business computing. In this talk we introduce enterprise semantic search with Large Language Models (LLMs) and present a working demonstration in the financial domain. Semantic search retrieves documents based on representations of meaning rather than on literal matches between document and query keywords. We use the HuggingFace transformers library, together with related Python libraries for NLP and deep learning (TensorFlow, scikit-learn and UMAP). We also introduce approaches, data visualization, metrics and datasets for evaluating search systems. The talk will be of interest to developers working on text search and on new applications of unstructured data. Slides and a demo notebook will be available at the time of PyData Miami 2022.
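
To make the idea of search over meaning representations concrete, here is a minimal sketch of the ranking step: documents and the query are represented as vectors, and documents are ordered by cosine similarity to the query. This is not the talk's demo code. The function names (cosine_similarity, semantic_search) and the three-dimensional vectors are hypothetical, hand-made stand-ins; in a real system the vectors would come from a transformer encoder and have hundreds of dimensions.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def semantic_search(query_vec, doc_vecs, top_k=3):
    """Return the top_k (document, score) pairs, ranked by similarity to the query."""
    scored = [(doc, cosine_similarity(query_vec, vec)) for doc, vec in doc_vecs.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# ASSUMPTION: toy, hand-made "embeddings" for illustration only.
# A real pipeline would compute these with a language model encoder.
DOC_VECS = {
    "quarterly earnings rose": [0.9, 0.1, 0.0],
    "profit increased this quarter": [0.8, 0.2, 0.1],
    "office relocation notice": [0.0, 0.1, 0.9],
}

# Stand-in vector for a query such as "did revenue grow?"; note that the
# query shares no keywords with the two financial documents it should match.
QUERY_VEC = [0.85, 0.15, 0.05]

results = semantic_search(QUERY_VEC, DOC_VECS, top_k=2)
for doc, score in results:
    print(f"{score:.3f}  {doc}")
```

With these toy vectors, the two finance-related documents rank above the unrelated notice even though neither shares a keyword with the query, which is the behavior a keyword index alone would not give.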

Enterprise Semantic Search with Python Large Language Models



Jupyter notebook

Materials