From Air Quality to Aircraft
Apache NiFi
Snowflake
Apache Iceberg
AI
GenAI
LLM
RAG
https://www.dbta.com/DataSummit/2025/Timothy-Spann.aspx
Tim Spann is a Senior Sales Engineer at Snowflake. He works with Generative AI, LLM, Snowflake, SQL, HuggingFace, Python, Java, Apache NiFi, Apache Kafka, Apache Pulsar, Apache Flink, Flink SQL, Apache Spark, Big Data, IoT, Cloud, AI/DL, Machine Learning, and Deep Learning. Tim has over ten years of experience with IoT, big data, distributed computing, messaging, streaming technologies, and Java programming. Previously, he was a Principal Developer Advocate at Zilliz, a Principal Developer Advocate at Cloudera, a Developer Advocate at StreamNative, a Principal DataFlow Field Engineer at Cloudera, a Senior Solutions Engineer at Hortonworks, a Senior Solutions Architect at AirisData, a Senior Field Engineer at Pivotal, and a Senior Team Leader at HPE. He blogs for DZone, where he is the Big Data Zone leader, and runs a popular meetup in Princeton and NYC on Big Data, Cloud, IoT, deep learning, streaming, NiFi, blockchain, and Spark. Tim is a frequent speaker at conferences such as ApacheCon, DeveloperWeek, Pulsar Summit, and many more. He holds a BS and MS in Computer Science.
https://github.com/tspannhw/SpeakerProfile
https://www.dbta.com/DataSummit/2025/program.aspx#17305
From Air Quality to Aircraft & Automobiles, Unstructured Data Is Everywhere
Spann explores how Apache NiFi can be used to integrate open source LLMs to implement scalable and efficient RAG pipelines. He shows how data of any kind, whether structured, semi-structured, or unstructured, and from a variety of sources, can be processed, queried, and used to feed large language models for smart, contextually aware answers. Look for his example utilizing Cortex AI, Llama, Apache NiFi, Apache Iceberg, Snowflake, open source tools, libraries, and Notebooks.
Speaker:
Timothy Spann, Senior Solutions Engineer, Snowflake
May 14, 2025
Boston