Dataframe
 
1.0

joinWith

Edit pageLast modified: 20 May 2025

Joins two DataFrame objects by a join expression.

This function is a join variant that lets you match data using any expression that returns a Boolean, which also gives opportunity to perform operations that require values from both matching rows. Can be helpful if the data you want to join wasn't designed relational and requires heuristics to tell if rows are matching, or has relations other than equals.

For example, you can match rows based on:

  • Order relations such as >, <, in for numerical or DateTime values

  • Spatial relations, like distance within a certain range if your data includes spatial or geographical values

  • String equivalence using more complex comparison techniques, such as contains, regular expressions, Levenshtein Distance or language models.