When working with discord messages, emails, RSS items we have a lot of time series feeds with roughly corresponding data formats. We don't give the LLM much of a hint on how to perform such a process and there's likely room for improvement in the <guide> we provide.
We could also attack this from the view of #737 by connect being a somewhat well-known keyword to trigger normalization and merge behaviour.