I tried with the same dataset:
with prompt magic:
without prompt magic:
more complex query, with prompt magic:
same complex query, without prompt magic: (this shows the training set is not enough)