I'm still toying with things and just learning this, so please forgive any incorrect terminology.
My toy data set is a collection of recipes with a fairly significant overlap in ingredients. I'm using these as my features, and using Pearson squared distance as described in Programming Collective Intelligence to determine the correlation between recipes. I'm not looking to "train" anything; I currently would just like to compare the recipes in my corpus and find $X$ nearest and $Y$ farthest neighbors.
I thought it'd be fun to expand this, and include people's reviews/comments on the recipes as part of the distance calculation, leading to a closer association between recipes that were both described as "sweet" or "tart" or "sour". I don't know how to do this along with the other features though; I've found examples of bags-of-words for similarity, but I can't seem to find any information about combining it with other features.
Is this a valid approach, and is there some terminology that I should be looking for? Thanks.