My regex against the text "https://www.newtest.com/testing" is
(?=\b((?<![^\p{Alnum}\p{Punct}])(\p{Alnum}+\p{Punct}\p{Alnum}+){2})\b)
My expected tokens are:
www.newtest.com
newtest.com/testing
The above regex is spitting these tokens but alongwith that, it is also giving two extra tokens t.com and m/testing which are not required.