I am struggling the intel intrinsics, and come up with the following assumption.
If we have four instructions to execute. The CPI's of them are all 0.5. Then if there is NOT any dependency among them, the lowest number of cycles to execute them is 2, since every two of them can feed one cycle.
Am I correct?