Fractions like those can be considered as binomial random variables. If you have to compare two binomial r.v.s what you do usually is a Pearson's chi-squared test.
Also, in order to have a unique result from all your data, you may want to sum them up among different groups (subject/author), so that you end up with:
- the total number of sentences before review
- the total number of sentences after review
- the total number of understandable sentences before review
- the total number of understandable sentences after review.
On this data, you can make a chi-squared test.
Caveat
The binomial r.v. assumes that each sentence readability is independent on the adjoining sentences readability. This may not be perfectly the case, and if so, it may inflate the significance of your result. This phenomenon is called underdispersion (and the opposite overdispersion) and it's not super easy to solve, so I will slide on it here.
Other options
- If you do some research, you will find that Pearson chi-squared test has alternatives, like Fisher exact test or G test. Those also apply to your case, in case you want to consider them. They will almost certainly give you all the same result anyway.
- If you want to avoid adding up all data from the different groups, there are more complicate alternatives to the simple test I suggested. You can create a mixed effect logit model, with random effect groups and review as a fixed effect. You could also, in
theory, sum up the test statistic from many chi-sqaured tests in order to aggregate it, this is good in theory but fairly heterodox, so I don't recommend it.