12

Why are cost functions typically assumed to be convex in producer theory of (introductory) microeconomics?

For me this goes against the intuition of economies of scale. There are fixed costs (FC) which contribute to concavity of the cost function. There are also variable costs (VC) which may be concave, linear or convex. If we are on the concave part of VC, total costs (TC) must also be concave due to both FC and VC being concave. If we are in the linear part of VC, TC are again concave due to the concavity of FC. And if we are in the convex part of VC, TC may be either concave, linear or convex depending on the relative influence/weight of FC and VC on/in TC. However, even in the case where TC is convex, the producer does not have to operate that way. It can rather operate multiple copies of its production facility each at the level where TC is concave or linear to ensure the TC added over all production facilities is never convex.*,** This makes the assumption of convex costs suspect for me.

I do see one reason why convexity could occur, though. It is if resources are becoming scarce and the producer is big enough to influence the prices in the input markets. However, the producers are assumed to be small in perfect competition, yet their cost functions are assumed to be convex. This appears contradictory to me. So what am I failing to see?

*The fact that FC are incurred with each copy of the production facility might or might not make this a poor strategy depending on the relative weight of FC and VC.
**I think I borrowed the idea for this argument from Varian "Microeconomic Analysis". In 3rd edition, Section 5.2 "The geometry of costs" p. 68 it says:

In the long run all costs are variable costs; in such circumstances increasing average costs seems unreasonable since a firm could always replicate its production process. Hence, the reasonable long-run possibilities should be either constant or decreasing average costs.

The cost function is also shown to be concave in the subsequent section 5.4 "Factor prices and cost functions".


Edit: Thank you for all the great answers! It seems we can have different plausible stories with opposite implications. So far it seems one can plausibly argue for both convex and concave costs. The crux of the matter becomes the assumptions needed to make one story more plausible than the other. Thus the question is, what are the assumptions taken to make convex costs plausible (and concave costs implausible) in introductory microeconomics?

Richard Hardy
  • 2,365
  • 1
  • 14
  • 28
  • If $c$ is your cost, it and you pick two points $x \neq y$, it is not unreasonable to suppose that the cost of $tx+(1-t)y$ (with $t \in [0,1]$) can be no greater than the cost of $t $ 'units' of $x$ combined with $(1-t)$ 'units' of $y$. That is, $c(tx+(1-t)y) \le t c(x)+(1-t)c(y)$. – copper.hat May 18 '21 at 03:47
  • @copper.hat, that would go against the idea of economies of scale and consequently I find it questionable. – Richard Hardy May 18 '21 at 06:06
  • Economies of scale do not say that bigger is better, though. They just say there's some optimal point (which changes with all sorts of variables) where the economy is highest - smaller and bigger is less economical. And even that is assuming you get all the resources you need for the same prices, including labour costs. And even that's still simplified, since you can have many valleys and hills on the economies of scale curve - it goes up and down, up and down... – Luaan May 19 '21 at 05:57
  • @Luaan, that may well be so, but the argument outlined in my post suggests otherwise, thus the question. – Richard Hardy May 19 '21 at 10:40

6 Answers6

11

There are several reasons:

  1. Didactic Reasons: Other users seem to have missed it but in your question you specify you are talking about "(introductory) microeconomics" [emphasis mine].

    Well the most prosaic answer is simply that it is much easier to solve cost minimization, or various other models when costs are assumed to be convex.

    This in itself is sufficient reason to construct problems with convex cost functions in introductory microeconomic courses. Demand and supply are not linear, yet in most textbooks and introductory problem they will be assumed to be linear. In addition, in real life demand can be sometimes even upward sloping if a good is a Giffen good, and supply can actually be downward sloping (e.g. some labor supply in some special cases depending on people's preference between consumption and leisure). Yet introductory textbooks typically show downward sloping demand and upward sloping supply (e.g. see Mankiw Principles of Economics that discusses these concepts but only briefly, or more narrowly micro introductory books such as Frank Microeconomics & Behavior).

    This is to a great degree for didactic reasons. It is much better for students to first master basics with simple models and when it comes to learning about costs having nicely behaved convex cost functions with single minimum makes learning easier than having to teach cost minimization with concave cost curves. Hence, even if empirically most cost curves would be concave not convex it would be very bad teaching practice to start with concave functions (or just go for full blown realism where cost functions might be piecewise, have different concavity/convexity at different points, be ill defined somewhere etc).

  2. Because of Decreasing Returns to Scale - This was covered in great detail by Bayesian, but let me add more arguments and also rebuff some of your arguments in the question.

    First, it is not unreasonable to assume that costs are convex in a long-run. In a world of scarcity firm cannot forever increase its demand for factors of production without affecting costs of these factors or inputs as well, their prices will rise eventually (ceteris paribus). We have crystal clear evidence that wages rise in tight labor markets, or that generally speaking shift in demand to the right (ceteris paribus) rises prices. You argue that in perfect competition models firms are assumed to be small, but that is not a good argument in this case. This is because firms are assumed to be too small in terms of their output being able to affect market price of their output so price of output can be taken as given (See Frank Microeconomics and Behavior pp 337). Perfect competition does not require price of inputs to be taken as given. In fact, firm might operate on perfectly competitive market while facing just monopolistically competitive factor market (where the firm is consumer not producer).

    Next, you argue that thanks to fixed costs one firms could just continuously invest in a new factories, but this argument should be false. A fix cost by definition cannot vary with output. If firm increases output by building new factory, the cost of factory ceases to be fixed costs. In fact fixed costs primarily exist in short-run as in a long-run most costs are variable (see Mankiw Principles of economics pp 260). In a long-run as you try to build more and more factories you run into the same problems of scarcity of land, capital and labor and thus bid up their prices. In fact this is nicely visualized and explained in the Mankiw textbook with the picture below:

enter image description here

Empirically, we observe that many industries have decreasing returns to scale (although constant returns to scale are common as well), and increasing returns to scale are rare (although not completely uncommon). See for example: Basu & Fernald, 1997; Gao & Kehrig 2017.

Introductory texts by their nature will not deal with specific cases but more general ones. Most introductory textbooks again do not spend too much time on Giffen goods not just because modeling them would be difficult for 101 students but also because they are not very often seen (although, I am not claiming non-convex cost functions are as rare as Giffen goods).

  1. On the Aesthetics: I think Giskard raises a valid point that there are probably many economists who assume convex costs just for mathematical elegance. However:

    • I think Giskard slightly exaggerates the problem and is bit too cynical about it. For sure there are economists who value mathematical elegance uber ales, but there is increasing trend in share of empirical papers (see Angrist et al 2017), even in microeconomics, and I think that a reasonable non-cynical explanation for the small share of micro empirical papers is that until very recently there was always lack of good micro data (in addition this is also due to breakdown, you can see the share of industrial organization empirical papers (that also heavily use cost functions) is quite high).

    • Empirically, most industries do not exhibit increasing returns to scale. While non-convex functions are definitely real (especially along some points of cost curve), empirical evidence does show that decreasing returns to scale (although constant returns to scale as well) are quite common (e.g. see Basu & Fernald, 1997; Gao & Kehrig 2017), but I think Giskard has definitely valid point that some modelers will ignore empirics for sake of mathematical elegance.

    • Lastly, but not least, I think mathematical elegance can explain why such assumption features heavily in some published theoretical work, I don't think it can explain why it is featured in introductory micro texts. Is really quadratic cost function $c=q^2$ mathematically elegant? I don't think so but that is probably the most commonly used cost function you will ever see in intro texts.


Regarding the Varian quote. Varian on page 67 states that he will first cover situation with fixed factor costs and later move to variable factor costs. Hence, unless I am misreading Varian I think the statement on the page 68 is made under assumption of constant factor prices. However, the explanation above by Mankiw does not assume that.

1muflon1
  • 56,292
  • 4
  • 53
  • 108
  • Very helpful, thank you. So you would object my quotation of Varian by pointing to the picture from Mankiw? Fair enough, it looks sensible. My discussion of fixed costs was sloppy indeed. I do not think the idea I am trying to convey is dead on arrival, but I should clearly work on its formulation. – Richard Hardy May 17 '21 at 16:23
  • @RichardHardy I would have to read more context of that, I am hesitant to argue with someone as well known as Varian, but I think it could be typo because to me it seems reasonable that would imply that in long run costs are either constant or increasing not decreasing. However, now seeing that quote I must admit that impostor syndrome kicks in and I am bit confused, I have Varians textbook I will have look at the passage and chapter to see what he means. – 1muflon1 May 17 '21 at 16:30
  • I do not think it is a typo, as it is discussed quite extensively and Varian points out it may appear surprising (as it did for you). – Richard Hardy May 17 '21 at 16:31
  • @RichardHardy Well, he is not wrong there I softened the language in my answer about prima facie being false, I will definitely have closer look at that passage – 1muflon1 May 17 '21 at 16:33
  • @RichardHardy I think Varian there assumes that factor costs are constant because he says that he assumes in first part of the chapter that factor costs are constant. There the Mankiw explanation would not apply because that explanation is based on arguing that factor costs rise if firm would try to build more factories at some point and increase production in long-run – 1muflon1 May 17 '21 at 16:49
  • That makes sense. Could we then formulate some assumptions about the markets that would imply fixed factor prices? (Taking it as a primitive appears artificial to me.) E.g. that the producer is too small to influence the prices in the input markets? And conversely, the assumption would not hold and we would have increasing factor prices if the producer is big enough? Or is the size of the producer secondary to some more primitive force? – Richard Hardy May 17 '21 at 17:07
  • @RichardHardy sure a simple assumption would be that firms labor, capital etc demand is so small relative to the market that it cannot affect market price. Real supply curves might not be perfectly smooth so imagine situation where we have “staircase” supply for labor on interval $L\in [3,5]$ the supply horizontal so that if demand for labor changes from 3 to 5 there would be no increase in wage but there would be one if it would push it to 6L. So if you assume that supply is not strictly monotonically increasing, and that demand for labor of firm is small relative to the size of the – 1muflon1 May 17 '21 at 20:08
  • @RichardHardy market then firm would not be able to change price. I am a bit hesitant to call such firms small because small firms can mean something different in different strands of literature that is you can decide based on output, their ability to control prices, assets or number of workers they employ (they would have to be small in the last sense - relative to market supply of course - eg if it’s small firm meaning now for this context it only has need for few employees but labor market is dominated by 3 workers then it would likely not be able to keep wages constant). – 1muflon1 May 17 '21 at 20:10
  • That was a good discussion. The firm should be viewed as small in the sense of not being able to affect the market price, in your exmaple the price of labor. A labor market dominated by 3 workers does not allow for small firms in that sense. – Richard Hardy May 18 '21 at 06:10
  • By the way, I found the same graph in Mankiw's "Principles of Microeconomics" (you mentioned his "Principles of Economics", did not check there). – Richard Hardy May 18 '21 at 15:00
  • 1
    @RichardHardy I think then he reuses the graphs, because I double checked the title and the book I took it from is principles of economics... I guess this is nasty feature of the US textbook market, you have so much mandatory overlapping textbooks that often even cover verbatim the same stuff and then students end up paying on all those additional textbooks :( – 1muflon1 May 18 '21 at 15:07
  • Thank you for a (yet another) great answer! – Richard Hardy May 19 '21 at 10:41
4

Theoretically, the cost function is a result of a cost minimization problem with a given production technology. Convex/linear/concave costs are a result of decreasing/constant/increasing returns to scale. The thinking behind convex costs is the idea of decreasing marginal product of your input goods for production.

As an example for the kind of thinking behind a convex cost function: If you want to produce one widget, you can do it with the 3 most skilled workers in town. If you want to produce two widgets, you can do it with the 7 most skilled workers in town because the 4 additional ones are slower. Alternatively, all workers have the same productivity but first you take the cheapest and the next ones would only do the job for more money. Alternatively, you consider hours worked: A worker can produce one good in three hours, but the second one takes four hours because working is exhausting. Similarly, the first hours are paid on regular contract wherease extra-hours need extra compensation. Alternatively, you need wood for production. For the first goods you can chop wood in your own forest, but once you need more you need to find additional more expensive sources. And so on.

Remember that the supply curve is the increasing part of the marginal cost curve. The supply curve in Econ 101 is upward sloping because of the above intuition. It might be that there are increasing returns to scale, e.g. because workers can divide jobs and there are gains from specialization. Eventually, however, we assume that those gains come to an end at some point as marginal returns diminish.

Next, and this is not a good reason, note that a monopolist optimally produces a quantity such that marginal revenue - marginal cost = 0. Convex costs ensure that this is in fact a maximum.

Bayesian
  • 5,290
  • 2
  • 17
  • 44
  • 1
    It seems we can have different plausible stories with opposite implications. My interest is in finding why the plausible story of convex cost is more convincing than the plausible story of concave costs. I am willing to take additional assumptions to achieve that, and the choice of assumptions seems crucial. Among what you wrote, It might be that there are increasing returns to scale, e.g. because workers can divide jobs and there are gains from specialization. Eventually, however, we assume that those gains come to an end at some point as marginal returns diminish seems to be about it. – Richard Hardy May 17 '21 at 14:41
  • Yes, I find it reasonable that such gains dry up. Your argument seems to be that instead of one big team where there specialization gains dried up, you can make several small teams/small factories does not seem to scale up because recources are scarce. You can't copy the more skilled workers which you would hire first and you cannot copy the easy to get oil/coal. At some point you need to hire less skilled workers and drill deeper to get oil/coal. – Bayesian May 17 '21 at 14:49
  • Very well. These are precisely the assumptions I am looking for! I wonder if there are more, or if there is an explicit set of them (e.g. minimal sufficient set) somewhere. There has to be something about the aggregate in addition, I think, since under perfect competition a small producer is not supposed to have an effect on the input markets (like being able to exhaust them to a degree that would affect input prices). – Richard Hardy May 17 '21 at 14:50
  • For such discussions about the fundamentals, it is always best to open the bible again. Have a look at MWG Section 5.D "geometry of the cost function." – Bayesian May 17 '21 at 14:57
  • I read it once, and what I found was a discussion of multiple shapes of cost functions without paying much attention to how they arise. Yes, there were some hints but only for some cases. Unfortunately, I could not find a discussion relevant to this thread. I found something more interesting in Varian's "Microeconomic Analysis"; see my edit. – Richard Hardy May 17 '21 at 15:42
4

If the cost function is globally concave in output $y$, then

  • the profit function is convex in $y$ and the optimal (profit maximizing) output is not characterized by the equality between price and marginal cost, so price taker firms have an optimal output level that is either 0 or tends to infinity
  • the profit is negative at least for low levels of output (if $c'(0)>p$)

Such a concavity assumption will have difficulties to explain why about 60% of firms produce less than 5% of total output.
For these reasons, the cost functions are probably not concave (globally), unless for firms with strong market power... Instead it is quite plausible that the cost function is locally convex and exhibits nonconvexities here and there.

Bertrand
  • 3,351
  • 1
  • 9
  • 24
  • Thank you. The second bullet point is obviously correct, isn't it? The first bullet point seems intuitively correct, too, as virtually all companies try to expand production as long as there is demand for it and the level of production does not cause the price to fall too much. It is uncommon to see a producer of a commodity saying, well, we should not expand our production any more because we have reached a sweet spot. (I am not talking about monopolies but rather perfect competition and settings close to it.) – Richard Hardy May 18 '21 at 06:04
  • Regarding why about 60% of firms produce less than 5% of total output, I think we may need to look for the answer elsewhere than in the cost function: (1) the markets together with their participants are never in equilibrium but on their way there, and the conditions keep changing so that the equilbrium is constantly moving; (2) there are young companies that are growing and have not reached their optimal production level yet; (3) there is market power (thus not perfect competition) etc. ect. – Richard Hardy May 18 '21 at 06:34
3

Increasing and convex costs are a result of decreasing returns to scale. These are mainly due to the limited availability of (local) input factors. Other contributing factors are the decline of management efficiency of large-scale production, the imperfection of internal supervision and control mechanisms, and more complex information transmission.

VARulle
  • 6,805
  • 10
  • 25
  • I have made an argument for increasing returns to scale. (Regarding decline of management efficiency of large-scale production, the imperfection of internal supervision and control mechanisms, and more complex information transmission, why not split your factory into multiple smaller copies to avoid that?) If the argument is not faulty and the producer is not big enough to affect the market prices of inputs, then I do not find your argument convincing. Please let me know where you think I am mistaken. – Richard Hardy May 17 '21 at 13:42
  • Within a single factory you typically have increasing marginal productivity, so you add wokers and machines until you reach the optimal factory size. Then, if you want to further expand output, you need to set up a second factory, where you repeat the process. When you have multiple factories, you need managers for each one, and the supervision/control/information transmission problems kick in. So n factories are more than n times as costly to run than a single one. – VARulle May 17 '21 at 14:22
  • Then you split the company into multiple one-factory companies. The idea is never to choose an inefficient large entity when better efficiency can be achieved by a number of smaller entities. In other words, always use the most efficient technology. A rational actor would naturally choose that, especially given that the efficient technology is readily available as it has been implemented on a smaller scale / in a smaller number of entities already. (It is not I who came up with that, I read it somewhere a while ago.) – Richard Hardy May 17 '21 at 14:44
  • This does not work as a solution to the problem. Multiple companies would compete against each other - unless you somehow integrate them into a "supercompany" acting as a single unit and maximizing the sum of company profits. But then your supervision/control/information problems are back again. – VARulle May 18 '21 at 07:08
  • It is not a problem that the companies compete with each other. If we are in the setting of perfect competition, adding a couple more companies to the mix will not affect anything and will not be detrimental to the business effectiveness seen from the perspective of the owner of the several companies. – Richard Hardy May 18 '21 at 10:25
  • I can't see why this ("not detrimental...") should follow from assumptions on the market structure. – VARulle May 18 '21 at 14:16
  • You say Multiple companies would compete against each other. I understand you find that troublesome. I do not see why but I tried to guess what you might have thought. I have then tried to explain why it is not troublesome; the owner of the companies does not suffer from the fact these companies compete with each other. Or did I misunderstand you? – Richard Hardy May 18 '21 at 14:32
  • I think the owner of two companies would rather have them each choose half the monopoly quantity than their duopoly quantities. – VARulle May 19 '21 at 07:17
  • Sure, I must agree. The difference is that the companies operate under perfect competition, not under a duopoly. – Richard Hardy May 19 '21 at 10:39
  • Well, you asked "Why are cost functions typically assumed to be convex in producer theory of (introductory) microeconomics?" That's what I tried to answer, pointing out that this is a realistic assumption for real world firms. Perfect competition is a benchmark model with lots of unrealistic assumptions. I don't think you can take these assumptions as a starting point for arguing about the cost structure of firms. It's the other way round: You take a firm with a simple but realistic cost structure, put it into a perfectly competitive environment and see what happens. – VARulle May 19 '21 at 16:55
  • 1
    Thank you. The textbook examples that I encountered had both convex costs and perfect competition, and the contradiction bugged me. In realistic situations, I agree things can be very different. – Richard Hardy May 19 '21 at 17:53
2

In my very limited empirical experience it seemed that cost functions were in fact non-convex for most output levels in the few industries I looked at. Allocating costs to exact parts of a process is very difficult, but marginal costs were generally assumed to be constant, with some jumps as capacity constraints were reached.

The theorems in microeconomics/general equilibrium theory that deal with the existence of solutions to profit maximization problems, the existence of competitive equilibria and Pareto-efficiency of these equilibria are well-liked for their mathematical elegance. However they rely on a bunch of convexity/concavity assumptions. (The branch of math used is convex analysis.)

Hence these assumptions are dictated more by the desire for elegant theoretical solutions rather than empirical knowledge.


Note that there are many possible rationalizations for why cost functions may be convex, some (interesting ones) are outlined in the other answers. I would argue that these are mostly rationalizations of the assumption, not proof of its empirical validity. To be fair, I also do not provide empirical proof.

Giskard
  • 29,387
  • 11
  • 45
  • 76
  • This has been my feeling, too, but I do not trust it. I am cautious to dismiss the basic microeconomic theory. People must have had reasonably convincing arguments besides mathematical elegance to make them widely accepted. Or is it too idealistic on my part to think this way? Also, I am trying to see how this is supposed to work at least in theory if not in practice. There seems to be some trouble with my understanding of the former let alone the latter on which tend I agree with you. – Richard Hardy May 17 '21 at 13:43
  • 2
    @RichardHardy I am not the right person to turn to right now for this sort of advice. I recently decided to leave the discipline, in part due to similar frustrations. In my discussions with other economists the varied answers I received were 1. "of course it is just a model, still it is useful" (I was not convinced by the reasoning on usefulness) 2. "of course it is a bad model, but publish or perish" and 3. "who cares if it is empirically valid? I like math!" – Giskard May 17 '21 at 13:49
  • 1
    I am sorry to hear that. There definitely are cases like you describe. I hope they do not dominate generally, but they may surely plague many. In any case, your help and critical approach is appreciated (surely more broadly than at Economics SE). – Richard Hardy May 17 '21 at 13:59
0

I would intuit the following: consumer utility is defined as a function as $$U=U(C,L)$$ where $C$ is consumption and $L$ is leisure. We normally assume that this function is concave in both arguments. Given a fixed time endowment, leisure becomes more and more valuable the lesser you have of it (consequence of concavity+ Inada conditions).

In general equilibrium, labour demand has to equal labour supply. If a firm wants to increase production from $Q_{1}$ to $Q_{2}$, it will have to incentivize workers by paying higher wages- which will increase at an increasing rate given the concavity of the utility function defined over leisure.

ChinG
  • 1,633
  • 14
  • 20
  • That is a nice example of the phenomenon mentioned in the last paragraph of the OP. It applies directly for a producer that is big enough to affect the price of labour. What about a smaller producer though? (Maybe this is already in the notion of general equilibrium or something? Since my knowledge of microeconomics is very rusty and has never been deep, I cannot tell.) – Richard Hardy May 17 '21 at 12:39
  • (-1) How about companies that are big enough to hire several people, working for monthly salaries rather than an hourly wage? I hear this business model is widespread, and the cost function seem unaffected by labor supply function of individual workers. – Giskard May 17 '21 at 13:27
  • @RichardHardy I will write a more detailed answer if I find the time today- but essentially, in many models, labour is assumed to be perfectly mobile across firms. Given perfect mobility, the size of the firm does not matter- there is only one wage across firms. For your firm to incentivize the marginal worker to come work to increase your own production, you would have to pay higher than the prevailing market wage. – ChinG May 17 '21 at 13:30
  • Last comment was by me, not Hardy. The theoretical model you describe seems ill-fited to reality. I do not doubt its inner mathematical consistency. – Giskard May 17 '21 at 13:31
  • @Giskard I see what you mean. I would also agree that this may not reflect reality accurately. Of course, one could build models of monopsony power, segmented labor markets, labor market frictions etc. to reflect reality better. The above was just one reason that came to mind regarding the convexity of cost functions. – ChinG May 17 '21 at 14:41
  • @ChinG, I would first focus on the level of an introductory textbook to maintain clarity and simplicity. Only if this does not work at an elementary level would I go for more complex models like the ones you mentioned. My expectation was that there is some standard answer that does not require complicated models, though I am not that deep into the field at all to trust my expectations much. – Richard Hardy May 17 '21 at 14:48