234

Wikipedia only lists two problems under "unsolved problems in computer science":

What are other major problems that should be added to this list?

Rules:

  1. Only one problem per answer
  2. Provide a brief description and any relevant links
Iddo Tzameret
  • 1,899
  • 1
  • 19
  • 27
Shane
  • 2,233
  • 3
  • 26
  • 25

60 Answers60

149

Can multiplication of $n$ by $n$ matrices be done in $O(n^2)$ operations?

The exponent of the best known upper bound even has a special symbol, $\omega$. Currently $\omega$ is approximately 2.376, by the Coppersmith-Winograd algorithm. A nice overview of the state of the art is Sara Robinson, Toward an Optimal Algorithm for Matrix Multiplication, SIAM News, 38(9), 2005.

Update: Andrew Stothers (in his 2010 thesis) showed that $\omega < 2.3737$, which was improved by Virginia Vassilevska Williams (in a July 2014 preprint) to $\omega < 2.372873$. These bounds were both obtained by a careful analysis of the basic Coppersmith-Winograd technique.

Update (Jan 30, 2014): François Le Gall has proved that $\omega < 2.3728639$ in a paper published in ISSAC 2014 (arXiv preprint).

Update (Nov 4, 2023): "New Bounds for Matrix Multiplication: from Alpha to Omega" proved that $\omega \leq 2.371552$.

hedgehog0
  • 3
  • 3
András Salamon
  • 19,000
  • 3
  • 64
  • 150
  • How about a modest and realistic goal of $O(n^2 \log n)$ or some other function between $n^{2+\epsilon}$ and $n^2$? After all it is expected that integer multiplication has the lower bound of $O(n \log n)$. – Mitch Sep 13 '10 at 17:16
  • I'm not sure going from even $2+0.376$ to $2+\epsilon$ is regarded as a "modest and realistic goal", let alone then going below $2+\epsilon$. But it would be great to see some progress, so give it a shot! – András Salamon Sep 13 '10 at 17:39
  • 13
    The matrix multiplication exponent is defined to be the smallest real number $\omega$ such that $O(n^{\omega+\epsilon})$ arithmetic operations suffice for all $\epsilon>0$. Probably a factor like $\log n$ should be expected. – Zeyu Nov 13 '10 at 02:39
  • 3
    Just adding for the sake of completeness wrt current knowledge that CW bound was bettered a few days ago by Virginia Williams. And as noted by many others in the community, Andrew Stothers had obtained his bound beating CW's around one year before Virginia. The current record is $O(n^{2.373})$ – Akash Kumar Dec 08 '11 at 22:27
  • I will just let this here http://research.microsoft.com/en-us/um/people/kannan/papers/matrixmult.pdf – raven Feb 02 '12 at 09:51
  • @JaimePardos: the full version of the approximate matrix multiplication paper you refer to is http://www.stanford.edu/class/ee378B/papers/drineas-kannan-mahoney-monte-carlo1.pdf which focuses on algorithms with few passes over the data, with a more generally useful analysis of running time. TL;DR: matrix multiplication can be approximated for some kinds of matrices in $O(n^2)$ time. – András Salamon Feb 03 '12 at 11:53
  • Even more embarrassing that we don't know the optimal algorithms for smallish matrices. NSF/DARPA should really fund cranking these out on spare supercomputing cycles so the chip makers can start cranking out optimal 64 x 64 MATMUL accelerators. Also a low power 8x8 unit would be a killer application for Micron's new venture to sprinkle compute blocks around RAM. – Chad Brewbaker Jan 31 '14 at 17:35
  • @AndrásSalamon I think it is known time complexity has to be $\omega(n^2)$ since circuit complexity is at least this much (http://dl.acm.org/citation.cfm?id=509932 result by Raz). – Turbo Sep 22 '16 at 16:37
129

Is Graph Isomorphism in P?

The complexity of Graph Isomorphism (GI) has been an open question for several decades. Stephen Cook mentioned it in his 1971 paper on NP-completeness of SAT.

Determining whether two graphs are isomorphic can usually be done quickly, for instance by software such as nauty and saucy. On the other hand, Miyazaki constructed classes of instances for which nauty provably requires exponential time.

Read and Corneil reviewed the many attempts to tackle the complexity of GI up to that point: The Graph Isomorphism Disease, Journal of Graph Theory 1, 339–363, 1977.

GI is not known to be in co-NP, but there is a simple randomized protocol for Graph Non-Isomorphism (GNI). So GI (= co-GNI) is therefore believed to be "close to" NP ${}\cap{}$ co-NP.

On the other hand, if GI is NP-complete, then the Polynomial Hierarchy collapses. So GI is unlikely to be NP-complete. (Boppana, Håstad, Zachos, Does co-NP Have Short Interactive Proofs?, IPL 25, 127–132, 1987)

Shiva Kintali has a nice discussion of the complexity of GI at his blog.

Laszlo Babai proved that Graph Isomorphism is in quasipolynomial time.

Zach Hunter
  • 625
  • 3
  • 14
Kaveh
  • 21,577
  • 8
  • 82
  • 183
96

Complexity of Factoring

Is Factoring in $\mathsf{P}$?

Kaveh
  • 21,577
  • 8
  • 82
  • 183
  • Any good publications you know of that describe factoring or primality testing complexity in terms of the structure of the semigroup of addition and multiplication transforms on Z_n? For example on $Z_3$ [0,1,2] is the +0|x1 transform, [1,2,0] is the +1 transform ... – Chad Brewbaker Jan 31 '14 at 17:45
77

P = BPP?

didest
  • 1,551
  • 16
  • 25
71

Is there a pivoting rule for the simplex algorithm that yields worst-case polynomial running time? More generally, is there any strongly polynomial algorithm for linear programming?

Anand Kulkarni
  • 853
  • 6
  • 15
Jeffε
  • 23,129
  • 10
  • 96
  • 163
  • 11
    I'll add to this question: would showing the nonexistence of strongly polynomial LP imply any class separation results? – Anand Kulkarni Aug 27 '10 at 23:59
  • ,,,and the Hirsch conjecture... – Sariel Har-Peled Nov 10 '11 at 04:27
  • 8
    In 2011, Oliver Friedmann showed exponential lower bounds for many pivoting rules (he actually claims "essentially all natural" pivoting rules, including Random Facet and Random Edge). These bounds apply when solving a linear program derived from 2-player parity games. Friedmann's thesis http://edoc.ub.uni-muenchen.de/13294/ surveys the history in some depth (including various forms of the Hirsch Conjecture, and the 2010 counterexample to the strong form by Francisco Santos). – András Salamon Jan 30 '12 at 16:11
67

The exponential-time hypothesis (ETH) asserts that solving SAT requires exponential, 2Ω(n) time. ETH implies many things, for instance that SAT is not in P, so ETH implies P ≠ NP. See Impagliazzo, Paturi, Zane, Which Problems Have Strongly Exponential Complexity?, JCSS 63, 512–530, 2001.

ETH is widely believed, but likely to be difficult to prove, as it implies many other complexity class separations.

András Salamon
  • 19,000
  • 3
  • 64
  • 150
Jeffε
  • 23,129
  • 10
  • 96
  • 163
62

Immerman and Vardi show that fixed-point logic captures PTIME on the class of ordered structures. One of the biggest open problems in descriptive complexity theory is whether the dependency on the order can be removed:

Is there a logic that captures PTIME?

Put simply, a logic capturing PTIME is a programming language for graph problems that works directly on the graph structure and does not have access to the encoding of the vertices and edges, such that the following hold:

  1. any syntactically correct program models a polynomial-time computable graph problem and
  2. any polynomial-time computable graph problem can be modelled by a syntactically correct program.

If there is no logic that captures PTIME, then $P \neq NP$ since NP is captured by existential second-order logic. A logic capturing PTIME would provide a possible attack to P vs NP.

See Lipton's blog for an informal discussion and M. Grohe: The Quest for a Logic Capturing PTIME (LICS 2008) for a more technical survey.

András Salamon
  • 19,000
  • 3
  • 64
  • 150
Holger
  • 975
  • 9
  • 17
  • 3
    Immerman-Vardi shows FO(LFP) captures logic on ordered structures, so this is a question about capturing PTIME on arbitrary finite models, I take it. If I understand you correctly, isn't this question a translation of asking whether P != NP? It might be more pointed to ask one or more of the open problems in the survey you link to. Apologies if I am being clueless here. – Aaron Sterling Aug 18 '10 at 12:26
  • 5
    Thanks, I edited the answer to mention Immerman-Vardi for clarification. No, this open problem is not known to be equivalent to P vs NP. The open problems in the survey are special cases of the big open problem and not appropriate in this thread. Maybe this reference is also helpful: http://rjlipton.wordpress.com/2010/04/05/graph-isomorphism-and-graph-minors/ – Holger Aug 18 '10 at 16:25
57

Is the unique games conjecture true?
And: Given that there are sub-exponential time approximation algorithms for Unique Games, where does the problem ultimately rest in terms of the complexity landscape?

Anand Kulkarni
  • 853
  • 6
  • 15
Daniel Apon
  • 6,001
  • 1
  • 37
  • 53
  • Would it not be more precise to say that if the UGC is not true (i.e. unique games are not NP-hard, just harder than P), where would UGC fit into the landscape? – András Salamon Aug 17 '10 at 17:55
  • Oops. Yes, I should reword this. My intention was to highlight the the apparent discrepancy that results from unique games having a non-trivial approximation algorithm in sub-exponential (but not quite polynomial) time. More of: What does this say, if sub-exponential run time is optimal for unique games? – Daniel Apon Aug 17 '10 at 18:58
  • 2
    In retrospect, I thought I should include a pointer toward this pre-print. In my opinion, it's as big of a development as the paper I have linked in the answer. – Daniel Apon Aug 31 '10 at 11:09
  • 2
    It's worth noting that there are no known hard instances of UCG. The current best approach works efficiently in every tested case. We just can't prove that we have found the most pathological examples. – Stella Biderman Jan 06 '17 at 21:28
57

Permanent versus Determinant

The permanent versus determinant question is interesting because of two facts. First, the permanent of a matrix counts the number of perfect matchings in a bipartite graph. Therefore the permanent of such a matrix is #P-Complete. At the same time, the definition of the permanent is very close that of the determinant, ultimately different only because of a simple sign change. Determinant calculations are well known to be in P. Studying the different between the permanent and the determinant, and how many determinant calculations are required to compute the permanent speak about P versus #P.

Kaveh
  • 21,577
  • 8
  • 82
  • 183
Ross Snider
  • 2,035
  • 2
  • 20
  • 33
  • 7
    To me this doesn't qualify as a "major open problem", because the actual complexity theoretic question (do they have different complexities) is subsumed by P=NP (since #P is a superset of NP) and with that question set aside there isn't a concrete problem posed here. – David Eppstein Dec 09 '11 at 00:59
  • I actually agree with this. – Ross Snider Dec 09 '11 at 04:25
  • 11
    @DavidEppstein: Per v. det is closer to GapP v GapL, a counting analog of NP v NL. It's possible that $NL \neq P=NP$ and hence $GapP \neq GapL$. Also, per v det is much older than P v NP, essentially going back to [Polya 1913], in which he shows that one cannot affix signs to a matrix to change its permanent to its determinant (except 2x2). Valiant introduced a variant on those questions (allowing size of det to be larger than n) because of its significance in complexity, but even the pre-Valiant works give the motivation "because the permanent is so hard to compute..." (eg Gibson 1971) – Joshua Grochow Mar 02 '12 at 17:38
  • What are the state of the art algorithms now for calculating the permanent of a 0-1 matrix? i.e. the number of legal permutation matrices you can generate from a subset of the 1's. – Chad Brewbaker Jan 31 '14 at 18:10
  • @ChadBrewbaker: see Mark Jerrum, Alistair Sinclair, Eric Vigoda, "A Polynomial-Time Approximation Algorithm for the Permanent of a Matrix with Non-Negative Entries", Journal of the ACM 51/4 (2004), 671, http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.141.116 – Zsbán Ambrus Feb 05 '15 at 07:22
49

The dynamic optimality conjecture for splay trees.

Or more generally: Is any online dynamic binary search tree O(1)-competitive?

Jeffε
  • 23,129
  • 10
  • 96
  • 163
49

Can we compute the FFT in much less than $O(n \log n)$ time?

In the same (very) general vein, there are many questions of improving the run-times of many classical problems or algorithms: e.g., can all-pairs-shortest-paths (APSP) be solved in $O(n^{3-\epsilon})$ time ?

Edit: APSP runs in time $(\frac{n^3}{2^{\Omega(log n)^{1/2}}})$ "where additions and comparisons of reals are unit cost (but all other operations have typical logarithmic cost)": http://arxiv.org/pdf/1312.6680v2.pdf

Ryan Dougherty
  • 555
  • 3
  • 19
Alex Andoni
  • 416
  • 3
  • 7
  • 5
    An interesting development on FFT: "* An O(k log n)-time algorithm for the case where the input signal has at most k non-zero Fourier coefficients, and
    • An O(k log n log(n/k))-time algorithm for general input signals." source: http://arxiv.org/abs/1201.2501v1
    – Shadok Feb 02 '12 at 16:20
47

A linear time deterministic algorithm for the minimum spanning tree problem.

Artem Kaznatcheev
  • 10,251
  • 12
  • 74
  • 174
Suresh Venkat
  • 32,071
  • 4
  • 95
  • 271
45

NP versus co-NP

The NP versus co-NP question is interesting because NP ≠ co-NP implies P ≠ NP (as P is closed under complement). It also relates to "duality": separation between finding/verifying examples and finding/verifying counterexamples. In fact, proving that a question is in both NP and co-NP is our first good evidence that a problem that seems to be outside of P is also likely not NP-Complete.

András Salamon
  • 19,000
  • 3
  • 64
  • 150
Ross Snider
  • 2,035
  • 2
  • 20
  • 33
  • 7
    This is also related to propositional proof complexity. There is a polynomial propositional proof system iff $NP$ is equal to $coNP$. – Kaveh Aug 17 '10 at 22:01
42

Do all propositional tautologies have polynomial-size Frege proofs?

Arguably the major open problem of proof complexity: demonstrate super-polynomial size lower bounds on propositional proofs (called also Frege proofs).

Informally, a Frege proof system is just a standard propositional proof system for proving propositional tautologies (one learns in a basic logic course), having axioms and deduction rules, where proof-lines are written as formulas. The size of a Frege proof is the number of symbols it takes to write down the proof.

The problem then asks whether there is a family $(F_n)_{n=1}^\infty$ of propositional tautological formulas for which there is no polynomial $ p $ such that the minimal Frege proof size of $ F_n $ is at most $ p(|F_n|)$, for all $ n=1,2,\ldots$ (where $ |F_n| $ denotes the size of the formula $ F_n $).


Formal definition of a Frege proof system

Definition (Frege rule) A Frege rule is a sequence of propositional formulas $ A_0(\overline x),\ldots,A_k(\overline x) $, for $ k \le 0 $, written as $ \frac{A_1(\overline x), \ldots,A_k(\overline x)}{A_0(\overline x)}$. In case $ k = 0 $, the Frege rule is called an axiom scheme. A formula $ F_0 $ is said to be derived by the rule from $ F_1,\ldots,F_k $ if $ F_0,\ldots,F_k $ are all substitution instances of $ A_1,\ldots,A_k $, for some assignment to the $ \overline x $ variables (that is, there are formulas $B_1,\ldots,B_n $ such that $F_i = A_i(B_1/x_1,\ldots,B_n/x_n), $ for all $ i=0,\ldots,k $. The Frege rule is said to be sound if whenever an assignment satisfies the formulas in the upper side $A_1,\ldots,A_k $, then it also satisfies the formula in the lower side $ A_0 $.

Definition (Frege proof) Given a set of Frege rules, a Frege proof is a sequence of formulas such that every proof-line is either an axiom or was derived by one of the given Frege rules from previous proof-lines. If the sequence terminates with the formula $ A $, then the proof is said to be a proof of $ A $. The size of a Frege proof is the the total sizes of all the formulas in the proof.

A proof system is said to be implicationally complete if for all set of formulas $ T $, if $ T $ semantically implies $ F $, then there is a proof of $ F $ using (possibly) axioms from $ T $. A proof system is said to be sound if it admits proofs of only tautologies (when not using auxiliary axioms, like in the $ T $ above).

Definition (Frege proof system) Given a propositional language and a finite set $ P $ of sound Frege rules, we say that $ P $ is a Frege proof system if $ P $ is implicationally complete.

Note that a Frege proof is always sound since the Frege rules are assumed to be sound. We do not need to work with a specific Frege proof system, since a basic result in proof complexity states that every two Frege proof systems, even over different languages, are polynomially equivalent [Reckhow, PhD thesis, University of Toronto, 1976].


Establishing lower bounds on Frege proofs could be viewed as a step towards proving $NP \neq coNP$, since if this is true then no propositional proof system (including Frege) can have polynomial size proofs for all tautologies.

Iddo Tzameret
  • 1,899
  • 1
  • 19
  • 27
41

Are there problems that cannot be solved efficiently by parallel computers?

Problems that are P-complete are not known to be parallelizable. P-complete problems include Horn-SAT and Linear Programming. But proving that this is the case would require separating some notion of parallelizable problems (such as NC or LOGCFL) from P.

Computer processor designs are increasing the number of processing units, in the hope that this will yield improved performance. If fundamental algorithms such as Linear Programming are inherently not parallelizable, then there are significant consequences.

András Salamon
  • 19,000
  • 3
  • 64
  • 150
  • 16
    I'm pretty sure that LP algorithms, as they stand today, are not parallelizable. I believe they fit into Mulmuley's RAM-without-bit-operations model. In http://dx.doi.org/10.1137/S0097539794282930 K. Mulmuley. Lower Bounds in a Parallel Model without Bit Operations. SIAM J. Comput. 28 (4), 1460-1509 (1999) he shows that $P \neq NC$ in that model, showing that many natural (usually numerical) algorithms for $P$-complete problems are not parallelizable. This does not answer the question in the boolean case, but it does answer it for a large class of natural algorithms. – Joshua Grochow Aug 24 '10 at 06:33
39

Are there truly subquadratic-time algorithms (meaning $O(n^{2-\delta})$ time for some constant $\delta>0$) for 3SUM-hard Problems?

In 2014, Grønlund and Pettie described a deterministic algorithm for 3SUM itself that runs in time $O(n^2/(\log n/\log \log n)^{2/3})$. Although this is a major result, the improvement over $O(n^2)$ is only (sub)logarithmic. Moreover, no similar subquadratic algorithms are known for most other 3SUM-hard problems.

Jeffε
  • 23,129
  • 10
  • 96
  • 163
Aryabhata
  • 1,855
  • 3
  • 21
  • 29
  • 9
    Good question. However, the existence of sub-quadratic algorithms for the 3SUM problem is wide open even for randomized algorithms. Of course, deterministic algorithm would have been even nicer.. – Piotr Aug 25 '10 at 19:36
  • 3
    In the quantum case, there are known matching n log(n) lower and upper bounds for 3SUM:

    Andrej Dubrovsky, Oksana Scegulnaja-Dubrovska Improved Quantum Lower Bounds for 3-Sum Problem. Proceedings of Baltic DB&IS 2004, vol. 2, Riga, Latvia, pp.40-45.

    – Martin Schwarz Aug 25 '10 at 20:34
  • @Piotr: Thanks! Have edited. – Aryabhata Aug 25 '10 at 20:42
  • @Martin, that's quite interesting. I need to see what those upper bounds look like ! – Suresh Venkat Sep 01 '10 at 16:50
  • 1
    I was under the impression that we do not have n^2 lower bound for any problem in NP. – Sariel Har-Peled Nov 10 '11 at 04:40
  • @SarielHar-Peled That's an interested claim that I hadn't heard before. There are certainly problems in NP where the output is quadratic on the size of the input, and so a quadratic lower bound is trivial. Are you saying that we do not have a quadratic bound on, say, the max of the input and output sizes for a problem in NP? Or that we don't have a problem in NTIME($f(n)$) that is not in DTIME($o(f(n)^2)$)? I would love to see a source or further discussion of this. – William Macrae Nov 28 '12 at 02:14
  • 1
    I had the distinct impression that if you are restricted to decision problems (no output arguments), then nothing is known. But you should really ask a complexity person. – Sariel Har-Peled Dec 01 '12 at 05:42
  • 5
    A recent arXiv paper claims to have settled this conjecture by giving sub-quadratic algorithms for 3-SUM. – Mangara Jul 07 '14 at 17:33
39

Can we compute the edit distance between two strings of length $n$ in sub-quadratic time, i.e., in time $O(n^{2-\epsilon})$ for some $\epsilon>0$ ?

Piotr
  • 971
  • 7
  • 15
  • 8
    Do you have references for that? I actually thought that this proposition was trivially false although I can’t think of a proof off the top of my head. (Although I’m aware that the runtime can be made dependent on the number of errors.) – Konrad Rudolph Sep 11 '10 at 12:00
  • 6
    Update (STOC 2015): Backurs and Indyk give evidence that better-than-quadratic time is not possible. See https://rjlipton.wordpress.com/2015/06/01/puzzling-evidence/ . – Neal Young Oct 11 '15 at 01:36
35

BQP = P?

Also: NP contained in BQP?

I know this violated the rules by having two questions in the answer, but when taken with the P vs NP question, they are not necessarily independent questions.

Joe Fitzsimons
  • 13,675
  • 47
  • 91
33
  1. Isomorphism Conjecture. (Are all NP-complete problems the "same" problem?)
  2. Can cryptography be based upon an NP-complete problem?

  3. and, a little further away from the mainstream:

  4. What is the size of NP within EXP?

(Informally, if you have all problems in EXP on a table, and you pick one up uniformly at random, what is the probability that the problem you chose is also in NP? This question has been formalized by the notion of resource-bounded measure. It is known that P has measure zero within EXP, i.e., the problem you picked up from the table is almost surely not in P.)

Daniel Apon
  • 6,001
  • 1
  • 37
  • 53
Aaron Sterling
  • 6,994
  • 6
  • 42
  • 74
  • Is this the same as p-measure in the Complexity Zoo? Where would I go to read more about it? – András Salamon Aug 17 '10 at 17:48
  • 2
    P-measure is one example of resource-bounded measure: more generally, you can imagine a machine trying to predict a sequence, and the computational resources it has available to do so are what provides the resource-bound on the measure. I used p-measure in my informal explanation of EXP on a table. For further reading, I recommend the journal version of the following survey by Lutz (the CZ cites the conference version of this survey). http://www.cs.iastate.edu/~lutz/=PAPERS/qset.ps (in postscript, I hope that's ok) – Aaron Sterling Aug 17 '10 at 18:02
  • Thanks. Here is a PDF of that paper for those who can't read PS: http://archives.cs.iastate.edu/documents/disk0/00/00/01/28/00000128-01/TR96-08.pdf – András Salamon Aug 17 '10 at 23:17
  • Interesting. Is it correct that NP not having measure 0 would imply P \neq NP? Would NP having measure 0 have any big implications? – Opt Aug 18 '10 at 01:32
  • 2
    Yes to your first question. P has measure 0 in EXP, so if NP does not, you get P != NP immediately. For the second question, I suggest you read the last paragraph of page 28 in the survey Andras and I linked to. (Not enough space in the comment to paste it here, sorry.) Basically, if NP has measure zero, there exists an feasible algorithm that could guess membership in an NP-hard problem "unreasonably" well. So it seems likely that NP is not measure zero within EXP. – Aaron Sterling Aug 18 '10 at 02:24
  • Would be nice to have a link to some info about the isomorphism conjecture – Artem Kaznatcheev Aug 16 '11 at 05:30
  • 1
    @Artem: you could start here: http://blog.computationalcomplexity.org/2003/03/berman-hartmanis-isomorphism.html?m=1 – Aaron Sterling Aug 16 '11 at 11:15
31

What is the approximability of Metric TSP? Christofides' Algorithm from 1975 is a polynomial-time (3/2)-approximation algorithm. Is it NP-hard to do better?

  • Approximating Metric TSP to within a factor smaller than 220/219 is NP-hard (Papadimitriou and Vempala, 2006 [PS]). To my knowledge this is the best known lower bound.

  • There is some evidence suggesting that the actual bound may be 4/3 (Carr and Vempala, 2004 [Free version] [Good version]).

  • The upper bound on approximability was recently lowered to $13/9$ (Mucha 2011 "13/9 -approximation for Graphic TSP" [PDF])

Glorfindel
  • 359
  • 1
  • 4
  • 10
James King
  • 2,613
  • 18
  • 25
30

Give an explicit function with exponential circuit complexity.

Shannon proved in 1949 that if you pick a Boolean function at random, it has exponential circuit complexity with probability almost one.

The best lower bound for an explicit Boolean function $f:\{0,1\}^n \to \{0,1\}$ we have so far is $5n - o(n)$ by K. Iwama, O. Lachish, H. Morizumi, and R. Raz.

Marc
  • 446
  • 3
  • 7
  • 11
    This way of stating the problem always bugs me, because you have to be careful about what you mean by "explicit". It is easy to write down a description of a function that has exponential circuit complexity. If "explicit" means "computable in exponential time or less", then I agree, this is a major open problem. – Ryan Williams Sep 13 '10 at 17:12
  • 1
    Ryan, you are right. This is an extremely important point. It is also easy to write down a description of an uncomputable function. In the paper I cite, the lower bound is proved for a function that is constructible in deterministic polynomial time. – Marc Sep 14 '10 at 12:05
  • Is there a good exposition on Shannon's work? – Turbo Nov 15 '10 at 23:22
  • 3
    The argument is detailed in the following lecture notes: http://www.math.tau.ac.il/~zwick/scribe-boolean.html – Marc Nov 16 '10 at 19:19
  • This is an excellent problems and brings back fond memories of being assigned Shanon's result my second year of university. – Stella Biderman Jan 06 '17 at 21:31
  • @RyanWilliams Is there explicit functions with general formula of superlinear size or is it an open problem related to explicit functions with superlinear circuit complexity? DeMorgan formula superlinear functions are known, – VS. Sep 22 '19 at 19:11
  • I'm not exactly sure what you're asking (you mention both formula and circuit complexity) but the best known formula lower bound over the basis of all 2-bit Boolean formulas is about $n^2$. For the DeMorgan basis it is about $n^3$. These are achieved by an explicit function (called Andreev's function). – Ryan Williams Sep 22 '19 at 23:00
29

What is the query complexity of testing triangle-freeness in dense graphs (i.e., distinguishing triangle-free graphs from those $\epsilon$-far from being triangle-free)? The known upper bound is a tower of exponentials in $1/\epsilon$, while the known lower bound is only mildly superpolynomial in $1/\epsilon$. This is a pretty basic question in extremal graph theory/additive combinatorics that has been open for nearly 30 years.

arnab
  • 7,000
  • 1
  • 38
  • 55
27

Separate NEXP from BPP. People tend to believe BPP=P, but no one can separate NEXP from BPP.

27

Derandomization of the Polynomial Identity Testing problem

The problem is the following: Given an arithmetic circuit computing a polynomial $P$, is $P$ identically zero?

This problem can be solved in randomized polynomial time but is not known to be solvable in deterministic polynomial time.

Related is Shub and Smale's $\tau$ conjecture. Given a polynomial $P$, we define its $\tau$-complexity $\tau(P)$ as the size of the smallest arithmetic circuit computing $P$ using the sole constant $1$. For a univariate polynomial $P\in\mathbb Z[x]$, let $z(P)$ be its number of real roots.

Prove that there exists a universal constant $c$ such that for every $P\in\mathbb Z[x]$, $z(P)\le (1+\tau(P))^c$.

Bruno
  • 4,449
  • 33
  • 45
26

The area of parameterized complexity has its own load of open problems.

Consider the decision problems

  • given $(G,k)$ does there exist a vertex cover of size $k$ for the graph $G$?
  • given $(F,k)$ does there exist a satisfying assignment of weight $k$ for the formula $F$?
  • given $(G,k)$ does there exist a clique of size $k$ in a graph $G$?
  • etc...

Many, MANY, combinatorial problems exist in this form. Parameterized complexity consider an algorithm to be "efficient" if its running time is upper bounded by $f(k)n^c$ where $f$ is an arbitrary function and $c$ is a constant independent of $k$. In comparison notice that all such problems can be easily solved in $n^{O(k)}$.

This framework models the cases in which we are looking for a small combinatorial structure and we can afford exponential run-time with respect to the size of the solution/witness.

A problem with such an algorithm (e.g. vertex cover) is called Fixed Parameter Tractable (FPT).

Parameterized complexity is a mature theory and has both strong theoretical foundations and appeal for practical applications. Decision problems interesting for such theory form a very well structured hierarchy of classes with natural complete problems:

$$ FPT \subseteq W[1] \subseteq W[2] \subseteq \ldots \subseteq W[i] \subseteq W[i+1] \subseteq \ldots W[P] $$

Of course it is open if any of such inclusion is strict or not. Notice that if $FPT=W[1]$ then SAT has subexponential algorithm (this is non trivial). Last statement connects prameterized complexity with $ETH$ mentioned above.

Also notice that investigating such collapses is not an empty exercise: proving that $W[1]=FPT$ is equivalent to prove that there is a fixed parameter tractable algorithm for finding $k$-cliques.

MassimoLauria
  • 1,841
  • 16
  • 21
26

I know the OP asked for only one problem per post, but the RTA (Rewriting Techniques and their Applications) 1 and TLCA (Typed Lambda Calculi and their Applications) conferences both maintain lists of open problems in their fields 2. These lists are quite useful, as they also include pointers to previous work done on attempting to solve these problems.

Dominic Mulligan
  • 837
  • 9
  • 15
26

Is the discrete logarithm problem in P?

Let $G$ be a cyclic group of order $q$ and $g,h \in G$ such that $g$ is a generator of $G$. The problem of finding $n \in \mathbb{N}$ such that $g^n = h$ is known as the discrete logarithm problem (DLP). Is there a (classical) algorithm for solving the DLP in worst-case polynomial-time in the number of bits of $q$?

There are variations of DLP which are believed to be easier, but are still unsolved. The computational Diffie-Hellman problem (CDH) asks for finding $g^{a b}$ given $g, g^a$ and $g^b$. The decisional Diffie-Hellman problem (DDH) asks for deciding, given $g, g^a, g^b, h \in G$, if $g^{a b} = h$.

Clearly DLP is hard if CDH is hard, and CDH is hard if DDH is hard, but no converse reductions are known, except for some groups. The assumption that DDH is hard is key to the security of some cryptosystems, such as ElGamal and Cramer-Shoup.

Juan Bermejo Vega
  • 1,405
  • 1
  • 14
  • 30
25

Is there a Quantum PCP theorem?

Robin Kothari
  • 13,617
  • 2
  • 60
  • 116
25

There are a lot of open problems in lambda calculi (typed and untyped). See the TLCA list of open problems for details; there is also a nice PDF version without the frames.

I particularly like problem #5:

Are there terms untypable in $F_ω$ but typable with help of positive recursive types?

Jacques Carette
  • 1,530
  • 13
  • 23
24

Parity games are two-player infinite-duration graph games, whose natural decision problem is in NP and co-NP, and whose natural search problem in PPAD and PLS.

http://en.wikipedia.org/wiki/Parity_game

Can parity games be solved in polynomial time?

(More generally, a long-standing major open question in mathematical programming is whether P-matrix Linear Complementarity Problems can be solved in polynomial time?)

Rahul Savani
  • 1,703
  • 16
  • 18
23

Sensitivity versus block sensitivity

Boolean sensitivity is interesting because block sensitivity, a close relative, is polynomially related to several other important and interesting complexity measures (like the certificate complexity of a boolean function). If sensitivity is always related to block sensitivity in a polynomial way, we have an extremely simple characteristic of boolean function that's related to so many others.

One might read Rubinstein's "Sensitivity vs. block sensitivity of Boolean functions" or Kenyon and Kutin's "Sensitivity, block sensitivity, and l-block sensitivity of boolean functions."

Ross Snider
  • 2,035
  • 2
  • 20
  • 33
23

Is BQP in PH (polynomial hierachy)?

Opt
  • 1,311
  • 14
  • 20
21

Are NP-completeness in the sense of Cook and NP-completeness in the sense of Karp different concepts, assuming P $\neq$ NP?

vb le
  • 4,828
  • 1
  • 37
  • 46
21

Does there exist any hypothesis class that is NP-Hard to (improperly) PAC learn?

This has some possible implications for complexity, and I think the best progress on this question is here: http://www.cs.princeton.edu/~dxiao/docs/ABX08.pdf

Lev Reyzin
  • 11,968
  • 13
  • 63
  • 103
18

What about proving BPP is contained in NP? (Unconditionally; we already know that BPP=P assuming pretty reasonable complexity assumptions)

Dana Moshkovitz
  • 10,979
  • 1
  • 50
  • 77
18

The "P vs NP" question extends naturally to polynomial-time hierarchy (PH): "Whether PH has infinite levels, or it collapses to some finite level?"

I think this question is (or should be considered as) the most intriguing question of the computer science: If PH has infinite levels, then $\mathbf{P} \neq \mathbf{NP}$. In addition, several researchers have shown that if Graph Isomorphism is NP-complete, then PH collapses to the 2nd level. Therefore, if PH has infinite levels, then Graph Isomorphism is provably not NP-complete.

Several other results follow from the infiniteness of the levels of PH.

Sadeq Dousti
  • 16,479
  • 9
  • 69
  • 152
16

Is there an algorithm to compute the generalized star-height of a given regular language?

See http://en.wikipedia.org/wiki/Generalized_star_height_problem

Generalized regular expressions are defined like regular expressions, but they allow the complement operator. The generalized star height (gsh) of a regular language is the minimum nesting depth of Kleene stars needed to represent the language by a generalized regular expression. Regular languages of gsh 0 (also known as star-free languages) have two nice characterizations: Schützenberger gave an algebraic characterization (their syntactic monoid is aperiodic) and McNaughton showed they correspond to FO[<].

It follows that there are languages of gsh $1$, like $(aa)^*$, but no language of gsh $> 1$ is known! Thus a subproblem would be first to find such a language, or to prove that all regular languages have gsh 1. See also http://www.liafa.univ-paris-diderot.fr/~jep/Problemes/starheight.html

J.-E. Pin
  • 4,831
  • 26
  • 42
  • Welcome to cstheory.stackexchange! Very interesting that no language of gsh > 1 is known. Are there even candidate languages that people think might not have gsh = 1? Or a regular language whose gsh is unknown? Or is it that all regular languages that anyone has ever checked have gsh$ \leq 1$? – Joshua Grochow Aug 08 '13 at 13:58
  • 2
    @JoshuaGrochow It is likely that complicated examples can be found as follows: take a large permutation group, say $S_9$ generated by a cycle $a$ and a transposition $b$. Now consider the (regular) language of all words whose value in $S_9$ is the identity. One can hope that this language has gsh $> 1$. On the other hand, some people believe that every regular language has gsh $\leqslant 1$. Actually, one could define an "intermediate star-height" by replacing complement by intersection, but even so, I don't know of any language of intermediate star-height $> 1$. – J.-E. Pin Aug 09 '13 at 22:54
15

The word "major" is a bit frightening and takes us to the P/NP and related questions. Among the almost-major problems which might be feasible, one that I like is the question of randomized decision trees for graph properties. Is it true that for every non-trivial monotone graph property for graphs with n vertices the expected number of queries that you need to ask in order to know if the graph satisfy the property is constant n^2.

This conjecture is known as the Aanderaa-Karp-Rosenberg conjecture.

Robin Kothari
  • 13,617
  • 2
  • 60
  • 116
Gil Kalai
  • 6,033
  • 35
  • 73
15

Is there a best algorithm for integer multiplication and matrix multiplication (MM), or for that matter any other familiar problem? Manuel Blum has suggested these are good candidates not to have a best algorithm. Among bilinear identities such as Strassen's there is no best one according to Coppersmith and Winograd (1982). If the conjectures of Umans et al are correct, then there is no best algorithm of the type they study. For relevant articles Google "Speedup for Natural Problems".

  • Can you be more specific on you mean by a "best algorithm"? (There is a general linear speed-up theorem.) – Kaveh Nov 16 '10 at 04:31
15

K-server conjecture and randomized K-server conjecture.

Definition according to wikipedia

An online algorithm must control the movement of a set of k servers, represented as points in a metric space, and handle requests that are also in the form of points in the space. As each request arrives, the algorithm must determine which server to move to the requested point. The goal of the algorithm is to keep the total distance all servers move small, relative to the total distance the servers could have moved by an optimal adversary who knows in advance the entire sequence of requests.

Artem Kaznatcheev
  • 10,251
  • 12
  • 74
  • 174
Ashwinkumar B V
  • 1,584
  • 12
  • 20
15

Another open problem for the lambda calculus (from TLCA list of open problems; PDF version ).

Problem #22 on the list:

Is there a continuously complete CPO model of the $\lambda$-calculus whose theory is precisely λβη or λβ ?

Jacques Carette
  • 1,530
  • 13
  • 23
15

Proving the existence of hard-on-average problems in NP using the P≠NP assumption.

Bogdanov and Trevisan, Average-Case Complexity, Foundations and Trends in Theoretical Computer Science Vol. 2, No 1 (2006) 1–106

Mohammad Al-Turkistany
  • 20,928
  • 5
  • 63
  • 149
14

Summary Table for Answers

Open Problems

Open Problem Lists

Kaveh
  • 21,577
  • 8
  • 82
  • 183
14

Unconditional derandomization of Arthur-Merlin games. It is also known that under hardness assumptions AM = NP. The question is, can we prove unconditionally that AM is a subset of sigma sub 2: http://cs.haifa.ac.il/~ronen/online_papers/online_papers.html

Vincent Russo
  • 935
  • 1
  • 6
  • 21
13

Open Problem Garden hosts a number of unsolved problems in theoretical computer science.

Lev Reyzin
  • 11,968
  • 13
  • 63
  • 103
Vincent Russo
  • 935
  • 1
  • 6
  • 21
9

Proving that BPP is in NP is harder than separating NEXP from BPP

That BPP is in NP implies that the polynomial identity problem is in NP, which separates NEXP from P/Poly (BPP is in P/Poly).

  • 6
    Indeed, proving "BPP is in NP" necessarily "separates BPP from NEXP", because NP and NEXP are separated. But BPP is contained in NP (or in NTIME(..somewhat more than poly..)) is a concrete inclusion I would really like to see proven in the near future. – Dana Moshkovitz Oct 18 '10 at 11:13
9

It seems strange to me that almost all the answers are about computational complexity, while the question asks for problems in all computer science.

To counter-balance a little bit:

Decidability of the dot-depth hierarchy: Given a first-order formula on finite words and an integer $k$, is there an equivalent first-order formula with only $k$ quantifier alternations?

Recent progress has been made, it has been showed decidable for $k=2$ in a 2014 paper by Thomas Place and Marc Zeitoun, but the general problem is still wide open.

Denis
  • 8,843
  • 28
  • 55
8

Can we multiply two arbitrary $n$-bit numbers in $O(n)$ time? There is a trivial lower bound of $\Omega(n)$, but no better lower bound is known. Currently, the asymptotically fastest algorithm is $O(n \log n)$ - a recent breakthrough by Harvey and van der Hoeven 2019. The previous best took time $O(n \log n 2^{\log^* n})$ due to Martin Fürer, building off of the original algorithm by Schönhage–Strassen which ran in $\Theta(n \log n \log \log n)$ time. Regan and Lipton showed that a super-linear lower bound would follow from the Hartmanis-Stearns Real-Time Computability Conjecture.

A recent result of Afshani, Freksen, Kamma, and Larsen established the following conditional lower bound: assuming the network coding conjecture, any constant-degree Boolean circuit for multiplication must have size $\Omega(n\log n)$.

Clement C.
  • 4,461
  • 29
  • 51
Arindam Pal
  • 1,591
  • 12
  • 23
8

The Open Problems Project: http://cs.smith.edu/~orourke/TOPP/

Joseph O'Rourke
  • 3,753
  • 23
  • 41
8

Algebraic dichotomy conjecture (Bulatov, Jeavons and Krokhin): Assuming ETH, every constraint satisfaction problem is either in $P$ or requires $2^{ \Omega(n)}$ time.

Mohammad Al-Turkistany
  • 20,928
  • 5
  • 63
  • 149
  • 3
    Note that the dichotomy only applies to CSPs of a particular form, those consisting of all instances expressible with a fixed set of constraint relations. Moreover, there is a further technical issue, whether every set of constraint relations that is NP-complete can be equivalently represented by a finite set of relations. – András Salamon Nov 13 '10 at 20:46
8

Is Quasi-Polynomial Time in PSPACE?

Tayfun Pay
  • 2,598
  • 2
  • 18
  • 45
7
  • Do a pseudo-randomized numbers generator exist?
rursw1
  • 386
  • 4
  • 8
  • 2
    If one way function exists, then PRG exist. That is a very well known results from late 80s. So this doesn't make it a independent unresolved problem because if you start counting this, then so does most of the cryptographic primitives that can be constructed by OWF. – Jalaj Jan 27 '12 at 05:12
6

Finding natural SampNP-complete distributional problems.

Informally, Samp-NP is the class of NP problems restricted to distributions that are samplable in polynomial time ("On the Theory of Average Case Complexity", Ben-David, Chor, Goldreich and Luby, JCSS 1992, doi:10.1016/0022-0000(92)90019-F). This class aims to capture the complexity of solving NP on real life instances. While it is known that this class has complete problems, we do not know of any natural complete problems for this class. Finding such a problem would yield the first natural problem for which we have good theoretical reasons to believe that it is hard on average.

András Salamon
  • 19,000
  • 3
  • 64
  • 150
Or Meir
  • 5,380
  • 22
  • 33
6

Some open problems in complexity theory lower bounds, together with their relationships, are mapped here.

Manu
  • 7,659
  • 3
  • 37
  • 42
  • nice, useful, detailed, how about more of a summary/abstract on the misc relationships covered (which isnt really in the abstract either) – vzn Mar 03 '14 at 01:07
6

If P != NP, Does the polynomial hierarchy collapse?

(Because if P = NP then it completely collapses, of course)

Mohammad Al-Turkistany
  • 20,928
  • 5
  • 63
  • 149
rursw1
  • 386
  • 4
  • 8
5

Getting an O(1) factor approximation algorithm in polytime for the Maximum Independent Set of Rectangles.

This is one of the biggest open problems in Computational Geometry. Recently, Anna Adamaszek and Andreas Wiese [1] have given a QPTAS for this problem, which shows the existence of a PTAS assuming standard complexity theory conjectures. However even a constant factor approximation is not known yet that can be achieved in polytime. The best known polytime approximation factor is $O(\log \log n)$ [2]. More recently, Abed et al. [3] have given a constant factor approximation based on a conjecture.

[1] http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6686176

[2] http://ttic.uchicago.edu/~cjulia/papers/rectangles-SODA.pdf

[3] http://drops.dagstuhl.de/opus/volltexte/2015/5291/pdf/1.pdf

1

Is $\mathsf{L}=\mathsf{NL}$? What about non-uniform versions?

Kaveh
  • 21,577
  • 8
  • 82
  • 183
Jeigh
  • 317
  • 1
  • 6
1

Conjunctive query containment over bag semantics

In a GoogleFight between two search queries, can one tell if the first query always wins, without looking at the data?


This 1993 question from database theory [1] asks whether it is possible to decide if an SQL query (more precisely, a conjunctive query) always yields at least as many answers as another conjunctive query, over all possible databases. It would be nice to answer this question to help with query optimization.

One can also formalise the question without referring to databases (see also [2]):

Homomorphism Domination
Input: finite relational structures $S$ and $S'$.
Question: is it true that for any finite relational structure $T$, there are at least as many relational structure homomorphisms from $S$ to $T$ as there are from $S'$ to $T$?

Since the quantification is over an infinite set of structures $T$, this may be undecidable. It is known to be NP-hard [3] as a special case of a more general question over positive semirings (the non-negative integers with addition and multiplication is a semiring); $\Pi^P_2$-hardness was claimed two decades ago but remains unclear. If instead of conjunctive queries, slightly more general queries are allowed, then the problem does become undecidable, via reductions from Hilbert's 10th problem [4,5].

What makes this question interesting is that for most positive semirings of interest the general question is decidable, and actually in $\Pi_2^P$. In fact, for the Boolean semiring case the question becomes: given two conjunctive queries, is it always true that when the first query has an answer then so does the second? Ashok Chandra and Philip Merlin showed in 1977 [6] that this is equivalent to checking whether there exists a homomorphism between the queries, which is in NP. Moreover, in typical databases the queries are usually small or even fixed, while the data is large and changes frequently. This means that even brute force search for a homomorphism between the input queries may be worthwhile.

So it might be a good idea to look quite closely at two small fixed conjunctive queries to decide which is the better one to use. Yet we don't know if such queries can be compared based on the number of answers they generate.


Edit: added some key references as requested by Sylvain.

  1. Surajit Chaudhuri and Moshe Y. Vardi, Optimization of Real Conjunctive Queries, PODS 1993, 59–70. doi:10.1145/153850.153856
  2. Swastik Kopparty and Benjamin Rossman, The homomorphism domination exponent, EJOC 2011, 32(7) 1097–1114. doi:10.1016/j.ejc.2011.03.009
  3. Egor V. Kostylev, Juan L. Reutter, and András Z. Salamon, Classification of Annotation Semirings over Containment of Conjunctive Queries, TODS 2014, 39(1) 1:1–39. doi:10.1145/2556524
  4. Yannis E. Ioannidis and Raghu Ramakrishnan, Containment of conjunctive queries: beyond relations as sets, TODS 1995, 20(3) 288–324. doi:10.1145/211414.211419
  5. T.S. Jayram, Phokion G. Kolaitis, and Erik Vee, The containment problem for Real conjunctive queries with inequalities, PODS 2006, 80–89. doi:10.1145/1142351.1142363
  6. Ashok K. Chandra and Philip M. Merlin, Optimal Implementation of Conjunctive Queries in Relational Data Bases, STOC 1977, 77–90. doi:10.1145/800105.803397
András Salamon
  • 19,000
  • 3
  • 64
  • 150
  • Nice one. Could you provide more links to the literature, on decidable cases etc. ? – Sylvain Feb 28 '14 at 21:44
  • @Sylvain: as is usual here, I have avoided linking directly to my own work but our TODS 2014 paper has all the key links. – András Salamon Feb 28 '14 at 23:36
  • Since I'm not bound by such usage habits, here is a link for others who might be interested: http://tods.acm.org/accepted/2013/SalamonClassification.pdf. You could however provide a link to the PODS 1993 Chaudhuri & Vardi paper at http://dx.doi.org/10.1145/153850.153856 without being suspected of self-promotion. – Sylvain Mar 01 '14 at 15:52
1

In general, what is the relationship between time and space complexity classes?

There are many unresolved questions such as:

  • Is $PTIME = NLOGSPACE$?

  • Is $PTIME = DLOGSPACE$?

  • Is $PTIME = PSPACE$?

  • Is $DSPACE(s(n)) \subseteq PTIME$ for some $s(n) = \omega(\log n)$?

  • Is $DTIME(t(n)) \subseteq PSPACE$ for some super polynomial function $t(n)$?

Also, there are several more specific questions coming out of classic research papers that were never fully answered:

  • Is $NSPACE(k \log(n)) \subseteq DTIME(n^{k - \varepsilon})$ for any $k$ and $\varepsilon > 0$? (see conjecture from Kasai and Iwata 1985)

  • Is $DTIME(n) \subseteq DTISP(poly(n), \frac{n}{\log n})$? (simulation from Hopcroft, Paul, Valiant 1977 only seems to work with super polynomial time complexity)

  • Is $NSPACE(\log(n)) \subseteq DTISP(poly(n), \log^2 n)$? (simulation from Savitch's theorem only seems to work with super polynomial time complexity)

Michael Wehar
  • 5,127
  • 1
  • 24
  • 54
0

Can inductive-recursive types be constructed in univalent models of type theory?

Recently, Hugunin (2019) showed that inductive-inductive types are compatible with univalent models of type theory, and demonstrated how to construct them in cubical type theory. The case for inductive-recursive types, by far, is still unclear.

Some more open problems in type theory here.

xrq
  • 1,175
  • 6
  • 17