The Power of Subsampling in Submodular Maximization

Christopher Harshaw, Ehsan Kazemi, Moran Feldman, Amin Karbasi

Research output: Contribution to journalArticlepeer-review

Abstract

We propose subsampling as a unified algorithmic technique for submodular maximization in centralized and online settings. The idea is simple: independently sample elements from the ground set and use simple combinatorial techniques (such as greedy or local search) on these sampled elements. We show that this approach leads to optimal/state-of-the-art results despite being much simpler than existing methods. In the usual off-line setting, we present SAMPLEGREEDY, which obtains a (p + 2 + o(1))-approximation for maximizing a submodular function subject to a p-extendible system using O(n + nk=p) evaluation and feasibility queries, where k is the size of the largest feasible set. The approximation ratio improves to p + 1 and p for monotone submodular and linear objectives, respectively. In the streaming setting, we present SAMPLE-STREAMING, which obtains a (4p + 2 − o(1))-approximation for maximizing a submodular function subject to a p-matchoid using O(k) memory and O(km=p) evaluation and feasibility queries per element, and m is the number of matroids defining the p-matchoid. The approximation ratio improves to 4p for monotone submodular objectives. We empirically demonstrate the effectiveness of our algorithms on video summarization, location summarization, and movie recommendation tasks.

Original languageEnglish
Article number2
Pages (from-to)1365-1393
Number of pages29
JournalMathematics of Operations Research
Volume47
Issue number2
Early online date13 Oct 2021
DOIs
StatePublished - May 2022
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2021 INFORMS

Keywords

  • approximation algorithms
  • p-extendible systems
  • p-matchoids
  • streaming algorithms
  • submodular maximization
  • subsampling

Fingerprint

Dive into the research topics of 'The Power of Subsampling in Submodular Maximization'. Together they form a unique fingerprint.

Cite this