0%

leetcode Guess Number Higher or Lower II

leetcode Guess Number Higher or Lower II

We are playing the Guess Game. The game is as follows:

I pick a number from 1 to n. You have to guess which number I picked.

Every time you guess wrong, I'll tell you whether the number I picked is higher or lower.

However, when you guess a particular number x, and you guess wrong, you pay $x. You win the game when you guess the number I picked.

Example:

1
2
3
4
5
6
7
8
9
n = 10, I pick 8.

First round: You guess 5, I tell you that it's higher. You pay $5.
Second round: You guess 7, I tell you that it's higher. You pay $7.
Third round: You guess 9, I tell you that it's lower. You pay $9.

Game over. 8 is the number I picked.

You end up paying $5 + $7 + $9 = $21.

Given a particular n ≥ 1, find out how much money you need to have to guarantee a win.

Hint:

  1. The best strategy to play the game is to minimize the maximum loss you could possibly face. Another strategy is to minimize the expected loss. Here, we are interested in the first scenario.
  2. Take a small example (n = 3). What do you end up paying in the worst case?
  3. Check out this article if you're still stuck.
  4. The purely recursive implementation of minimax would be worthless for even a small n. You MUST use dynamic programming.
  5. As a follow-up, how would you modify your code to solve the problem of minimizing the expected loss, instead of the worst-case loss?

题目地址: leetcode Guess Number Higher or Lower II

题意:给定一个数字n,从1~n中选取一个数字y让你猜。在猜的过程中,会告诉你你当前猜的数字x是比y大还是小,猜错的话,需要付$x。给定n,我们最少需要多少的钱来确保我们猜出y呢?

思路:在 374 Guess Number Higher or Lower 中,我们采用二分法来进行查找,但这题并不是用二分法。

这题要求我们在猜测数字y未知的情况下(1~n任意一个数),要我们在最坏情况下我们支付最少的钱。也就是说要考虑所有y的情况。

我们假定选择了一个错误的数x,(1<=x<=n && x!=y )那么就知道接下来应该从[1,x-1 ] 或者[x+1,n]中进行查找。 假如我们已经解决了[1,x-1] 和 [x+1,n]计算问题,我们将其表示为solve(L,x-1) 和solve(x+1,n),那么我们应该选择max(solve(L,x-1),solve(x+1,n)) 这样就是求最坏情况下的损失。总的损失就是f(x) = x + max(solve(L,x-1),solve(x+1,n))

那么将x从1~n进行遍历,取使得 f(x) 达到最小,来确定最坏情况下最小的损失,也就是我们初始应该选择哪个数。

上面的说法其实是一个自顶向下的过程(Top-down),可以用递归来解决。很容易得到如下的代码(这里用了记忆化搜索):

C++

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
class Solution {
public:
int getMoneyAmount(int n) {
vector<vector<int>> dp(n + 1, vector<int>(n + 1, 0));
return solve(dp, 1, n);
}

int solve(vector<vector<int>> &dp, int L, int R) {
if (L >= R) return 0;
if (dp[L][R]) return dp[L][R];
dp[L][R] = 0x7FFFFFFF;
for (int i = L; i <= R; i++) {
dp[L][R] = min(dp[L][R], i + max(solve(dp,L,i-1),solve(dp,i+1,R)));
}
return dp[L][R];
}
};

Java

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
public class Solution {
public int getMoneyAmount(int n) {
int[][] dp = new int[n+1][n+1];
return solve(dp, 1, n);
}
int solve(int[][] dp, int L, int R) {
if (L >= R) return 0;
if (dp[L][R] != 0) return dp[L][R];
dp[L][R] = 0x7FFFFFFF;
for (int i = L; i <= R; i++) {
dp[L][R] = Math.min(dp[L][R], i + Math.max(solve(dp,L,i-1),solve(dp,i+1,R)));
}
return dp[L][R];
}
}

Python

1
2
3
4
5
6
7
8
9
10
11
12
13
14
class Solution(object):
def getMoneyAmount(self, n):
"""
:type n: int
:rtype: int
"""
dp = [[0] * (n + 1) for _ in range(n + 1)]
return self.solve(dp, 1, n)

def solve(self, dp, L, R):
if L >= R: return 0
if dp[L][R]: return dp[L][R]
dp[L][R] = min(i + max(self.solve(dp, L, i - 1), self.solve(dp, i + 1, R)) for i in range(L, R + 1))
return dp[L][R]

 

打印路径

我们可以将路径打印出来,来看看具体的选择策略的过程,以便加深理解:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
class Solution(object):
def getMoneyAmount(self, n):
"""
:type n: int
:rtype: int
"""
dp = [[0] * (n + 1) for _ in range(n + 1)]
self.solve(dp, 1, n)
self.print_path(dp, 1, n)
return dp[1][n]

def solve(self, dp, L, R):
if L >= R: return 0
if dp[L][R]: return dp[L][R]
dp[L][R] = min(i + max(self.solve(dp, L, i - 1), self.solve(dp, i + 1, R)) for i in range(L, R + 1))
return dp[L][R]

def print_path(self, dp, L, R):
if L >= R: return
for i in range(L, R + 1):
if dp[L][R] == i + max(dp[L][i - 1], dp[i + 1][R]):
print i
if dp[L][i - 1] > dp[i + 1][R]:
self.print_path(dp, L, i - 1)
else:
self.print_path(dp, i + 1, R)
break

比如n = 10的时候, 上述程序输出7,9 说明第一步选7第二步选9这是最坏的情况。

有没有更坏的情况呢?答案是没有。7的右半边[8,10]最坏情况就是9了。7的左半边[1,6]可以选个3,然后选个5达到最坏的情况。3+5=8<9

迭代的方法

在记忆化搜索的解法中,c++ 88ms, java 15ms, 而Python则高达1800ms,差点TLE,确实效率不高。

于是将其改为迭代如下:

C++ 56ms

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
class Solution {
public:
int getMoneyAmount(int n) {
vector<vector<int>> dp(n + 1, vector<int>(n + 1, 0));
for (int L = n - 1; L > 0; L--) {
for (int R = L + 1; R <= n; R++) {
dp[L][R] = 0x7FFFFFFF; //INT_MAX
for (int i = L; i < R; i++) {
dp[L][R] = min(dp[L][R], i + max(dp[L][i - 1], dp[i + 1][R]));
}
}
}
return dp[1][n];
}
};

Java 14ms

1
2
3
4
5
6
7
8
9
10
11
12
13
14
public class Solution {
public int getMoneyAmount(int n) {
int[][] dp = new int[n+1][n+1];
for (int L = n - 1; L > 0; L--) {
for (int R = L + 1; R <= n; R++) {
dp[L][R] = 0x7FFFFFFF; //INT_MAX
for (int i = L; i < R; i++) {
dp[L][R] = Math.min(dp[L][R], i + Math.max(dp[L][i - 1], dp[i + 1][R]));
}
}
}
return dp[1][n];
}
}

Python 576ms

1
2
3
4
5
6
7
8
9
10
11
class Solution(object):
def getMoneyAmount(self, n):
"""
:type n: int
:rtype: int
"""
dp = [[0] * (n + 1) for _ in range(n + 1)]
for L in range(n - 1, 0, -1):
for R in range(L + 1, n + 1):
dp[L][R] = min(i + max(dp[L][i - 1], dp[i + 1][R]) for i in range(L, R))
return dp[1][n]

 

本题是leetcode 375 Guess Number Higher or Lower II 题解

更多题解可以查看:https://www.hrwhisper.me/leetcode-algorithm-solution/

请我喝杯咖啡吧~