Skip to content
This repository was archived by the owner on Jun 6, 2024. It is now read-only.

Commit d60cba4

Browse files
authored
Add release note for v1.8.0 (#5564)
1 parent ba25509 commit d60cba4

File tree

6 files changed

+38
-22
lines changed

6 files changed

+38
-22
lines changed

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@
66
[![Join the chat at https://gitter.im/Microsoft/pai](https://badges.gitter.im/Microsoft/pai.svg)](https://gitter.im/Microsoft/pai?utm_source=badge&utm_medium=badge&utm_campaign=pr-badge&utm_content=badge)
77
[![Version](https://img.shields.io/github/release/Microsoft/pai.svg)](https://github.com/Microsoft/pai/releases/latest)
88

9-
**OpenPAI [v1.7.0](./RELEASE_NOTE.md#April-2021-version-170) has been released!**
9+
**OpenPAI [v1.8.0](./RELEASE_NOTE.md#July-2021-version-180) has been released!**
1010

1111
With the release of v1.0, OpenPAI is switching to a more robust, more powerful and lightweight architecture. OpenPAI is also becoming more and more modular so that the platform can be easily customized and expanded to suit new needs. OpenPAI also provides many AI user-friendly features, making it easier for end users and administrators to complete daily AI tasks.
1212

RELEASE_NOTE.md

Lines changed: 17 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,21 @@
11
# OpenPAI Release Note
22

3+
## July 2021 (version 1.8.0)
4+
5+
- Marketplace related update
6+
- Please see [Marketplace](https://github.com/microsoft/openpaimarketplace/releases/tag/v1.8.0) for more details
7+
8+
- Alert manager
9+
- Send alert to users when job status changed #5337
10+
11+
- Webportal
12+
- Support UX of Job Priority #5417
13+
14+
- Others
15+
- Customizable Autoscaler #5412
16+
- Add custom ssl port support #5386
17+
- Clean up repo. Remove obsolete code #5489
18+
319
## April 2021 (version 1.7.0)
420

521
- Marketplace related update
@@ -11,7 +27,7 @@
1127
- In new submission page, the sidebar can be shrank to give the main area more visual space.
1228
- The new submission page moves the yaml editor into a single page, which allows user to focus on setting config or editing yaml protocol.
1329
- The new submission page improves the responsive design in small and medium resolution.
14-
30+
1531
> Know Issue: Tensorboard tool is not implemented in the new submission page yet. If you need to use it, please use the old version.
1632
1733
- Alert system enhancement

contrib/kubespray/config/config.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
user: forexample
22
password: forexample
3-
docker_image_tag: v1.7.0
3+
docker_image_tag: v1.8.0
44

55
# Optional
66

docs/manual/cluster-admin/installation-guide.md

Lines changed: 12 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ To install OpenPAI >= `v1.0.0`, please first check [Installation Requirements](#
88

99
The deployment of OpenPAI requires you to have **at least 3 separate machines**: one dev box machine, one master machine, and one worker machine.
1010

11-
Dev box machine controls masters and workers through SSH during installation, maintenance, and uninstallation. There should be one, and only one dev box.
11+
Dev box machine controls masters and workers through SSH during installation, maintenance, and uninstallation. There should be one, and only one dev box.
1212

1313
The master machine is used to run core Kubernetes components and core OpenPAI services. Currently, OpenPAI does not support high availability and you can only specify one master machine.
1414

@@ -27,7 +27,7 @@ We recommend you to use CPU-only machines for dev box and master. The detailed r
2727
<td>Dev Box Machine</td>
2828
<td>
2929
<ul>
30-
<li>It can communicate with all other machines (master and worker machines).</li>
30+
<li>It can communicate with all other machines (master and worker machines).</li>
3131
<li>It is separate from the cluster which contains the master machine and worker machines.</li>
3232
<li>It can access the internet, especially needs to have access to the docker hub registry service or its mirror. Deployment process will pull Docker images.</li>
3333
</ul>
@@ -38,7 +38,7 @@ We recommend you to use CPU-only machines for dev box and master. The detailed r
3838
<li>SSH service is enabled.</li>
3939
<li>Passwordless ssh to all other machines (master and worker machines).</li>
4040
<li>Docker is installed.</li>
41-
</ul>
41+
</ul>
4242
</td>
4343
</tr>
4444
<tr>
@@ -66,16 +66,16 @@ We recommend you to use CPU-only machines for dev box and master. The detailed r
6666

6767
The worker machines are used to run jobs. You can use multiple workers during installation.
6868

69-
We support various types of workers: CPU workers, GPU workers, and workers with other computing devices (e.g. TPU, NPU).
69+
We support various types of workers: CPU workers, GPU workers, and workers with other computing devices (e.g. TPU, NPU).
7070

7171
At the same time, we also support two schedulers: the Kubernetes default scheduler, and [hivedscheduler](https://github.com/microsoft/hivedscheduler).
7272

73-
Hivedscheduler is the default for OpenPAI. It supports virtual cluster division, topology-aware resource guarantee, and optimized gang scheduling, which are not supported in the k8s default scheduler.
73+
Hivedscheduler is the default for OpenPAI. It supports virtual cluster division, topology-aware resource guarantee, and optimized gang scheduling, which are not supported in the k8s default scheduler.
7474

7575

7676
For now, the support for CPU/NVIDIA GPU workers and workers with other computing device is different:
7777

78-
- For CPU workers and NVIDIA GPU workers, both k8s default scheduler and hived scheduler can be used.
78+
- For CPU workers and NVIDIA GPU workers, both k8s default scheduler and hived scheduler can be used.
7979
- For workers with other types of computing devices (e.g. TPU, NPU), currently, we only support the usage of the k8s default scheduler. You can only include workers with the same computing device in the cluster. For example, you can use TPU workers, but all workers should be TPU workers. You cannot use TPU workers together with GPU workers in one cluster.
8080

8181
Please check the following requirements for different types of worker machines:
@@ -116,7 +116,7 @@ Please check the following requirements for different types of worker machines:
116116
<ul>
117117
<li><b>NVIDIA GPU Driver is installed.</b> You may use <a href="./installation-faqs-and-troubleshooting.html#how-to-check-whether-the-gpu-driver-is-installed">a command</a> to check it. Refer to <a href="./installation-faqs-and-troubleshooting.html#how-to-install-gpu-driver">the installation guidance</a> in FAQs if the driver is not successfully installed. If you are wondering which version of GPU driver you should use, please also refer to <a href="./installation-faqs-and-troubleshooting.html#which-version-of-nvidia-driver-should-i-install">FAQs</a>.</li>
118118
<li><b><a href="https://github.com/NVIDIA/nvidia-container-runtime">nvidia-container-runtime</a> is installed. And be configured as the default runtime of docker.</b> Please configure it in <a href="https://docs.docker.com/config/daemon/#configure-the-docker-daemon">docker-config-file (daemon.json)</a>, instead of in the systemd's config. You can use command <code>sudo docker run --rm nvidia/cuda:10.0-base nvidia-smi</code> to check it. This command should output information of available GPUs if it is setup properly. Refer to <a href="./installation-faqs-and-troubleshooting.html#how-to-install-nvidia-container-runtime">the installation guidance</a> if it is not successfully set up. We don't recommend to use <code>nvidia-docker2</code>. For a detailed comparison between <code>nvidia-container-runtime</code> and <code>nvidia-docker2</code>, please refer to <a href="https://github.com/NVIDIA/nvidia-docker/issues/1268#issuecomment-632692949">here</a>. </li>
119-
</ul>
119+
</ul>
120120
</td>
121121
</tr>
122122
<tr>
@@ -139,7 +139,7 @@ Please check the following requirements for different types of worker machines:
139139
<li>The driver of the device is installed.</li>
140140
<li>The container runtime of the device is installed. And be configured as the default runtime of docker. Please configure it in <a href="https://docs.docker.com/config/daemon/#configure-the-docker-daemon">docker-config-file</a>, because systemd's env will be overwritten during installation.</li>
141141
<li>You should have a deployable <a href="https://kubernetes.io/docs/concepts/extend-kubernetes/compute-storage-net/device-plugins/">device plugin</a> of the computing device. After the Kubernetes is set up, you should manually deploy it in cluster. </li>
142-
</ul>
142+
</ul>
143143
</td>
144144
</tr>
145145
</tbody>
@@ -163,7 +163,7 @@ cd pai
163163
Choose a version to install by checkout to a specific tag:
164164

165165
```bash
166-
git checkout v1.7.0
166+
git checkout v1.8.0
167167
```
168168

169169
Please edit `layout.yaml` and a `config.yaml` file under `<pai-code-dir>/contrib/kubespray/config` folder.
@@ -220,7 +220,7 @@ machine-list:
220220
``` yaml
221221
user: forexample
222222
password: forexample
223-
docker_image_tag: v1.7.0
223+
docker_image_tag: v1.8.0
224224
225225
# Optional
226226
@@ -236,7 +236,7 @@ docker_image_tag: v1.7.0
236236
# docker_cache_azure_container_name: "dockerregistry"
237237
# docker_cache_fs_mount_path: "/var/lib/registry"
238238
# docker_cache_remote_url: "https://registry-1.docker.io"
239-
# docker_cache_htpasswd: ""
239+
# docker_cache_htpasswd: ""
240240
# enable_marketplace: "true"
241241
242242
#############################################
@@ -362,7 +362,7 @@ You can run the following commands to set up kubectl on your localhost:
362362
ansible-playbook -i ${HOME}/pai-deploy/kubespray/inventory/pai/hosts.yml set-kubectl.yml --ask-become-pass
363363
```
364364

365-
By default, we don't set up `kubeconfig` or install `kubectl` client on the dev box machine, but we put the Kubernetes config file in `~/pai-deploy/kube/config`. You can use the config with any Kubernetes client to verify the installation.
365+
By default, we don't set up `kubeconfig` or install `kubectl` client on the dev box machine, but we put the Kubernetes config file in `~/pai-deploy/kube/config`. You can use the config with any Kubernetes client to verify the installation.
366366

367367
Also, you can use the command `ansible-playbook -i ${HOME}/pai-deploy/kubespray/inventory/pai/hosts.yml set-kubectl.yml --ask-become-pass` to set up `kubeconfig` and `kubectl` on the dev box machine. It will copy the config to `~/.kube/config` and set up the `kubectl` client. After it is executed, you can use `kubectl` on the dev box machine directly.
368368

docs_zh_CN/manual/cluster-admin/installation-guide.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@ master机器用于运行核心Kubernetes组件和核心OpenPAI服务。目前,
2727
<td>dev box 机器</td>
2828
<td>
2929
<ul>
30-
<li>它可以与所有其他机器(master和worker机器)通信。</li>
30+
<li>它可以与所有其他机器(master和worker机器)通信。</li>
3131
<li>它是独立于master机器和worker机器之外的一台机器。</li>
3232
<li>它可以访问Internet。尤其是可以访问Docker Hub。部署过程会从Docker Hub拉取Docker镜像。</li>
3333
</ul>
@@ -38,7 +38,7 @@ master机器用于运行核心Kubernetes组件和核心OpenPAI服务。目前,
3838
<li>SSH服务已开启。</li>
3939
<li>可以免密登录所有master和worker机器。</li>
4040
<li>Docker已被正确安装。</li>
41-
</ul>
41+
</ul>
4242
</td>
4343
</tr>
4444
<tr>
@@ -116,7 +116,7 @@ hivedscheduler是OpenPAI的默认调度器,它支持虚拟集群划分,拓
116116
<ul>
117117
<li><b>GPU驱动已被正确安装。</b> 您可以用<a href="./installation-faqs-and-troubleshooting.html#how-to-check-whether-the-gpu-driver-is-installed">这个命令</a>来检查。 如果您的GPU驱动未被正确安装,可以参考<a href="./installation-faqs-and-troubleshooting.html#how-to-install-gpu-driver">如何安装GPU驱动</a>。如果您对安装哪个版本的GPU驱动有疑问,可以阅读<a href="./installation-faqs-and-troubleshooting.html#which-version-of-nvidia-driver-should-i-install">这个文档</a>。</li>
118118
<li><b><a href="https://github.com/NVIDIA/nvidia-container-runtime">nvidia-container-runtime</a>已被正确安装,并且被设置为Docker的默认runtime。</b> 因为systemd的配置会在接下来安装过程中被覆盖,所以请不要在systemd里设置 docker 默认runtime,而是在<a href="https://docs.docker.com/config/daemon/#configure-the-docker-daemon">docker-config-file (daemon.json)</a>里进行设置。 您可以使用命令<code>sudo docker run --rm nvidia/cuda:10.0-base nvidia-smi</code> 来检查这一项。如果该命令成功打出当前可用的显卡个数,就说明设置是没问题的。如果它未被正确安装,请参考<a href="./installation-faqs-and-troubleshooting.html#how-to-install-nvidia-container-runtime">如何安装nvidia container runtime</a>。 我们不推荐您使用<code>nvidia-docker2</code>。 有关 <code>nvidia-container-runtime</code> 和 <code>nvidia-docker2</code> 的详细对比,请参考<a href="https://github.com/NVIDIA/nvidia-docker/issues/1268#issuecomment-632692949">这里</a>。</li>
119-
</ul>
119+
</ul>
120120
</td>
121121
</tr>
122122
<tr>
@@ -139,7 +139,7 @@ hivedscheduler是OpenPAI的默认调度器,它支持虚拟集群划分,拓
139139
<li>设备的驱动已被正确安装</li>
140140
<li>设备的 container runtime 已被正确安装,并且被设置为Docker的默认runtime。因为systemd的配置会在接下来安装过程中被覆盖,所以请不要在systemd里设置 docker 默认runtime,而是在<a href="https://docs.docker.com/config/daemon/#configure-the-docker-daemon">docker-config-file</a>里进行设置。</li>
141141
<li>您需要用一个该设备的<a href="https://kubernetes.io/docs/concepts/extend-kubernetes/compute-storage-net/device-plugins/">device plugin</a>。在Kubernetes安装后,您需要手动将该device plugin部署在集群中。</li>
142-
</ul>
142+
</ul>
143143
</td>
144144
</tr>
145145
</tbody>
@@ -163,7 +163,7 @@ cd pai
163163
checkout到某一个tag,来选择需要安装的OpenPAI版本:
164164

165165
```bash
166-
git checkout v1.7.0
166+
git checkout v1.8.0
167167
```
168168

169169
接下来,请编辑`<pai-code-dir>/contrib/kubespray/config`目录下的`layout.yaml``config.yaml`文件。
@@ -221,7 +221,7 @@ machine-list:
221221
``` yaml
222222
user: forexample
223223
password: forexample
224-
docker_image_tag: v1.7.0
224+
docker_image_tag: v1.8.0
225225
226226
# Optional
227227

version/PAI.VERSION

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
v1.7.0
1+
v1.8.0

0 commit comments

Comments
 (0)