无法在 M1 Mac 上运行 Selenium Chromedriver

问题描述 投票:0回答:2

我能够在配备 Intel 芯片的 Mac 上运行以下 Docker 文件,但在配备 M1 的 Mac 上运行它时出现错误。然后我尝试了

docker run --init --platform=linux/amd64 -e SPRING_PROFILES_ACTIVE=dev -e SERVER_FLAVOR=LOCAL_DEV -p 8080:8080 monolith-repo
docker buildx build --platform=linux/amd64 -t monolith-repo .
。这让 Docker 容器运行,但在尝试调用 selenium 时出现以下错误:

org.openqa.selenium.SessionNotCreatedException: Could not start a new session. Possible causes are invalid address of the remote server or browser start-up failure.
    at org.openqa.selenium.remote.RemoteWebDriver.execute(RemoteWebDriver.java:561)
    at org.openqa.selenium.remote.RemoteWebDriver.startSession(RemoteWebDriver.java:230)
    at org.openqa.selenium.remote.RemoteWebDriver.<init>(RemoteWebDriver.java:151)
    at org.openqa.selenium.chromium.ChromiumDriver.<init>(ChromiumDriver.java:108)
    at org.openqa.selenium.chrome.ChromeDriver.<init>(ChromeDriver.java:104)
    at org.openqa.selenium.chrome.ChromeDriver.<init>(ChromeDriver.java:91)
    at com.flockta.monolith.scraping.MakeNewWebpageScraperVersion2.getWebDriver(MakeNewWebpageScraperVersion2.java:176)
    at com.flockta.monolith.scraping.MakeNewWebpageScraperVersion2.getWebpage(MakeNewWebpageScraperVersion2.java:52)
    at com.flockta.monolith.job.ScrapingDataPipelineJob.getHtmls(ScrapingDataPipelineJob.java:308)
    at com.flockta.monolith.job.ScrapingDataPipelineJob.processWebPage(ScrapingDataPipelineJob.java:168)
    at com.flockta.monolith.job.ScrapingDataPipelineJob.processResult(ScrapingDataPipelineJob.java:153)
    at com.flockta.monolith.job.ScrapingDataPipelineJob.processResult(ScrapingDataPipelineJob.java:59)
    at com.flockta.monolith.job.AbstractScrapingPipelineJob.processResults(AbstractScrapingPipelineJob.java:57)
    at com.flockta.monolith.job.AbstractScrapingPipelineJob.process(AbstractScrapingPipelineJob.java:30)
    at com.flockta.monolith.ScheduledJobController.runDataJob(ScheduledJobController.java:71)
    at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:64)
    at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.base/java.lang.reflect.Method.invoke(Method.java:564)
    at org.springframework.scheduling.support.ScheduledMethodRunnable.run(ScheduledMethodRunnable.java:84)
    at org.springframework.scheduling.support.DelegatingErrorHandlingRunnable.run(DelegatingErrorHandlingRunnable.java:54)
    at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515)
    at java.base/java.util.concurrent.FutureTask.runAndReset(FutureTask.java:305)
    at java.base/java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:305)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1130)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:630)
    at java.base/java.lang.Thread.run(Thread.java:832)
Caused by: org.openqa.selenium.WebDriverException: Driver server process died prematurely.
Build info: version: '4.1.1', revision: 'e8fcc2cecf'
System info: host: '1c789f0433ca', ip: '172.17.0.3', os.name: 'Linux', os.arch: 'amd64', os.version: '5.10.76-linuxkit', java.version: '15.0.2'
Driver info: driver.version: ChromeDriver
    at org.openqa.selenium.remote.service.DriverService.start(DriverService.java:226)
    at org.openqa.selenium.remote.service.DriverCommandExecutor.execute(DriverCommandExecutor.java:98)
    at org.openqa.selenium.remote.RemoteWebDriver.execute(RemoteWebDriver.java:543)
    ... 26 common frames omitted

我的完整 Docker 文件是:

FROM maven:3.6.3-openjdk-15

#Chrome
ARG CHROME_VERSION=98.0.4758.102-1
ADD google-chrome.repo /etc/yum.repos.d/google-chrome.repo
RUN microdnf install -y google-chrome-stable-$CHROME_VERSION \
    && sed -i 's/"$HERE\/chrome"/"$HERE\/chrome" --no-sandbox/g' /opt/google/chrome/google-chrome

## ChromeDriver

ARG CHROME_DRIVER_VERSION=98.0.4758.102
RUN microdnf install -y unzip \
    && curl -s -o /tmp/chromedriver.zip https://chromedriver.storage.googleapis.com/$CHROME_DRIVER_VERSION/chromedriver_linux64.zip \
    && unzip /tmp/chromedriver.zip -d /opt \
    && rm /tmp/chromedriver.zip \
    && mv /opt/chromedriver /opt/chromedriver-$CHROME_DRIVER_VERSION \
    && chmod 755 /opt/chromedriver-$CHROME_DRIVER_VERSION \
    && ln -s /opt/chromedriver-$CHROME_DRIVER_VERSION /usr/bin/chromedriver

ENV CHROMEDRIVER_PORT 4444
ENV CHROMEDRIVER_WHITELISTED_IPS "127.0.0.1"
ENV CHROMEDRIVER_URL_BASE ''
EXPOSE 4444

EXPOSE 8080
ARG JAR_FILE=target/*.jar
COPY ${JAR_FILE} app.jar
ENTRYPOINT ["java","-jar", "-Xmx600m","/app.jar"]

另外,当我尝试在没有

buildx build --platform=linux/amd64
的情况下运行 docker build 时,我收到错误:

docker build -t monolith-repo .
[+] Building 12.0s (7/9)                                                                                                                                                                                                
 => [internal] load build definition from Dockerfile                                                                                                                                                               0.0s
 => => transferring dockerfile: 37B                                                                                                                                                                                0.0s
 => [internal] load .dockerignore                                                                                                                                                                                  0.0s
 => => transferring context: 2B                                                                                                                                                                                    0.0s
 => [internal] load metadata for docker.io/library/maven:3.6.3-openjdk-15                                                                                                                                          0.3s
 => [internal] load build context                                                                                                                                                                                  0.0s
 => => transferring context: 122B                                                                                                                                                                                  0.0s
 => [1/5] FROM docker.io/library/maven:3.6.3-openjdk-15@sha256:aac64d9d716f5fa3926e6c8f43c680fa8404faae0b8a014c0c9b3d73d2d0f66a                                                                                    0.0s
 => CACHED [2/5] ADD google-chrome.repo /etc/yum.repos.d/google-chrome.repo                                                                                                                                        0.0s
 => ERROR [3/5] RUN microdnf install -y google-chrome-stable-98.0.4758.102-1  && sed -i 's/"$HERE\/chrome"/"$HERE\/chrome" --no-sandbox/g' /opt/google/chrome/google-chrome                                       11.6s
------                                                                                                                                                                                                                  
 > [3/5] RUN microdnf install -y google-chrome-stable-98.0.4758.102-1   && sed -i 's/"$HERE\/chrome"/"$HERE\/chrome" --no-sandbox/g' /opt/google/chrome/google-chrome:                                                  
#7 0.232 Downloading metadata...                                                                                                                                                                                        
#7 5.286 Downloading metadata...
#7 9.705 Downloading metadata...
#7 11.53 error: Could not depsolve transaction; 1 problem detected:
#7 11.53  Problem: conflicting requests
#7 11.53   - package google-chrome-stable-98.0.4758.102-1.x86_64 does not have a compatible architecture
#7 11.53   - nothing provides libm.so.6(GLIBC_2.2.5)(64bit) needed by google-chrome-stable-98.0.4758.102-1.x86_64
#7 11.53   - nothing provides ld-linux-x86-64.so.2(GLIBC_2.2.5)(64bit) needed by google-chrome-stable-98.0.4758.102-1.x86_64
#7 11.53   - nothing provides libpthread.so.0(GLIBC_2.2.5)(64bit) needed by google-chrome-stable-98.0.4758.102-1.x86_64
#7 11.53   - nothing provides libdl.so.2(GLIBC_2.2.5)(64bit) needed by google-chrome-stable-98.0.4758.102-1.x86_64
#7 11.53   - nothing provides librt.so.1(GLIBC_2.2.5)(64bit) needed by google-chrome-stable-98.0.4758.102-1.x86_64
#7 11.53   - nothing provides libpthread.so.0(GLIBC_2.3.2)(64bit) needed by google-chrome-stable-98.0.4758.102-1.x86_64
#7 11.53   - nothing provides libpthread.so.0(GLIBC_2.12)(64bit) needed by google-chrome-stable-98.0.4758.102-1.x86_64
#7 11.53   - nothing provides libpthread.so.0(GLIBC_2.3.4)(64bit) needed by google-chrome-stable-98.0.4758.102-1.x86_64
#7 11.53   - nothing provides ld-linux-x86-64.so.2(GLIBC_2.3)(64bit) needed by google-chrome-stable-98.0.4758.102-1.x86_64
#7 11.53   - nothing provides ld-linux-x86-64.so.2()(64bit) needed by google-chrome-stable-98.0.4758.102-1.x86_64
#7 11.53   - nothing provides libpthread.so.0(GLIBC_2.3.3)(64bit) needed by google-chrome-stable-98.0.4758.102-1.x86_64
------
executor failed running [/bin/sh -c microdnf install -y google-chrome-stable-$CHROME_VERSION    && sed -i 's/"$HERE\/chrome"/"$HERE\/chrome" --no-sandbox/g' /opt/google/chrome/google-chrome]: exit code: 1

我在构建过程中注意到两件事:

  1. “软件包 google-chrome-stable-98.0.4758.102-1.x86_64 没有兼容的架构”
  2. 我正在使用'chromedriver_linux64.zip(但它永远不会到达那个阶段),尽管https://chromedriver.storage.googleapis.com/显示有一个chromedriver_mac64_m1很弱,

有没有办法让 Chrome 在我的本地计算机上运行?具体来说,我需要能够在我的 Mac 上运行它并将其部署到 AWS。我认为AWS可以通过

buildx build --platform=linux/amd64
解决,但我不知道如何让它在本地运行。有什么想法吗?

docker selenium google-chrome selenium-chromedriver apple-m1
2个回答
7
投票

最重要的问题是,在

amd
(英特尔)上运行的 Linux 可能尚无法在
arm
上使用。具体请参阅 https://github.com/SeleniumHQ/docker-selenium/issues/1076 以及 jamesmortensen 在 https://hub.docker.com/u/seleniarm 存储库中所做的出色工作(特别是 https://hub.docker.com/r/seleniarm/standalone-chromium/tags)。要使用它,请执行以下操作:

FROM seleniarm/standalone-chromium:4.1.1-alpha-20220119

ENV CHROMEDRIVER_PORT 4444
ENV CHROMEDRIVER_WHITELISTED_IPS "127.0.0.1"
ENV CHROMEDRIVER_URL_BASE ''
EXPOSE 4444

EXPOSE 8080
EXPOSE 5005
ARG JAR_FILE=target/*.jar
COPY ${JAR_FILE} app.jar
# For Testing
ENTRYPOINT ["java","-jar", "-Xmx600m","/app.jar"]

Java代码是:

return new ChromeDriver(service, getChromeOptions());

private ChromeOptions getChromeOptions() {
        ChromeOptions chromeOptions = new ChromeOptions();
        // User agent is required because some websites will reject your request if it does not have a user agent
        chromeOptions.addArguments(String.format("user-agent=%s", USER_AGENT));
        chromeOptions.addArguments("--log-level=OFF");
        chromeOptions.setHeadless(true);
        List<String> arguments = new LinkedList<>();
        arguments.add("--disable-extensions");
        arguments.add("--headless");
        arguments.add("--disable-gpu");
        arguments.add("--no-sandbox");
        arguments.add("--incognito");
        arguments.add("--disable-application-cache");
        arguments.add("--disable-dev-shm-usage");
        chromeOptions.addArguments(arguments);
        return chromeOptions;
    }

请注意,独立版是 chromium ,而不是 chromedriver ,但这可以工作,因为 chromedriver 基于 chromium 。

其根本原因是许多软件包(例如 https://www.ubuntuupdates.org/package/google_chrome/stable/main/base/google-chrome-stable)还没有

arm
版本(他们只有
amd
基于英特尔的版本)。

对于 Docker 文件,我建议现在有两个 docker 文件(很像 zwbetz-gh 在 12 月 28 日的评论,请参阅 https://github.com/SeleniumHQ/docker-selenium/issues/1076)。要构建

arn
版本,您需要执行以下操作:
docker build -f DOCKER_FILE_ARN -t your_tag
。虽然我还需要测试它,但对于非 arn 文件,你会这样做:
docker buildx build --platform=linux/amd64 -f DOCKER_FILE_AMD -t your_tag


0
投票

我们的团队已经放弃在 Mac 上使用 selenium-docker,并已转向 Browserstack 来运行我们的自动化测试。我们尝试了 ARM 映像和常规映像,但没有成功。我觉得 selenium 团队不再支持 ARM 的东西了。

© www.soinside.com 2019 - 2024. All rights reserved.