XGBoost - H2O crashes due to illegal memory access


While running a grid search with XGBoost, the H2O process crashes:

terminate called after throwing an instance of 'thrust::system::system_error'
  what():  /tmp/xgboost/plugin/updater_gpu/src/device_helpers.cuh(387): an illegal memory access was encountered

The crash comes right after these messages are logged:

08-17 06:44:46.672 10.0.1.89:54321       14426  FJ-1-3    INFO: Checking convergence with logloss metric: 0.04519170911104479 --> 0.02811784326194906 (still improving).
08-17 06:44:46.672 10.0.1.89:54321       14426  FJ-1-3    INFO: For grid: final_grid built: 90 models.

Java exception dump:

08-17 06:44:46.742 10.0.1.89:54321       14426  #12317-18 INFO: GET /99/Grids/final_grid, parms: {}
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR: java.lang.IllegalArgumentException: Field not found: 'col_sample_rate_change_per_level/_col_sample_rate_change_per_level' on object hex.tree.xgboost.XGBoostModel$XGBoostParameters@49356589
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at water.util.PojoUtils.getFieldValue(PojoUtils.java:562)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at hex.grid.Grid.createSummaryTable(Grid.java:370)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at hex.schemas.GridSchemaV99.fillFromImpl(GridSchemaV99.java:158)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at water.api.GridsHandler.fetch(GridsHandler.java:41)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at sun.reflect.GeneratedMethodAccessor41.invoke(Unknown Source)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at java.lang.reflect.Method.invoke(Method.java:498)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at water.api.Handler.handle(Handler.java:63)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at water.api.RequestServer.serve(RequestServer.java:448)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at water.api.RequestServer.doGeneric(RequestServer.java:297)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at water.api.RequestServer.doGet(RequestServer.java:221)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at javax.servlet.http.HttpServlet.service(HttpServlet.java:735)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at javax.servlet.http.HttpServlet.service(HttpServlet.java:848)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.servlet.ServletHolder.handle(ServletHolder.java:684)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:503)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1086)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:429)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1020)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:135)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:154)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:116)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at water.JettyHTTPD$LoginHandler.handle(JettyHTTPD.java:183)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:503)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1086)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:429)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1020)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:135)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:154)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:116)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at water.JettyHTTPD$LoginHandler.handle(JettyHTTPD.java:183)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:154)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:116)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.server.Server.handle(Server.java:370)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(AbstractHttpConnection.java:494)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(BlockingHttpConnection.java:53)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(AbstractHttpConnection.java:971)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.headerComplete(AbstractHttpConnection.java:1033)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:644)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:235)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpConnection.java:72)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(SocketConnector.java:264)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:608)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:543)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:         at java.lang.Thread.run(Thread.java:748)
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR: Caught exception:
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR: ERROR MESSAGE:
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR:
08-17 06:44:46.747 10.0.1.89:54321       14426  #12317-18 ERRR: Field not found: 'col_sample_rate_change_per_level/_col_sample_rate_change_per_level' on object hex.tree.xgboost.XGBoostModel$XGBoostParameters@49356589
h2o xgboost
1 Answer
2 votes
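XGBoost does not support col_sample_rate_change_per_level; that parameter is available only for GBM and random forest: http://docs.h2o.ai/h2o/latest-stable/h2o-docs/data-science/algo-params/col_sample_rate_change_per_level.html

That matches the "Field not found" error in the dump, which is raised while the grid summary looks the field up on XGBoostParameters.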

Here is the list of hyperparameters an XGBoost grid can use: http://docs.h2o.ai/h2o/latest-stable/h2o-docs/grid-search.html#xgboost-hyperparameters

(Of course, H2O should tell you this instead of crashing, so this is definitely a bug!)
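
Until that is fixed, one way to guard against this is to check the hyperparameter names yourself before building the grid. Below is a minimal sketch in plain Python, not part of the H2O API: the helper split_hyper_params and the set ASSUMED_XGBOOST_GRID_PARAMS are hypothetical names, and the set is only an illustrative subset that I am assuming is supported, so check it against the list linked above for your H2O version.

```python
# Illustrative subset of XGBoost grid hyperparameters. This set is an
# assumption made for the sketch; the linked grid-search documentation
# is the authoritative list for your H2O version.
ASSUMED_XGBOOST_GRID_PARAMS = frozenset({
    "ntrees", "max_depth", "min_rows", "learn_rate",
    "sample_rate", "col_sample_rate", "col_sample_rate_per_tree",
})


def split_hyper_params(hyper_params, supported=ASSUMED_XGBOOST_GRID_PARAMS):
    """Split a hyperparameter dict into (accepted, rejected) by name."""
    accepted = {k: v for k, v in hyper_params.items() if k in supported}
    rejected = {k: v for k, v in hyper_params.items() if k not in supported}
    return accepted, rejected


# Hypothetical search space containing the parameter from the error message.
hyper_params = {
    "max_depth": [3, 5, 7],
    "learn_rate": [0.05, 0.1],
    "col_sample_rate_change_per_level": [0.9, 1.0],  # GBM / random forest only
}

accepted, rejected = split_hyper_params(hyper_params)
print("passing to the grid:", sorted(accepted))
print("dropping:", sorted(rejected))
```

Only the accepted dict would then be passed as the grid's hyperparameters, so an unsupported name shows up in your own output rather than in a server-side stack trace.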
