Writing a WordCount Example in Spark on YARN Mode (学亮编程手记's blog)

Tags: spark


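Step 1: Upload the input file to Hadoop (HDFS). In this example the uploaded file is named spark.txt.

Step 2: Package the program. The resulting jar is named Spark-0.0.1-SNAPSHOT.jar.

Step 3: Upload the files to the Spark cluster and deploy. The program source is as follows: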

import java.util.ArrayList;
import java.util.List;

import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext;
import org.apache.spark.api.java.function.Function2;
import org.apache.spark.api.java.function.PairFlatMapFunction;

import scala.Tuple2;

public class SparkMain {

    @SuppressWarnings("serial")
    public static void main(String[] args) {

        SparkConf conf = new SparkConf().setAppName("Spark");
        /* Standalone mode
        conf.setMaster("spark://master56:7077");
        conf.set("spark.cores.max", "48");
        */
        /* yarn-client mode */
        conf.setMaster("yarn-client");
        // Application jar to distribute to the executors
        conf.setJars(new String[] {
            "/home/hadoop/Spark-0.0.1-SNAPSHOT/lib/Spark-0.0.1-SNAPSHOT.jar"});
        // Spark home directory
        conf.setSparkHome("/home/hadoop/spark-1.2.0-cdh5.3.2");
        // Resource settings for the run
        conf.set("spark.executor.instances", "30");
        conf.set("spark.executor.cores", "3");
        conf.set("spark.executor.memory", "5G");
        conf.set("spark.driver.memory", "3G");
        conf.set("spark.driver.maxResultSize", "10G");
        JavaSparkContext context = new JavaSparkContext(conf);
        // Read the input file from HDFS
        JavaRDD<String> rdd = context.textFile("hdfs://nujhadoop/spark.txt");
        List<Tuple2<String, Integer>> result = rdd.flatMapToPair(
            new PairFlatMapFunction<String, String, Integer>() {

                // Split each line on single spaces and emit one (word, 1) pair per token
                @Override
                public Iterable<Tuple2<String, Integer>> call(String line)
                    throws Exception {
                    ArrayList<Tuple2<String, Integer>> list = new ArrayList<Tuple2<String, Integer>>();
                    String[] words = line.split(" ");
                    for (String word : words) {
                        list.add(new Tuple2<String, Integer>(word, 1));
                    }
                    return list;
                }

            }).reduceByKey(new Function2<Integer, Integer, Integer>() {

                // Add up the counts that belong to the same word
                @Override
                public Integer call(Integer a, Integer b)
                    throws Exception {
                    return a + b;
                }

            }).collect();
        // Print the results as "word,count"
        for (Tuple2<String, Integer> entry : result) {
            System.out.println(entry._1 + "," + entry._2);
        }
        context.stop();
    }
}
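To check the counting logic without a cluster, the same two steps can be mimicked in plain Java: split each line on a single space and emit one count per token, then sum the counts per word. This is only a local sketch and does not use any Spark API; the class name LocalWordCount, the method count and the sample lines are made up for illustration and are not the contents of spark.txt.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical local stand-in for the flatMapToPair + reduceByKey pipeline.
class LocalWordCount {

    // One count per space-separated token, summed per distinct token.
    static Map<String, Integer> count(List<String> lines) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (String line : lines) {
            for (String token : line.split(" ")) {
                // merge() plays the role of the reduce function: old + 1
                counts.merge(token, 1, Integer::sum);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        // Made-up sample input for illustration only.
        List<String> lines = Arrays.asList("a fox jumped up", "he jumped again, again");
        for (Map.Entry<String, Integer> e : count(lines).entrySet()) {
            System.out.println(e.getKey() + "," + e.getValue());
        }
    }
}
```

Because the split is on a single space only, punctuation stays attached to the tokens, so `again,` and `again` are counted as two different words here. The same effect explains keys such as `grapes.The` and `up,` in the output of the real job.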

Log output:

Using Spark's default log4j profile: org/apache/spark/log4j-defaults.properties  
15/07/15 16:45:10 INFO SecurityManager: Changing view acls to: hadoop  
15/07/15 16:45:10 INFO SecurityManager: Changing modify acls to: hadoop  
15/07/15 16:45:10 INFO SecurityManager: SecurityManager: authentication disabled; ui acls disabled; users with view permissions: Set(hadoop); users with modify permissions: Set(hadoop)  
15/07/15 16:45:11 INFO Slf4jLogger: Slf4jLogger started  
15/07/15 16:45:11 INFO Remoting: Starting remoting  
15/07/15 16:45:11 INFO Remoting: Remoting started; listening on addresses :[akka.tcp://[email protected]:22597]  
15/07/15 16:45:11 INFO Remoting: Remoting now listens on addresses: [akka.tcp://[email protected]:22597]  
15/07/15 16:45:11 INFO Utils: Successfully started service 'sparkDriver' on port 22597.  
15/07/15 16:45:11 INFO SparkEnv: Registering MapOutputTracker  
15/07/15 16:45:11 INFO SparkEnv: Registering BlockManagerMaster  
15/07/15 16:45:11 INFO DiskBlockManager: Created local directory at /tmp/spark-local-20150715164511-17b9  
15/07/15 16:45:11 INFO MemoryStore: MemoryStore started with capacity 1635.9 MB  
15/07/15 16:45:12 WARN NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable  
15/07/15 16:45:12 INFO HttpFileServer: HTTP File server directory is /tmp/spark-bd6a9445-0d51-4d1b-9fc5-b4dcbcdd4cd0  
15/07/15 16:45:12 INFO HttpServer: Starting HTTP Server  
15/07/15 16:45:12 INFO Utils: Successfully started service 'HTTP file server' on port 54673.  
15/07/15 16:45:12 INFO Utils: Successfully started service 'SparkUI' on port 4040.  
15/07/15 16:45:12 INFO SparkUI: Started SparkUI at http://slave63:4040  
15/07/15 16:45:13 INFO SparkContext: Added JAR /home/hadoop/Spark-0.0.1-SNAPSHOT/lib/Spark-0.0.1-SNAPSHOT.jar at http://172.20.10.63:54673/jars/Spark-0.0.1-SNAPSHOT.jar with timestamp 1436949913052  
15/07/15 16:45:13 INFO RMProxy: Connecting to ResourceManager at master46/172.20.10.46:8032  
15/07/15 16:45:13 INFO Client: Requesting a new application from cluster with 30 NodeManagers  
15/07/15 16:45:13 INFO Client: Verifying our application has not requested more than the maximum memory capability of the cluster (8192 MB per container)  
15/07/15 16:45:13 INFO Client: Will allocate AM container, with 3456 MB memory including 384 MB overhead  
15/07/15 16:45:13 INFO Client: Setting up container launch context for our AM  
15/07/15 16:45:13 INFO Client: Preparing resources for our AM container  
15/07/15 16:45:14 INFO Client: Uploading resource file:/home/hadoop/Spark-0.0.1-SNAPSHOT/lib/spark-assembly-1.2.0-cdh5.3.2.jar -> hdfs://nujhadoop/user/hadoop/.sparkStaging/application_1434338096593_8055/spark-assembly-1.2.0-cdh5.3.2.jar  
15/07/15 16:45:15 INFO Client: Setting up the launch environment for our AM container  
15/07/15 16:45:16 INFO SecurityManager: Changing view acls to: hadoop  
15/07/15 16:45:16 INFO SecurityManager: Changing modify acls to: hadoop  
15/07/15 16:45:16 INFO SecurityManager: SecurityManager: authentication disabled; ui acls disabled; users with view permissions: Set(hadoop); users with modify permissions: Set(hadoop)  
15/07/15 16:45:16 INFO Client: Submitting application 8055 to ResourceManager  
15/07/15 16:45:16 INFO YarnClientImpl: Submitted application application_1434338096593_8055  
15/07/15 16:45:17 INFO Client: Application report for application_1434338096593_8055 (state: ACCEPTED)  
15/07/15 16:45:17 INFO Client:   
     client token: N/A  
     diagnostics: N/A  
     ApplicationMaster host: N/A  
     ApplicationMaster RPC port: -1  
     queue: root.hadoop  
     start time: 1436949916087  
     final status: UNDEFINED  
     tracking URL: http://master46:8088/proxy/application_1434338096593_8055/  
     user: hadoop  
15/07/15 16:45:18 INFO Client: Application report for application_1434338096593_8055 (state: ACCEPTED)  
15/07/15 16:45:19 INFO Client: Application report for application_1434338096593_8055 (state: ACCEPTED)  
15/07/15 16:45:20 INFO Client: Application report for application_1434338096593_8055 (state: ACCEPTED)  
15/07/15 16:45:21 INFO Client: Application report for application_1434338096593_8055 (state: ACCEPTED)  
15/07/15 16:45:22 INFO Client: Application report for application_1434338096593_8055 (state: ACCEPTED)  
15/07/15 16:45:22 INFO YarnClientSchedulerBackend: ApplicationMaster registered as Actor[akka.tcp://[email protected]:55325/user/YarnAM#945036977]  
15/07/15 16:45:22 INFO YarnClientSchedulerBackend: Add WebUI Filter. org.apache.hadoop.yarn.server.webproxy.amfilter.AmIpFilter, Map(PROXY_HOSTS -> master46, PROXY_URI_BASES -> http://master46:8088/proxy/application_1434338096593_8055), /proxy/application_1434338096593_8055  
15/07/15 16:45:22 INFO JettyUtils: Adding filter: org.apache.hadoop.yarn.server.webproxy.amfilter.AmIpFilter  
15/07/15 16:45:23 INFO Client: Application report for application_1434338096593_8055 (state: RUNNING)  
15/07/15 16:45:23 INFO Client:   
     client token: N/A  
     diagnostics: N/A  
     ApplicationMaster host: slave28  
     ApplicationMaster RPC port: 0  
     queue: root.hadoop  
     start time: 1436949916087  
     final status: UNDEFINED  
     tracking URL: http://master46:8088/proxy/application_1434338096593_8055/  
     user: hadoop  
15/07/15 16:45:23 INFO YarnClientSchedulerBackend: Application application_1434338096593_8055 has started running.  
15/07/15 16:45:23 INFO NettyBlockTransferService: Server created on 50871  
15/07/15 16:45:23 INFO BlockManagerMaster: Trying to register BlockManager  
15/07/15 16:45:23 INFO BlockManagerMasterActor: Registering block manager slave63:50871 with 1635.9 MB RAM, BlockManagerId(<driver>, slave63, 50871)  
15/07/15 16:45:23 INFO BlockManagerMaster: Registered BlockManager  
15/07/15 16:45:28 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:23892/user/Executor#469935313] with ID 1  
15/07/15 16:45:28 INFO RackResolver: Resolved slave52 to /rack2  
15/07/15 16:45:29 INFO BlockManagerMasterActor: Registering block manager slave52:36246 with 2.6 GB RAM, BlockManagerId(1, slave52, 36246)  
15/07/15 16:45:33 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:19749/user/Executor#-1474529488] with ID 4  
15/07/15 16:45:33 INFO RackResolver: Resolved slave63 to /rack2  
15/07/15 16:45:34 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:51624/user/Executor#1260742083] with ID 28  
15/07/15 16:45:34 INFO RackResolver: Resolved slave28 to /rack3  
15/07/15 16:45:34 INFO BlockManagerMasterActor: Registering block manager slave63:64068 with 2.6 GB RAM, BlockManagerId(4, slave63, 64068)  
15/07/15 16:45:35 INFO BlockManagerMasterActor: Registering block manager slave28:17967 with 2.6 GB RAM, BlockManagerId(28, slave28, 17967)  
15/07/15 16:45:36 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:57756/user/Executor#-1426187042] with ID 16  
15/07/15 16:45:36 INFO RackResolver: Resolved slave23 to /rack3  
15/07/15 16:45:37 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:35348/user/Executor#-1773874771] with ID 3  
15/07/15 16:45:37 INFO RackResolver: Resolved slave36 to /rack1  
15/07/15 16:45:37 INFO BlockManagerMasterActor: Registering block manager slave23:62605 with 2.6 GB RAM, BlockManagerId(16, slave23, 62605)  
15/07/15 16:45:38 INFO BlockManagerMasterActor: Registering block manager slave36:23663 with 2.6 GB RAM, BlockManagerId(3, slave36, 23663)  
15/07/15 16:45:39 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:43551/user/Executor#-576231312] with ID 11  
15/07/15 16:45:39 INFO RackResolver: Resolved slave15 to /rack3  
15/07/15 16:45:40 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:40681/user/Executor#-1501756719] with ID 29  
15/07/15 16:45:40 INFO RackResolver: Resolved slave37 to /rack1  
15/07/15 16:45:40 INFO BlockManagerMasterActor: Registering block manager slave15:55745 with 2.6 GB RAM, BlockManagerId(11, slave15, 55745)  
15/07/15 16:45:41 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:28665/user/Executor#1165917342] with ID 21  
15/07/15 16:45:41 INFO RackResolver: Resolved slave26 to /rack3  
15/07/15 16:45:41 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://sp[email protected]:37653/user/Executor#587407704] with ID 2  
15/07/15 16:45:41 INFO RackResolver: Resolved slave54 to /rack2  
15/07/15 16:45:41 INFO BlockManagerMasterActor: Registering block manager slave37:38747 with 2.6 GB RAM, BlockManagerId(29, slave37, 38747)  
15/07/15 16:45:42 INFO BlockManagerMasterActor: Registering block manager slave26:46197 with 2.6 GB RAM, BlockManagerId(21, slave26, 46197)  
15/07/15 16:45:42 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:64410/user/Executor#-1365579611] with ID 19  
15/07/15 16:45:42 INFO RackResolver: Resolved slave17 to /rack3  
15/07/15 16:45:42 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:15510/user/Executor#972094812] with ID 15  
15/07/15 16:45:42 INFO RackResolver: Resolved slave35 to /rack1  
15/07/15 16:45:42 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:36974/user/Executor#-597250789] with ID 26  
15/07/15 16:45:42 INFO RackResolver: Resolved slave55 to /rack2  
15/07/15 16:45:42 INFO BlockManagerMasterActor: Registering block manager slave54:18807 with 2.6 GB RAM, BlockManagerId(2, slave54, 18807)  
15/07/15 16:45:43 INFO YarnClientSchedulerBackend: SchedulerBackend is ready for scheduling beginning after waiting maxRegisteredResourcesWaitingTime: 30000(ms)  
15/07/15 16:45:43 INFO BlockManagerMasterActor: Registering block manager slave17:58808 with 2.6 GB RAM, BlockManagerId(19, slave17, 58808)  
15/07/15 16:45:43 INFO BlockManagerMasterActor: Registering block manager slave35:29737 with 2.6 GB RAM, BlockManagerId(15, slave35, 29737)  
15/07/15 16:45:43 INFO MemoryStore: ensureFreeSpace(261904) called with curMem=0, maxMem=1715396935  
15/07/15 16:45:43 INFO MemoryStore: Block broadcast_0 stored as values in memory (estimated size 255.8 KB, free 1635.7 MB)  
15/07/15 16:45:43 INFO BlockManagerMasterActor: Registering block manager slave55:29257 with 2.6 GB RAM, BlockManagerId(26, slave55, 29257)  
15/07/15 16:45:43 INFO MemoryStore: ensureFreeSpace(21065) called with curMem=261904, maxMem=1715396935  
15/07/15 16:45:43 INFO MemoryStore: Block broadcast_0_piece0 stored as bytes in memory (estimated size 20.6 KB, free 1635.7 MB)  
15/07/15 16:45:43 INFO BlockManagerInfo: Added broadcast_0_piece0 in memory on slave63:50871 (size: 20.6 KB, free: 1635.9 MB)  
15/07/15 16:45:43 INFO BlockManagerMaster: Updated info of block broadcast_0_piece0  
15/07/15 16:45:43 INFO SparkContext: Created broadcast 0 from textFile at SparkMain.java:31  
15/07/15 16:45:44 INFO FileInputFormat: Total input paths to process : 1  
15/07/15 16:45:44 INFO SparkContext: Starting job: collect at SparkMain.java:53  
15/07/15 16:45:44 INFO DAGScheduler: Registering RDD 2 (flatMapToPair at SparkMain.java:32)  
15/07/15 16:45:44 INFO DAGScheduler: Got job 0 (collect at SparkMain.java:53) with 2 output partitions (allowLocal=false)  
15/07/15 16:45:44 INFO DAGScheduler: Final stage: Stage 1(collect at SparkMain.java:53)  
15/07/15 16:45:44 INFO DAGScheduler: Parents of final stage: List(Stage 0)  
15/07/15 16:45:44 INFO DAGScheduler: Missing parents: List(Stage 0)  
15/07/15 16:45:44 INFO DAGScheduler: Submitting Stage 0 (FlatMappedRDD[2] at flatMapToPair at SparkMain.java:32), which has no missing parents  
15/07/15 16:45:44 INFO MemoryStore: ensureFreeSpace(3672) called with curMem=282969, maxMem=1715396935  
15/07/15 16:45:44 INFO MemoryStore: Block broadcast_1 stored as values in memory (estimated size 3.6 KB, free 1635.7 MB)  
15/07/15 16:45:44 INFO MemoryStore: ensureFreeSpace(2190) called with curMem=286641, maxMem=1715396935  
15/07/15 16:45:44 INFO MemoryStore: Block broadcast_1_piece0 stored as bytes in memory (estimated size 2.1 KB, free 1635.7 MB)  
15/07/15 16:45:44 INFO BlockManagerInfo: Added broadcast_1_piece0 in memory on slave63:50871 (size: 2.1 KB, free: 1635.9 MB)  
15/07/15 16:45:44 INFO BlockManagerMaster: Updated info of block broadcast_1_piece0  
15/07/15 16:45:44 INFO SparkContext: Created broadcast 1 from broadcast at DAGScheduler.scala:838  
15/07/15 16:45:44 INFO DAGScheduler: Submitting 2 missing tasks from Stage 0 (FlatMappedRDD[2] at flatMapToPair at SparkMain.java:32)  
15/07/15 16:45:44 INFO YarnClientClusterScheduler: Adding task set 0.0 with 2 tasks  
15/07/15 16:45:44 INFO RackResolver: Resolved slave38 to /rack1  
15/07/15 16:45:44 INFO TaskSetManager: Starting task 0.0 in stage 0.0 (TID 0, slave63, NODE_LOCAL, 1340 bytes)  
15/07/15 16:45:44 INFO TaskSetManager: Starting task 1.0 in stage 0.0 (TID 1, slave63, NODE_LOCAL, 1340 bytes)  
15/07/15 16:45:45 INFO BlockManagerInfo: Added broadcast_1_piece0 in memory on slave63:64068 (size: 2.1 KB, free: 2.6 GB)  
15/07/15 16:45:45 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:57998/user/Executor#-1382810865] with ID 12  
15/07/15 16:45:45 INFO RackResolver: Resolved slave65 to /rack2  
15/07/15 16:45:45 INFO BlockManagerInfo: Added broadcast_0_piece0 in memory on slave63:64068 (size: 20.6 KB, free: 2.6 GB)  
15/07/15 16:45:46 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:59085/user/Executor#-1064055348] with ID 13  
15/07/15 16:45:46 INFO RackResolver: Resolved slave53 to /rack2  
15/07/15 16:45:46 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:24319/user/Executor#-139207262] with ID 14  
15/07/15 16:45:46 INFO RackResolver: Resolved slave62 to /rack2  
15/07/15 16:45:46 INFO BlockManagerMasterActor: Registering block manager slave65:64372 with 2.6 GB RAM, BlockManagerId(12, slave65, 64372)  
15/07/15 16:45:47 INFO BlockManagerMasterActor: Registering block manager slave62:53823 with 2.6 GB RAM, BlockManagerId(14, slave62, 53823)  
15/07/15 16:45:47 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:28461/user/Executor#-2071109973] with ID 20  
15/07/15 16:45:47 INFO RackResolver: Resolved slave32 to /rack1  
15/07/15 16:45:47 INFO BlockManagerMasterActor: Registering block manager slave53:60055 with 2.6 GB RAM, BlockManagerId(13, slave53, 60055)  
15/07/15 16:45:47 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:35963/user/Executor#148583350] with ID 22  
15/07/15 16:45:47 INFO RackResolver: Resolved slave14 to /rack3  
15/07/15 16:45:48 INFO BlockManagerMasterActor: Registering block manager slave32:35445 with 2.6 GB RAM, BlockManagerId(20, slave32, 35445)  
15/07/15 16:45:48 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:63661/user/Executor#1541284948] with ID 24  
15/07/15 16:45:48 INFO RackResolver: Resolved slave43 to /rack1  
15/07/15 16:45:48 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:46267/user/Executor#-1437439698] with ID 10  
15/07/15 16:45:48 INFO RackResolver: Resolved slave33 to /rack1  
15/07/15 16:45:48 INFO BlockManagerMasterActor: Registering block manager slave43:34953 with 2.6 GB RAM, BlockManagerId(24, slave43, 34953)  
15/07/15 16:45:49 INFO BlockManagerMasterActor: Registering block manager slave14:53473 with 2.6 GB RAM, BlockManagerId(22, slave14, 53473)  
15/07/15 16:45:49 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:26170/user/Executor#794862330] with ID 5  
15/07/15 16:45:49 INFO RackResolver: Resolved slave42 to /rack1  
15/07/15 16:45:49 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:35394/user/Executor#1035079905] with ID 18  
15/07/15 16:45:49 INFO RackResolver: Resolved slave44 to /rack1  
15/07/15 16:45:49 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:52328/user/Executor#1181615525] with ID 30  
15/07/15 16:45:49 INFO RackResolver: Resolved slave16 to /rack3  
15/07/15 16:45:49 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:48403/user/Executor#-1103053012] with ID 27  
15/07/15 16:45:49 INFO RackResolver: Resolved slave13 to /rack3  
15/07/15 16:45:49 INFO BlockManagerMasterActor: Registering block manager slave42:60923 with 2.6 GB RAM, BlockManagerId(5, slave42, 60923)  
15/07/15 16:45:50 INFO BlockManagerMasterActor: Registering block manager slave44:30133 with 2.6 GB RAM, BlockManagerId(18, slave44, 30133)  
15/07/15 16:45:50 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:63922/user/Executor#-917535710] with ID 6  
15/07/15 16:45:50 INFO RackResolver: Resolved slave45 to /rack1  
15/07/15 16:45:50 INFO BlockManagerMasterActor: Registering block manager slave16:21970 with 2.6 GB RAM, BlockManagerId(30, slave16, 21970)  
15/07/15 16:45:50 INFO BlockManagerMasterActor: Registering block manager slave13:57504 with 2.6 GB RAM, BlockManagerId(27, slave13, 57504)  
15/07/15 16:45:50 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:18514/user/Executor#-799832935] with ID 25  
15/07/15 16:45:50 INFO RackResolver: Resolved slave25 to /rack3  
15/07/15 16:45:51 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:64380/user/Executor#-520443684] with ID 9  
15/07/15 16:45:51 INFO RackResolver: Resolved slave27 to /rack3  
15/07/15 16:45:51 INFO BlockManagerMasterActor: Registering block manager slave25:16330 with 2.6 GB RAM, BlockManagerId(25, slave25, 16330)  
15/07/15 16:45:51 INFO BlockManagerMasterActor: Registering block manager slave45:63841 with 2.6 GB RAM, BlockManagerId(6, slave45, 63841)  
15/07/15 16:45:51 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:46357/user/Executor#1463308812] with ID 8  
15/07/15 16:45:51 INFO RackResolver: Resolved slave24 to /rack3  
15/07/15 16:45:51 INFO TaskSetManager: Finished task 1.0 in stage 0.0 (TID 1) in 7633 ms on slave63 (1/2)  
15/07/15 16:45:51 INFO BlockManagerMasterActor: Registering block manager slave33:50916 with 2.6 GB RAM, BlockManagerId(10, slave33, 50916)  
15/07/15 16:45:52 INFO TaskSetManager: Finished task 0.0 in stage 0.0 (TID 0) in 7804 ms on slave63 (2/2)  
15/07/15 16:45:52 INFO DAGScheduler: Stage 0 (flatMapToPair at SparkMain.java:32) finished in 7.810 s  
15/07/15 16:45:52 INFO YarnClientClusterScheduler: Removed TaskSet 0.0, whose tasks have all completed, from pool   
15/07/15 16:45:52 INFO DAGScheduler: looking for newly runnable stages  
15/07/15 16:45:52 INFO DAGScheduler: running: Set()  
15/07/15 16:45:52 INFO DAGScheduler: waiting: Set(Stage 1)  
15/07/15 16:45:52 INFO DAGScheduler: failed: Set()  
15/07/15 16:45:52 INFO DAGScheduler: Missing parents for Stage 1: List()  
15/07/15 16:45:52 INFO DAGScheduler: Submitting Stage 1 (ShuffledRDD[3] at reduceByKey at SparkMain.java:44), which is now runnable  
15/07/15 16:45:52 INFO MemoryStore: ensureFreeSpace(2232) called with curMem=288831, maxMem=1715396935  
15/07/15 16:45:52 INFO MemoryStore: Block broadcast_2 stored as values in memory (estimated size 2.2 KB, free 1635.7 MB)  
15/07/15 16:45:52 INFO MemoryStore: ensureFreeSpace(1403) called with curMem=291063, maxMem=1715396935  
15/07/15 16:45:52 INFO MemoryStore: Block broadcast_2_piece0 stored as bytes in memory (estimated size 1403.0 B, free 1635.7 MB)  
15/07/15 16:45:52 INFO BlockManagerInfo: Added broadcast_2_piece0 in memory on slave63:50871 (size: 1403.0 B, free: 1635.9 MB)  
15/07/15 16:45:52 INFO BlockManagerMaster: Updated info of block broadcast_2_piece0  
15/07/15 16:45:52 INFO SparkContext: Created broadcast 2 from broadcast at DAGScheduler.scala:838  
15/07/15 16:45:52 INFO DAGScheduler: Submitting 2 missing tasks from Stage 1 (ShuffledRDD[3] at reduceByKey at SparkMain.java:44)  
15/07/15 16:45:52 INFO YarnClientClusterScheduler: Adding task set 1.0 with 2 tasks  
15/07/15 16:45:52 INFO TaskSetManager: Starting task 0.0 in stage 1.0 (TID 2, slave63, PROCESS_LOCAL, 1121 bytes)  
15/07/15 16:45:52 INFO TaskSetManager: Starting task 1.0 in stage 1.0 (TID 3, slave26, PROCESS_LOCAL, 1121 bytes)  
15/07/15 16:45:52 INFO BlockManagerInfo: Added broadcast_2_piece0 in memory on slave63:64068 (size: 1403.0 B, free: 2.6 GB)  
15/07/15 16:45:52 INFO MapOutputTrackerMasterActor: Asked to send map output locations for shuffle 0 to [email protected]:19749  
15/07/15 16:45:52 INFO MapOutputTrackerMaster: Size of output statuses for shuffle 0 is 147 bytes  
15/07/15 16:45:52 INFO BlockManagerMasterActor: Registering block manager slave27:35965 with 2.6 GB RAM, BlockManagerId(9, slave27, 35965)  
15/07/15 16:45:52 INFO TaskSetManager: Finished task 0.0 in stage 1.0 (TID 2) in 159 ms on slave63 (1/2)  
15/07/15 16:45:52 INFO YarnClientSchedulerBackend: Registered executor: Actor[akka.tcp://[email protected]:54423/user/Executor#495118309] with ID 7  
15/07/15 16:45:52 INFO RackResolver: Resolved slave18 to /rack3  
15/07/15 16:45:52 INFO BlockManagerMasterActor: Registering block manager slave24:57590 with 2.6 GB RAM, BlockManagerId(8, slave24, 57590)  
15/07/15 16:45:53 INFO BlockManagerMasterActor: Registering block manager slave18:51244 with 2.6 GB RAM, BlockManagerId(7, slave18, 51244)  
15/07/15 16:45:53 INFO BlockManagerInfo: Added broadcast_2_piece0 in memory on slave26:46197 (size: 1403.0 B, free: 2.6 GB)  
15/07/15 16:45:53 INFO MapOutputTrackerMasterActor: Asked to send map output locations for shuffle 0 to [email protected]:28665  
15/07/15 16:45:53 INFO TaskSetManager: Finished task 1.0 in stage 1.0 (TID 3) in 1605 ms on slave26 (2/2)  
15/07/15 16:45:53 INFO DAGScheduler: Stage 1 (collect at SparkMain.java:53) finished in 1.612 s  
15/07/15 16:45:53 INFO YarnClientClusterScheduler: Removed TaskSet 1.0, whose tasks have all completed, from pool   
15/07/15 16:45:53 INFO DAGScheduler: Job 0 finished: collect at SparkMain.java:53, took 9.550722 s  
So,1  
up.He,1  
are,1  
got,1  
decided,1  
bunch,1  
his,1  
few,1  
away,1  
backed,1  
said��I,1  
They,1  
air,,1  
ripe,1  
am,1  
never,1  
One,1  
tried,1  
last,1  
feeling,1  
with,1  
day,1  
start,,1  
One,,1  
again,,2  
paces,,1  
three,,1  
they,1  
just,1  
again,1  
still,,1  
two,,1  
grapes.,1  
walked,2  
summer,1  
walking,1  
running,1  
up,2  
not,1  
it,1  
He,1  
fox,2  
orchard.,1  
succeeded.,1  
was,1  
sour.��,1  
grapes.The,1  
a,4  
stopped,1  
nose,1  
At,1  
missed,1  
before,1  
to,1  
back.,1  
sure,1  
he,5  
through,1  
thirsty,",1  
in,1  
could,1  
grapes.He,1  
of,1  
hot,1  
juicy."I'm,1  
were,1  
reach,1  
an,1  
but,3  
jumped,2  
and,3  
up,,1  
give,1  
thought.,1  
the,3  
15/07/15 16:45:53 INFO SparkUI: Stopped Spark web UI at http://slave63:4040  
15/07/15 16:45:53 INFO DAGScheduler: Stopping DAGScheduler  
15/07/15 16:45:53 INFO YarnClientSchedulerBackend: Shutting down all executors  
15/07/15 16:45:53 INFO YarnClientSchedulerBackend: Asking each executor to shut down  
15/07/15 16:45:53 INFO YarnClientSchedulerBackend: Stopped  
15/07/15 16:45:54 INFO MapOutputTrackerMasterActor: MapOutputTrackerActor stopped!  
15/07/15 16:45:54 INFO MemoryStore: MemoryStore cleared  
15/07/15 16:45:54 INFO BlockManager: BlockManager stopped  
15/07/15 16:45:54 INFO BlockManagerMaster: BlockManagerMaster stopped  
15/07/15 16:45:54 INFO SparkContext: Successfully stopped SparkContext  
15/07/15 16:45:54 INFO RemoteActorRefProvider$RemotingTerminator: Shutting down remote daemon.  
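Two figures in the log can be cross-checked against the settings in the code. The AM container line reports 3456 MB including 384 MB overhead, which matches the 3G given to spark.driver.memory (3072 MB) plus that overhead. The driver's MemoryStore reports maxMem=1715396935 bytes (1635.9 MB), which is consistent with a storage pool of heap × 0.6 × 0.9, assuming the Spark 1.x defaults of spark.storage.memoryFraction = 0.6 and a 0.9 safety fraction. The sketch below only redoes this arithmetic; the class name, method names and the two fractions are assumptions for illustration, not values read from the cluster.

```java
// Hypothetical helper that re-derives two numbers seen in the log.
class MemoryCheck {

    // Assumed Spark 1.x defaults; not taken from the article's configuration.
    static final double STORAGE_FRACTION = 0.6;
    static final double SAFETY_FRACTION = 0.9;

    // Container size requested from YARN: heap plus the overhead shown in the log.
    static long amContainerMb(long memoryMb, long overheadMb) {
        return memoryMb + overheadMb;
    }

    // Heap size (in MB) implied by a MemoryStore capacity given in bytes.
    static double impliedHeapMb(long storageBytes) {
        double heapBytes = storageBytes / (STORAGE_FRACTION * SAFETY_FRACTION);
        return heapBytes / (1024.0 * 1024.0);
    }

    public static void main(String[] args) {
        // 3G driver memory plus the 384 MB overhead reported in the log
        System.out.println("AM container MB: " + amContainerMb(3 * 1024, 384));
        // maxMem value copied from the MemoryStore log lines
        System.out.println("Implied driver heap MB: " + impliedHeapMb(1715396935L));
    }
}
```

The implied driver heap comes out at roughly 3029.5 MB, a little under the configured 3072 MB, which is expected since the JVM usually reports slightly less usable heap than the configured maximum. The same reasoning would put the executors' 2.6 GB block managers slightly below 5G × 0.54.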
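Copyright notice: This is an original article by the blogger, licensed under CC 4.0 BY-SA. When reposting, please include a link to the original source and this notice.
Article link: https://blog.csdn.net/a772304419/article/details/119576595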
