当前位置: 首页>>代码示例>>Python>>正文


Python Utilities.getDataFile方法代码示例

本文整理汇总了Python中utilities.Utilities.getDataFile方法的典型用法代码示例。如果您正苦于以下问题:Python Utilities.getDataFile方法的具体用法?Python Utilities.getDataFile怎么用?Python Utilities.getDataFile使用的例子?那么恭喜您, 这里精选的方法代码示例或许可以为您提供帮助。您也可以进一步了解该方法所在utilities.Utilities的用法示例。


在下文中一共展示了Utilities.getDataFile方法的3个代码示例,这些例子默认根据受欢迎程度排序。您可以为喜欢或者感觉有用的代码点赞,您的评价将有助于系统推荐出更棒的Python代码示例。

示例1: generateRawDataForGivenSetOfUsers

# 需要导入模块: from utilities import Utilities [as 别名]
# 或者: from utilities.Utilities import getDataFile [as 别名]
 def generateRawDataForGivenSetOfUsers(numberOfUsersList):
     for numberOfUsers in numberOfUsersList:
         currentTime = Settings.startTime
         allExpertsTop, allExpertsBottom = (
             ExpertUsers(number=numberOfUsers),
             ExpertUsers(number=numberOfUsers, type=ExpertUsers.typeBottom),
         )
         allExpertsList = {}
         for k, v in allExpertsTop.list.iteritems():
             allExpertsList[k] = v
         for k, v in allExpertsBottom.list.iteritems():
             allExpertsList[k] = v
         while currentTime <= Settings.endTime:
             trainingFile = Utilities.getTrainingFile(currentTime, DocumentType.typeRaw, numberOfUsers)
             testFile = Utilities.getTestFile(currentTime, DocumentType.typeRaw, numberOfUsers, bottom=True)
             Utilities.createDirectory(trainingFile), Utilities.createDirectory(testFile)
             print numberOfUsers, Settings.twitterUsersTweetsFolder + "%s.gz" % Utilities.getDataFile(currentTime)
             for tweet in CreateTrainingAndTestSets.getTweetsFromExperts(
                 allExpertsList, Settings.twitterUsersTweetsFolder + "%s.gz" % Utilities.getDataFile(currentTime)
             ):
                 tweet["class"] = allExpertsList[tweet["user"]["id_str"]]["class"]
                 if tweet["user"]["id_str"] in allExpertsTop.list:
                     Utilities.writeAsJsonToFile(tweet, trainingFile)
                 else:
                     Utilities.writeAsJsonToFile(tweet, testFile)
             currentTime += timedelta(days=1)
开发者ID:kykamath,项目名称:twitter_classifier,代码行数:28,代码来源:datasets.py

示例2: generateRawDataForIntermediateSetOfUsers

# 需要导入模块: from utilities import Utilities [as 别名]
# 或者: from utilities.Utilities import getDataFile [as 别名]
 def generateRawDataForIntermediateSetOfUsers():
     currentTime = Settings.startTime
     allExpertsTop = ExpertUsers(Settings.numberOfExperts)
     allExpertsIntermediate = ExpertUsers(Settings.numberOfExpertsSecondSet)
     allExpertsList = {}
     for k, v in allExpertsTop.list.iteritems():
         del allExpertsIntermediate.list[k]
     for k, v in allExpertsIntermediate.list.iteritems():
         allExpertsList[k] = v
     while currentTime <= Settings.endTime:
         for numberOfExperts in [Settings.numberOfExpertsSecondSet]:
             trainingFile = Utilities.getTrainingFile(currentTime, DocumentType.typeRaw, numberOfExperts)
             Utilities.createDirectory(trainingFile)
             print numberOfExperts, Settings.twitterUsersTweetsFolder + "%s.gz" % Utilities.getDataFile(currentTime)
             for tweet in CreateTrainingAndTestSets.getTweetsFromExperts(
                 allExpertsList, Settings.twitterUsersTweetsFolder + "%s.gz" % Utilities.getDataFile(currentTime)
             ):
                 tweet["class"] = allExpertsList[tweet["user"]["id_str"]]["class"]
                 Utilities.writeAsJsonToFile(tweet, trainingFile)
         currentTime += timedelta(days=1)
开发者ID:kykamath,项目名称:twitter_classifier,代码行数:22,代码来源:datasets.py

示例3: generateDataSetStats

# 需要导入模块: from utilities import Utilities [as 别名]
# 或者: from utilities.Utilities import getDataFile [as 别名]
 def generateDataSetStats():
     currentDay = Settings.startTime
     expertUsers = ExpertUsers()
     allExpertsList={}
     for k, v in expertUsers.list.iteritems(): allExpertsList[k]=v
     while currentDay<=Settings.endTime:
         data = {'day': datetime.strftime(currentDay, Settings.twitter_api_time_format), 'classes': defaultdict(int), 'total_tweets': 0}
         for tweet in CreateTrainingAndTestSets.getTweetsFromExperts(allExpertsList, Settings.twitterUsersTweetsFolder+'%s.gz'%Utilities.getDataFile(currentDay)):
             if tweet['user']['id_str'] in expertUsers.list: 
                 classType = allExpertsList[tweet['user']['id_str']]['class']
                 data['classes'][classType]+=1
                 data['total_tweets']+=1
         Utilities.writeAsJsonToFile(data, Settings.stats_for_dataset)
         currentDay+=timedelta(days=1)
开发者ID:kykamath,项目名称:twitter_classifier,代码行数:16,代码来源:experiments.py


注:本文中的utilities.Utilities.getDataFile方法示例由纯净天空整理自Github/MSDocs等开源代码及文档管理平台,相关代码片段筛选自各路编程大神贡献的开源项目,源码版权归原作者所有,传播和使用请参考对应项目的License;未经允许,请勿转载。