INDICATOR_NONE
INDICATOR_NONE
An indicator to tell no actions to be taken
Library of functions used to fetch Git internal urls
setGitRepositoryUrl(string $url_to_check, integer $counter, array $seeds, array $repository_indicator, array $site_value, integer $total_git_urls, array $all_git_urls) : array
Sets up the seed sites with urls from a git repository (updates these sites if have already started downloading from repository)
string | $url_to_check | url needs to be processed |
integer | $counter | to keep track of number of urls processed |
array | $seeds | store sites which are ready to be downloaded |
array | $repository_indicator | indicates the type of the repository |
array | $site_value | contains original Git url crawled |
integer | $total_git_urls | number of urls in repository less those already processed |
array | $all_git_urls | current list of urls from git repository |
$git_internal_urls containing all the internal Git urls fetched from the parent Git url
fetchGitRepositoryUrl(string $url_to_check) : \seekquarry\yioop\library\an
Get the Git internal urls from the parent Git url
string | $url_to_check | url needs to be processed |
array $git_next_urls consists of list of Git internal urls wich are called during the git clone
getGitMasterFile(string $git_first_url_content, string $git_base_url) : string
Get the Git second url which points to Git master tree structure
string | $git_first_url_content | contents of Git first url |
string | $git_base_url | common portion of Git urls |
$git_next_url consists of second internal Git url
getGitMasterTree(string $git_second_url_content, string $git_base_url) : string
Get the Git third url which contains the information about the organization of entire git repository
string | $git_second_url_content | contents of Git second url |
string | $git_base_url | common portion of git urls |
$git_next_url consists of third internal git url
getNextGitUrl(string $git_url, string $compression_indicator) : string
Get the Git content from url which will be used to get the next git url
string | $git_url | git url to extract contents from it |
string | $compression_indicator | indicator for compress and uncompress contents |
$git_object_content consists contents extracted from the url
getObjects(string $git_object_content, string $git_base_url) : array
Get the Git blob and tree objects
string | $git_object_content | compressed content of git master tree file |
string | $git_base_url | common content of git url |
$blob_url contains information and url for git blob objects
checkPosition(string $git_blob_position, string $git_tree_position, string $git_object_content) : array
checks the position of access code for null values
string | $git_blob_position | first occuence of git blob access code |
string | $git_tree_position | first occuence of git tree access code |
string | $git_object_content | compressed content of git master tree |
$git_object_positions length of the compressed content afterthe access code
readBlobSha(string $git_object_content, string $blob_position, string $length, string $git_base_url) : array
Get the details of the blob file i.e blob file name, sha hash and content
string | $git_object_content | compressed content of git master tree |
string | $blob_position | first occuence of git blob access code in $content |
string | $length | length of the compressed content of git master tree |
string | $git_base_url | common portion of git url |
$git_blob_content contains details of git blob object
readTreeSha(string $git_object_content, string $tree_position, string $length, string $git_base_url) : array
Get the details of the tree file i.e folder name, sha hash and blob url inside the tree
string | $git_object_content | compressed content of git master tree |
string | $tree_position | first occuence of git tree access code in the $content |
string | $length | length of the compressed content of git master tree |
string | $git_base_url | common portion of git url |
$git_tree_content contains details of git blob object
checkNestedStructure(string $sha_hash, string $git_base_url) : string
Checks the nested structure inside git tree object
string | $sha_hash | sha of the git tree object |
string | $git_base_url | common portion of the parent git url |
$blob_url contains url of the blob file inside the folder