Repository URL to install this package:
| 
      
        
        
        Version: 
        
         
          
          3.4.52  ▾
        
         | 
ó
çEYc           @   sB   d  d l  Z  d  d l Z d d l m Z d e f d     YZ d S(   iÿÿÿÿNi   (   t   ProbingStatet
   CharSetProberc           B   s   e  Z d  Z d
 d  Z d   Z e d    Z d   Z e d    Z	 d   Z
 e d    Z e d    Z
 e d	    Z RS(   gffffffî?c         C   s(   d  |  _ | |  _ t j t  |  _ d  S(   N(   t   Nonet   _statet   lang_filtert   loggingt	   getLoggert   __name__t   logger(   t   selfR   (    (    sJ   /home/tvault/.virtenv/lib/python2.7/site-packages/chardet/charsetprober.pyt   __init__'   s    		c         C   s   t  j |  _ d  S(   N(   R    t	   DETECTINGR   (   R	   (    (    sJ   /home/tvault/.virtenv/lib/python2.7/site-packages/chardet/charsetprober.pyt   reset,   s    c         C   s   d  S(   N(   R   (   R	   (    (    sJ   /home/tvault/.virtenv/lib/python2.7/site-packages/chardet/charsetprober.pyt   charset_name/   s    c         C   s   d  S(   N(    (   R	   t   buf(    (    sJ   /home/tvault/.virtenv/lib/python2.7/site-packages/chardet/charsetprober.pyt   feed3   s    c         C   s   |  j  S(   N(   R   (   R	   (    (    sJ   /home/tvault/.virtenv/lib/python2.7/site-packages/chardet/charsetprober.pyt   state6   s    c         C   s   d S(   Ng        (    (   R	   (    (    sJ   /home/tvault/.virtenv/lib/python2.7/site-packages/chardet/charsetprober.pyt   get_confidence:   s    c         C   s   t  j d d |   }  |  S(   Ns   ([ -])+t    (   t   ret   sub(   R   (    (    sJ   /home/tvault/.virtenv/lib/python2.7/site-packages/chardet/charsetprober.pyt   filter_high_byte_only=   s    c         C   sz   t    } t j d |   } xX | D]P } | j | d   | d } | j   re | d k  re d } n  | j |  q" W| S(   s5  
        We define three types of bytes:
        alphabet: english alphabets [a-zA-Z]
        international: international characters [-ÿ]
        marker: everything else [^a-zA-Z-ÿ]
        The input buffer can be thought to contain a series of words delimited
        by markers. This function works to filter all words that contain at
        least one international character. All contiguous sequences of markers
        are replaced by a single space ascii character.
        This filter applies to all scripts which do not use English characters.
        s%   [a-zA-Z]*[-ÿ]+[a-zA-Z]*[^a-zA-Z-ÿ]?iÿÿÿÿs   R   (   t	   bytearrayR   t   findallt   extendt   isalpha(   R   t   filteredt   wordst   wordt	   last_char(    (    sJ   /home/tvault/.virtenv/lib/python2.7/site-packages/chardet/charsetprober.pyt   filter_international_wordsB   s    			
	c         C   sè   t    } t } d } x² t t |    D] } |  | | d !} | d k rT t } n | d k ri t } n  | d k  r( | j   r( | | k r¹ | r¹ | j |  | | ! | j d  n  | d } q( q( W| sä | j |  |  n  | S(   sÈ  
        Returns a copy of ``buf`` that retains only the sequences of English
        alphabet and high byte characters that are not between <> characters.
        Also retains English alphabet and high byte characters immediately
        before occurrences of >.
        This filter can be applied to all scripts which contain both English
        characters and extended ASCII characters, but is currently only used by
        ``Latin1Prober``.
        i    i   t   >t   <s   R   (   R   t   Falset   ranget   lent   TrueR   R   (   R   R   t   in_tagt   prevt   currt   buf_char(    (    sJ   /home/tvault/.virtenv/lib/python2.7/site-packages/chardet/charsetprober.pyt   filter_with_english_lettersg   s"    			N(   R   t
   __module__t   SHORTCUT_THRESHOLDR   R
   R   t   propertyR
   R   R   R   t   staticmethodR   R   R)   (    (    (    sJ   /home/tvault/.virtenv/lib/python2.7/site-packages/chardet/charsetprober.pyR   #   s   			%(   R   R   t   enumsR    t   objectR   (    (    (    sJ   /home/tvault/.virtenv/lib/python2.7/site-packages/chardet/charsetprober.pyt   <module>   s